![]() many of these defaults were arbitrary, and not having defaults lets us more easily flag them for configuration added a shingle filter introduce new toke type for shingles |
||
---|---|---|
.. | ||
regexp_tokenizer | ||
single_token | ||
unicode_word_boundary | ||
whitespace_tokenizer |