0
0
bleve/analysis/language/cjk
Marty Schoch 043a3bfb7c change cjk analyzer to use unicode tokenizer
change cjk bigram analyzer to work with multi-rune terms
add cjk width filter replaces full unicode normailzation

these changes make the cjk analyzer behave more like elasticsearch
they also remove the depenency on the whitespace analyzer
which is now free to also behave more like lucene/es

fixes #33
2016-06-10 13:04:40 -04:00
..
analyzer_cjk_test.go change cjk analyzer to use unicode tokenizer 2016-06-10 13:04:40 -04:00
analyzer_cjk.go change cjk analyzer to use unicode tokenizer 2016-06-10 13:04:40 -04:00
cjk_bigram_test.go change cjk analyzer to use unicode tokenizer 2016-06-10 13:04:40 -04:00
cjk_bigram.go change cjk analyzer to use unicode tokenizer 2016-06-10 13:04:40 -04:00
cjk_width_test.go change cjk analyzer to use unicode tokenizer 2016-06-10 13:04:40 -04:00
cjk_width.go change cjk analyzer to use unicode tokenizer 2016-06-10 13:04:40 -04:00