0
0
bleve/analysis
Marty Schoch 9e9f172f81 speed up english possessive filter
previous impl always did full utf8 decode of rune
if we assume most tokens are not possessive this is unnecessary
and even if they are, we only need to chop off last to runes
so, now we only decode last rune of token, and if it looks like
s/S then we proceed to decode second to last rune, and then
only if it looks like any form of apostrophe, do we make any
changes to token, again by just reslicing original to chop
off the possessive extension
2016-09-11 12:55:03 -04:00
..
analyzers change "simple" analyzer to use "letter" tokenizer 2016-03-31 15:13:17 -04:00
char_filters add newline between license and package 2014-09-02 10:54:50 -04:00
datetime_parsers major refactor of bleve configuration 2015-09-16 17:10:59 -04:00
language speed up english possessive filter 2016-09-11 12:55:03 -04:00
token_filters add couchbase copyright and license now that CLA has been signed 2016-06-10 13:08:50 -04:00
token_map token_map: document it along with stop_token_filter 2015-11-05 14:07:54 +01:00
tokenizers updated whtitepsace to behave more like lucene/es 2016-06-10 15:30:43 -04:00
benchmark_test.go analyze locations only if includeTermVectors enabled 2016-01-05 12:46:46 -08:00
freq_test.go gofmt simplifications 2016-04-02 21:54:33 -04:00
freq.go copy locations on merge for more safe/predictable behavior 2016-01-19 14:21:48 -05:00
test_words.txt major refactor of analysis files, now wired up to registry 2014-08-13 21:14:47 -04:00
token_map_test.go fix issues identified by errcheck 2015-04-07 14:52:00 -04:00
token_map.go added some godoc documentation for the en analyzer 2015-11-18 15:28:57 +13:00
type.go Implemented boolean field support 2016-01-11 17:18:03 -08:00
util_test.go add newline between license and package 2014-09-02 10:54:50 -04:00
util.go fix issues with lucene stemmer 2015-03-11 11:14:29 -04:00