bleve

Author	SHA1	Message	Date
Salmān Aljammāz	9444af9366	arabic: add unicode normalization to analyzer	2015-02-06 19:50:58 +03:00
Salmān Aljammāz	91a8d5da9f	arabic: check minimum length before stemming. This involves converting tokens to a rune slice in the filter, but at least we're now compatible with Lucene's stemmer.	2015-02-06 19:50:58 +03:00
Salmān Aljammāz	0470f93955	arabic: add more stemmer tests. These came from org.apache.lucene.analysis.ar.	2015-02-06 19:49:30 +03:00
Salmān Aljammāz	e461fed92a	arabic stemmer: strip multiple suffixes updates #150	2015-02-05 16:07:58 +03:00
Marty Schoch	4be974f489	added first implementation of arabic analyzer. one test case is not passing and is commented out temporarily. updates #150	2015-02-05 07:44:55 -05:00
Salmān Aljammāz	945ef8158f	add arabic light stemmer fixes #28 updates #150	2015-02-05 13:24:30 +03:00
Marty Schoch	1dc466a800	modified token filters to avoid creating new token stream. often the result stream was the same length, so can reuse the existing token stream. also, in cases where a new stream was required, set capacity to the length of the input stream. most output streams are at least as long as the input, so this may avoid some subsequent resizing	2014-09-23 18:41:32 -04:00
Marty Schoch	d534b0836b	converted ALL_CAPS constants to CamelCase	2014-09-03 17:48:40 -04:00
Marty Schoch	7a7eb2e94c	add newline between license and package. this avoids cluttering godocs with the license	2014-09-02 10:54:50 -04:00
Marty Schoch	1161361bea	rename imports from couchbaselabs to blevesearch	2014-08-28 15:38:57 -04:00
Marty Schoch	c526a38369	major refactor of analysis files, now wired up to registry. ultimately this is to make it more convenient for us to wire up different elements of the analysis pipeline, without having to preload everything into memory before we need it. separately, the index layer now has a mechanism for storing internal key/value pairs. this is expected to be used to store the mapping, and possibly other pieces of data by the top layer, but not exposed to the user at the top.	2014-08-13 21:14:47 -04:00

11 Commits