Salmān Aljammāz
9444af9366
arabic: add unicode normalization to analyzer
2015-02-06 19:50:58 +03:00
Salmān Aljammāz
91a8d5da9f
arabic: check minimum length before stemming
...
This invloves converting tokens to a rune slice in the filter, but
at least we're now compatable with Lucene's stemmer.
2015-02-06 19:50:58 +03:00
Salmān Aljammāz
0470f93955
arabic: add more stemmer tests
...
These came from org.apache.lucene.analysis.ar.
2015-02-06 19:49:30 +03:00
Marty Schoch
15139b8fa5
fix panic on mapping value containing null
...
added testcase to reproduce initial issue
closes #153
2015-02-05 16:15:05 -05:00
Marty Schoch
41cd64337b
Merge pull request #155 from saljam/ar_stemmer
...
arabic stemmer: strip multiple suffixes
2015-02-05 08:13:15 -05:00
Salmān Aljammāz
e461fed92a
arabic stemmer: strip multiple suffixes
...
updates #150
2015-02-05 16:07:58 +03:00
Marty Schoch
4be974f489
added first implementation of arabic analyzer
...
one test cases is not passing and is commented out temporarily
updates #150
2015-02-05 07:44:55 -05:00
Marty Schoch
b9c22fe50d
Merge pull request #154 from saljam/arabic
...
add arabic light stemmer
2015-02-05 07:09:54 -05:00
Salmān Aljammāz
945ef8158f
add arabic light stemmer
...
fixes #28
updates #150
2015-02-05 13:24:30 +03:00
Marty Schoch
dd1cd189a7
added initial implementation of hindi analyzer
...
closes #66
2015-02-04 15:12:08 -05:00
Steve Yen
38ee9be353
added some batch size 1000 microbenchmarks
2015-01-30 15:58:39 -08:00
Steve Yen
7d6a6aeaa8
single append for inmem KVStore batch
2015-01-29 11:14:08 -08:00
Steve Yen
5a30d36b17
cznicb KVStore uses Put() for faster read-modify-write
2015-01-29 11:02:01 -08:00
Steve Yen
b054cddf76
gtreap KVStore does 1 append for batch Set/Delete
2015-01-29 10:49:39 -08:00
Steve Yen
05d222f490
cznicb KVStore batch uses <2 appends per Set/Delete
2015-01-29 10:22:13 -08:00
Steve Yen
c5c59e61f4
make leveldb faster with non-zero sized batch
2015-01-29 10:20:26 -08:00
Steve Yen
1c1774d4ad
throw away data even faster in null KVStore
2015-01-29 10:17:21 -08:00
Marty Schoch
a9f153bac7
fix typo in unicode normalization form constant
...
also adjusted incorrect tests
fixes #149
2015-01-26 14:09:20 -05:00
Marty Schoch
ce44378b9c
only add highlighted fragments to map if successfully built some
2015-01-22 17:45:13 -05:00
Marty Schoch
a69fa1e91d
adding tests based on problems found with fosdem dataset
2015-01-22 09:57:26 -05:00
Marty Schoch
0ed47f5343
fix advance logic to not skip over result
2015-01-22 09:56:40 -05:00
Marty Schoch
5a09ceeac8
fix traversal logic when not in expected order
2015-01-22 09:56:21 -05:00
Marty Schoch
e59ed148c9
avoid crash when requesting more facets then available
2015-01-22 09:56:00 -05:00
Marty Schoch
944fae27f0
parse size and field for facets, print facet results
2015-01-22 09:55:20 -05:00
Marty Schoch
8891a3688d
mapping should consult default mapping for analyzers
2015-01-22 09:54:32 -05:00
Marty Schoch
29b4f9623a
fix query parser to recognize field prefix before fuzzy tilde
2015-01-19 16:20:32 -05:00
Marty Schoch
521d6101fd
fix issue identified by go vet
2015-01-19 15:50:07 -05:00
Marty Schoch
7e3ba85b9d
added test and fixed behavior to ensure correct value is stored
...
optimization introduced last week inadvertently meant we were
not preserving the original byte values of text fields that
were stored
2015-01-19 15:40:18 -05:00
Marty Schoch
3ea1894dea
fix rendering of non-text fields
2015-01-19 15:39:43 -05:00
Marty Schoch
3e4969efbc
change defaul fuzziness to 2 as that is more common
2015-01-19 14:08:22 -05:00
Marty Schoch
3e72949563
Merge branch 'StreamBoat-fix_ptr_handling'
2015-01-16 17:49:37 -05:00
Marty Schoch
50090dd199
added test for mapping struct with pointer to string
2015-01-16 17:49:01 -05:00
Marty Schoch
141585b806
Merge branch 'fix_ptr_handling' of https://github.com/StreamBoat/bleve into StreamBoat-fix_ptr_handling
2015-01-16 17:48:25 -05:00
Marty Schoch
8776d6c2b9
change behavior to ignore fields with JSON struct tag "-"
...
closes #146
2015-01-16 17:44:23 -05:00
Marty Schoch
89b1e2b18e
add unit test for handling of JSON struct tag "-"
...
see #146
2015-01-16 17:44:23 -05:00
Steve Yen
782ad94e01
added debug tag for metrics KVStore
2015-01-16 11:18:40 -08:00
Marty Schoch
eebc8e7825
more debuging around forestdb snapshots
2015-01-16 14:18:28 -05:00
Marty Schoch
ba978ea27e
improving log messages
2015-01-16 14:07:47 -05:00
Marty Schoch
8afbd485f9
also print out the requested stored fields
2015-01-16 13:35:43 -05:00
Marty Schoch
09fe749913
default to autocompaction for forestdb
2015-01-16 13:35:43 -05:00
Steve Yen
12dc2aff93
add go1.4 build tag to cznicb KVStore
...
This is because github.com/cznic/b depends on sync.Pool.
2015-01-15 15:54:25 -08:00
Steve Yen
11ee0209ad
no leading zeros for metrics CSV output
2015-01-15 15:09:53 -08:00
Steve Yen
202191201c
added WriteCSV() to metrics KVStore
2015-01-15 14:11:15 -08:00
Steve Yen
9be4e217bc
metrics KVStore tracks perf metrics on a wrapped KVStore
2015-01-15 11:42:41 -08:00
Peter Fern
7480409cc4
Process pointer fields, rather than trying to walk them
...
Pointers may be references to any type, existing logic will only handle
types understood by `walkDocument`, instead pass the deref'd element to
`processProperty`, where it can get passed back to `walkDocument` if
necessary, or be processed as a regular field.
2015-01-15 16:15:50 +11:00
Marty Schoch
530613a239
rewrite map access to take advantage of optimization
2015-01-14 12:57:34 -05:00
Marty Schoch
890b1abfe6
new version of lower case filter which tries to avoid copying bytes
2015-01-14 11:34:30 -05:00
Marty Schoch
7cc544adf2
switched to bytes.ToLower for minor speedup
2015-01-14 09:28:57 -05:00
Marty Schoch
f000092201
added benchmark for lowercase filter
2015-01-14 09:28:57 -05:00
Steve Yen
ea0a8657f3
added cznicb in-memory kvstore (no reader isolation)
2015-01-13 17:35:28 -08:00