Marty Schoch
62645f10a2
Merge pull request #179 from gsathya/add_index_tests
...
Add tests for Index
2015-03-19 16:56:45 -04:00
Sathyanarayanan Gunasekaran
5c7aa21643
Add test for index.Stats
2015-03-19 14:06:59 -04:00
Sathyanarayanan Gunasekaran
d9a7a2e3a0
Add test for index.FieldDictPrefix
2015-03-19 14:06:59 -04:00
Sathyanarayanan Gunasekaran
5b4ee3e598
Add test for index.FieldDictRange
2015-03-19 14:06:59 -04:00
Marty Schoch
6f185f8cc0
fix highlighting bug when terms overlap (ngram analysis)
...
fixes #178
2015-03-18 14:34:47 -04:00
Marty Schoch
a41f229b14
added regexp and wildcard queries
...
fixes #152
2015-03-11 16:57:22 -04:00
Marty Schoch
183fcd4b14
added a missing check for errors
2015-03-11 16:56:01 -04:00
Marty Schoch
a44a7c01af
rewrite to used fixed size []byte instead of buffer
...
removes unchecked errors in calls to buffer.Write
and also benchmarks considerably faster
2015-03-11 15:12:13 -04:00
Marty Schoch
50bd082257
fixed issues with portuguese analyzer
...
fixes #70
2015-03-11 14:22:11 -04:00
Marty Schoch
7970f42c29
fix issues with italian analyzer
...
switch it to not require icu/libstemmer
fixes #69
2015-03-11 11:48:13 -04:00
Marty Schoch
eeaf514848
switch fr to not require icu/libstemmer
...
also corrected copy/paste bug in test
2015-03-11 11:46:33 -04:00
Marty Schoch
8ae30fb6f0
fix issues with lucene stemmer
...
fixes issue #68
2015-03-11 11:14:29 -04:00
Marty Schoch
b5a79c8ecc
Merge pull request #173 from gsathya/fix_return_err
...
Check all return errors
2015-03-11 08:30:42 -04:00
Sathyanarayanan Gunasekaran
93e749bc0c
Check all return errors
...
- Fix the following errors found by errcheck :
$ bleve git:(master) errcheck github.com/blevesearch/bleve
github.com/blevesearch/bleve/index_impl.go:206:25 defer indexReader.Close()
github.com/blevesearch/bleve/index_impl.go:317:25 defer indexReader.Close()
github.com/blevesearch/bleve/index_impl.go:353:25 defer indexReader.Close()
github.com/blevesearch/bleve/index_impl.go:359:22 defer searcher.Close()
github.com/blevesearch/bleve/index_impl.go:497:25 defer indexReader.Close()
github.com/blevesearch/bleve/index_impl.go:644:20 defer reader.Close()
github.com/blevesearch/bleve/index_meta.go:67:27 defer indexMetaFile.Close()
2015-03-11 01:28:51 -04:00
Marty Schoch
522f9d5cc7
significant change to index format, support dictionary rows
...
this introduces disk format v4
now the summary rows for a term are stored in their own
"dictionary row" format, previously the same information
was stored in special term frequency rows
this now allows us to easily iterate all the terms for a field
in sorted order (useful for many other fuzzy data structures)
at the top-level of bleve you can now browse terms within a field
using the following api on the Index interface:
FieldDict(field string) (index.FieldDict, error)
FieldDictRange(field string, startTerm []byte, endTerm []byte) (index.FieldDict, error)
FieldDictPrefix(field string, termPrefix []byte) (index.FieldDict, error)
fixes #127
2015-03-10 16:22:19 -04:00
Marty Schoch
4e14f4e4ef
change path for forestdb test to correctly cleanup
...
this is due to forestdb auto-compaction using the provided
path as just the prefix, so if we're not careful we end
up with many stray files laying around
here, we create a sub-directory first, and just nuke the
whole subdir when we're done
2015-03-10 14:05:58 -04:00
Marty Schoch
0df0a6fcb2
better logging on which test failed in integration tests
2015-03-10 14:05:30 -04:00
Marty Schoch
18dabdb946
fix compilation of bulk index utility
2015-03-09 08:20:40 -04:00
Marty Schoch
af356acff0
changed batch behavior
...
now created through the index itself
mapping problems reported early at the time
data is added to the batch, previously these
were not reported until the batch was executed
2015-03-09 08:20:39 -04:00
Marty Schoch
eaccd74c93
Merge pull request #134 from Shugyousha/numfacet
...
Add a benchmark for the numeric facet builder and use sort.Sort in it (just like for the terms one)
2015-03-06 14:50:30 -05:00
Marty Schoch
300ec79c96
first pass at checking errors that were ignored
...
part of #169
2015-03-06 14:46:29 -05:00
Marty Schoch
a2ad7634f2
update term freq rows to use varint where possible
...
benchmark old ns/op new ns/op delta
BenchmarkLevelDBIndexing1Workers 1138292 657901 -42.20%
BenchmarkLevelDBIndexing2Workers 1619323 647628 -60.01%
BenchmarkLevelDBIndexing4Workers 1172845 636478 -45.73%
BenchmarkLevelDBIndexing1Workers10Batch 465556545 448153394 -3.74%
BenchmarkLevelDBIndexing2Workers10Batch 504203911 449657355 -10.82%
BenchmarkLevelDBIndexing4Workers10Batch 510766435 439839335 -13.89%
BenchmarkLevelDBIndexing1Workers100Batch 307657846 268976464 -12.57%
BenchmarkLevelDBIndexing2Workers100Batch 302257400 269110215 -10.97%
BenchmarkLevelDBIndexing4Workers100Batch 305320485 259084902 -15.14%
BenchmarkLevelDBIndexing1Workers1000Batch 301320576 258070231 -14.35%
BenchmarkLevelDBIndexing2Workers1000Batch 334174454 261175641 -21.84%
BenchmarkLevelDBIndexing4Workers1000Batch 267732436 261461739 -2.34%
closes #165
2015-03-06 13:00:53 -05:00
Marty Schoch
ee1210bc82
updated bleve_query to allow benchmarking single queries
2015-03-06 13:00:11 -05:00
Marty Schoch
7284c10020
added benchmark to collector
2015-03-06 12:59:44 -05:00
Marty Schoch
5ac9583370
fix godoc around NewFuzzyQuery function
...
closes #166
2015-03-02 09:29:06 -05:00
Marty Schoch
221664977d
Merge pull request #167 from gitter-badger/gitter-badge
...
Add a Gitter chat badge to README.md
2015-03-02 09:23:51 -05:00
The Gitter Badger
1b91caa536
Added Gitter badge
2015-02-23 19:54:11 +00:00
Marty Schoch
0771f813ce
SearchResult Took field now returns full time in Search()
...
likewise, MultiSearch used by aliases spanning multiple
will also return full time in MultiSearch()
closes #163
2015-02-19 12:11:40 +05:30
Marty Schoch
daeaa2c129
fix bad math in multi search, and return original reqest in res
...
related to #164
2015-02-18 17:24:22 +05:30
Marty Schoch
c566d34264
bump index format version number, start checking version on open
2015-02-17 17:16:31 +05:30
Marty Schoch
251e4717a2
Merge pull request #161 from nimishzynga/master
...
Minor change in example code in README
2015-02-17 11:47:55 +05:30
Nimish Gupta
1ce81c2f85
Minor change in example code in README
2015-02-17 11:40:07 +05:30
Marty Schoch
afbda14dc2
fix mapping analyzer lookup with multiple fields
...
closes #159
2015-02-13 08:46:36 -05:00
Marty Schoch
2f607d73f3
Merge pull request #157 from saljam/arabic
...
Further improvements to the Arabic analyzer
2015-02-11 15:49:19 -05:00
Marty Schoch
fb0db7066f
improve highlighting with multi-byte characters
...
fixes #156
2015-02-06 12:35:01 -05:00
Salmān Aljammāz
9444af9366
arabic: add unicode normalization to analyzer
2015-02-06 19:50:58 +03:00
Salmān Aljammāz
91a8d5da9f
arabic: check minimum length before stemming
...
This invloves converting tokens to a rune slice in the filter, but
at least we're now compatable with Lucene's stemmer.
2015-02-06 19:50:58 +03:00
Salmān Aljammāz
0470f93955
arabic: add more stemmer tests
...
These came from org.apache.lucene.analysis.ar.
2015-02-06 19:49:30 +03:00
Marty Schoch
15139b8fa5
fix panic on mapping value containing null
...
added testcase to reproduce initial issue
closes #153
2015-02-05 16:15:05 -05:00
Marty Schoch
41cd64337b
Merge pull request #155 from saljam/ar_stemmer
...
arabic stemmer: strip multiple suffixes
2015-02-05 08:13:15 -05:00
Salmān Aljammāz
e461fed92a
arabic stemmer: strip multiple suffixes
...
updates #150
2015-02-05 16:07:58 +03:00
Marty Schoch
4be974f489
added first implementation of arabic analyzer
...
one test cases is not passing and is commented out temporarily
updates #150
2015-02-05 07:44:55 -05:00
Marty Schoch
b9c22fe50d
Merge pull request #154 from saljam/arabic
...
add arabic light stemmer
2015-02-05 07:09:54 -05:00
Salmān Aljammāz
945ef8158f
add arabic light stemmer
...
fixes #28
updates #150
2015-02-05 13:24:30 +03:00
Marty Schoch
dd1cd189a7
added initial implementation of hindi analyzer
...
closes #66
2015-02-04 15:12:08 -05:00
Steve Yen
38ee9be353
added some batch size 1000 microbenchmarks
2015-01-30 15:58:39 -08:00
Steve Yen
7d6a6aeaa8
single append for inmem KVStore batch
2015-01-29 11:14:08 -08:00
Steve Yen
5a30d36b17
cznicb KVStore uses Put() for faster read-modify-write
2015-01-29 11:02:01 -08:00
Steve Yen
b054cddf76
gtreap KVStore does 1 append for batch Set/Delete
2015-01-29 10:49:39 -08:00
Steve Yen
05d222f490
cznicb KVStore batch uses <2 appends per Set/Delete
2015-01-29 10:22:13 -08:00