Marty Schoch
17c64d37c7
add similar benchmarks from firestorm
2015-09-10 08:13:52 -04:00
Marty Schoch
1e4d637761
adding more benchmarks
2015-09-10 08:01:11 -04:00
Marty Schoch
37aa5cb027
Merge branch 'master' into firestorm
2015-09-09 09:03:42 -04:00
Marty Schoch
d00bc91dc9
minor speed up in token frequency calculations
...
benchmark old ns/op new ns/op delta
BenchmarkAnalysis-4 1599218 1540991 -3.64%
benchmark old allocs new allocs delta
BenchmarkAnalysis-4 5353 5318 -0.65%
benchmark old bytes new bytes delta
BenchmarkAnalysis-4 370495 362983 -2.03%
2015-09-04 18:57:39 -04:00
Marty Schoch
0fab4b4583
fix nested document mappings, logic was broken
...
part of #228
2015-09-02 19:07:41 -04:00
Marty Schoch
f74ed6a9ae
Merge remote-tracking branch 'origin' into firestorm
...
cathching up with changes from master
2015-09-02 13:29:03 -04:00
Marty Schoch
dbb93b75a4
refactoring to allow pluggable index encodings
...
this lays the foundation for supporting the new firestorm
indexing scheme. i'm merging these changes ahead of
the rest of the firestorm branch so i can continue
to make changes to the analysis pipeline in parallel
2015-09-02 13:12:08 -04:00
Marty Schoch
4840aaaa5a
make analysis queue size changeable
2015-09-02 11:55:30 -04:00
Marty Schoch
7ad7659ce5
add support for using null kvstore outside of bleve internals
2015-09-02 11:50:06 -04:00
Marty Schoch
07d37ca38a
add important rocksdb config options
2015-09-02 11:49:42 -04:00
Marty Schoch
18151862b5
fix go vet issues
2015-08-25 15:13:13 -04:00
Marty Schoch
84811cf5a0
made index type configurable + first version of firestorm
2015-08-25 14:52:42 -04:00
Marty Schoch
3e60ca24ec
support using end key on forestdb iterator for term freq lookup
...
also additoanl forestdb configs
2015-08-18 16:22:02 -04:00
Marty Schoch
ae19d77b04
updated protobuf defs to be valid
2015-08-17 15:37:13 -04:00
Marty Schoch
1187436e46
changed Stored row Values to also use protobuf
2015-08-17 09:48:40 -04:00
Marty Schoch
8d8a05a842
fix more issues
2015-08-14 16:27:00 -04:00
Marty Schoch
e0802a2b39
fixed the worst of the formatting
2015-08-14 16:17:48 -04:00
Marty Schoch
f4df56eb7c
add first draft of firestorm proposal
2015-08-14 16:09:19 -04:00
Marty Schoch
d3dda3d0ea
fixup config parsing and add new options
2015-08-12 13:18:23 -04:00
Marty Schoch
01667dfff3
faster protobufs with gogo
2015-08-12 13:18:23 -04:00
Marty Schoch
a4b44e25ca
Merge pull request #227 from donhcd/customAnalyzerPackage
...
Move custom_analyzer to custom_analyzer package
2015-08-11 17:36:57 -04:00
Donald Huang
767831d87c
move custom_analyzer to custom_analyzer package
2015-08-11 21:22:03 +00:00
Marty Schoch
7df66b4857
fix broken benchmark cause by index row encoding change
2015-08-06 14:48:04 -04:00
Marty Schoch
e255ff52b3
Merge pull request #223 from donhcd/fixTextFieldGoString
...
Fix invalid fmt string using analysis.Analyzer
2015-08-03 14:04:20 -04:00
Donald Huang
587bed442d
fix invalid fmt string using analysis.Analyzer
2015-08-03 17:53:10 +00:00
Marty Schoch
7456842b04
Merge branch 'tukdesk-fix/MaxVarintLen64'
2015-07-31 15:16:34 -04:00
Marty Schoch
9db850a53e
Merge branch 'fix/MaxVarintLen64' of https://github.com/tukdesk/bleve into tukdesk-fix/MaxVarintLen64
2015-07-31 15:16:16 -04:00
Marty Schoch
f35e2e42df
fix highlighting to work on fields containing arrays
...
fixes #170
2015-07-31 14:43:12 -04:00
Marty Schoch
3682c25467
update to correctly work with composite fields
...
also updated search results to return array positions
2015-07-31 11:16:11 -04:00
Marty Schoch
70b23cbe7f
Merge branch 'tukdesk-feature/term_vector'
2015-07-29 14:33:15 -04:00
Marty Schoch
c1c4941dde
Merge branch 'feature/term_vector' of https://github.com/tukdesk/bleve into tukdesk-feature/term_vector
2015-07-29 14:31:15 -04:00
Marty Schoch
652c188fe8
exclude metrics
2015-07-28 19:10:49 -04:00
Marty Schoch
bf8dcae76b
removing build tags
2015-07-28 18:59:10 -04:00
Marty Schoch
1f4ef3da8b
move elision filter after lowercase filter
...
this affects all languages using the elision filter
languages fr and it are updated now
languages ca and ga are still missing other components and
do not yet have an analyzer, but they should follow this lead
once they are ready
fixes #218
2015-07-21 10:43:53 -04:00
Marty Schoch
2a8f319689
added test case for query string containing only MUST NOT clause
2015-07-13 15:30:19 -04:00
Marty Schoch
548b17310b
Merge branch 'nimishzynga-test'
2015-07-13 15:26:05 -04:00
Marty Schoch
e8e1556996
Merge branch 'test' of https://github.com/nimishzynga/bleve into nimishzynga-test
2015-07-13 15:23:51 -04:00
Marty Schoch
1b28f6218b
additional row validation
2015-07-13 15:22:54 -04:00
Marty Schoch
ad0d580587
add option to bleve_create to select storage
2015-07-13 15:19:56 -04:00
Marty Schoch
17ef48f82a
switching back to the canonical goleveldb repo
2015-07-08 12:21:17 -06:00
Nimish Gupta
77779b70f4
Allow query string handling which contains only not must
...
fixes #193
2015-07-08 16:36:14 +05:30
Marty Schoch
bf80f4628e
fix bug in curent goleveldb (must copy during iteration)
...
also changed over to mschoch fork of goleveldb (temporary)
the change to my fork is pending some read-only issues described
here: https://github.com/syndtr/goleveldb/issues/111
hopefully we can find a path forward, and get that addressed upstream
2015-07-06 18:00:05 -04:00
Marty Schoch
7f0961424d
updated tests for <mark></mark>
2015-07-06 18:00:05 -04:00
Marty Schoch
e2223f5121
changed HTML highlighter to use html mark tag
2015-07-06 18:00:05 -04:00
Marty Schoch
65556f45c7
added additional tests for bug #214
2015-07-06 18:00:05 -04:00
Marty Schoch
7be7ecdf8e
fix batch indexing bug, incremented docCount before commit
...
fixes #211
2015-06-08 14:14:05 -04:00
Marty Schoch
2768c2da3c
fix previous sloppy fix which hadn't been adequately tested
2015-05-27 19:15:55 -07:00
Marty Schoch
201fb91171
fix up to correctly trim off separator
...
even though it should never be present
2015-05-27 19:10:12 -07:00
Marty Schoch
a58592ceff
fix case where NewBackIndexRowKV returns nil, nil
...
the logic for reading the docID from the keys
in this row relies on the keys NEVER containing
the byte separator character (0xff), this is OK
as we require that all keys be valid utf-8
however, it turns out that in the case where this
rule was violated, we would panic, because we
return nil, nil and later try to print the doc id
2015-05-27 19:04:57 -07:00
dtynn
59c97ae577
use binary.MaxVarintLen64
2015-05-26 15:35:31 +08:00