0
0
Commit Graph

598 Commits

Author SHA1 Message Date
Patrick Mezard
ee8af9cfa3 doc: document field values storage and retrieval 2015-10-04 11:25:58 +02:00
Marty Schoch
e3a185b1c5 Merge pull request #242 from pmezard/add-boltdb-nosync-option
boltdb: add "nosync" option to force boltdb.DB.NoSync=true
2015-10-03 09:50:55 -04:00
Patrick Mezard
9d5407be13 boltdb: add "nosync" option to force boltdb.DB.NoSync=true
Use this option when rebuilding indexes from scratch. In my small case
(~17000 json documents), it reduces indexing from 520s to 250s.

I did not add any test, short of forced indexing termination it only
has performance effects, which are hard to test. And unknown options are
currently ignored.

Issue #240
2015-10-03 14:26:48 +02:00
Marty Schoch
6a9e2252c4 Merge pull request #241 from pmezard/document-field-analyzer-a-bit
Document field analyzer a bit
2015-10-02 13:12:56 -04:00
Patrick Mezard
2f48c16c84 doc: document IndexMapping.AddCustomAnalyzer 2015-10-02 17:38:07 +02:00
Patrick Mezard
ed1bdbf599 doc: document field analyzer resolution 2015-10-02 17:00:45 +02:00
Patrick Mezard
498e4a0de7 simplify FieldMapping.analyzerForField()
I stumbled onto that while trying to understand how analyzers are
resolved. The new code looks simpler to me and removes useless calls to
DocumentMapping.defaultAnalyzerName() when an analyzer is set at
FieldMapping level.

The slight change to TestStoredFieldPreserved avoids a stacktrace when
the test fails.
2015-10-02 15:45:43 +02:00
Marty Schoch
21473509b2 Merge branch 'Shugyousha-codesimplification' 2015-09-29 13:03:08 -04:00
Marty Schoch
da40935e22 Merge branch 'codesimplification' of https://github.com/Shugyousha/bleve into Shugyousha-codesimplification 2015-09-29 13:02:56 -04:00
Marty Schoch
09bde6ca87 Merge pull request #237 from pmezard/document-mapping-rules
doc: document indexed value/mappings interactions
2015-09-29 12:51:46 -04:00
Marty Schoch
1aac60a0f1 stop building wiki (retired) example project 2015-09-29 12:47:47 -04:00
Marty Schoch
6fa02b786d Merge pull request #236 from pmezard/implement-doc-static-mapping
implement document static mappings
2015-09-29 12:38:50 -04:00
Patrick Mezard
7db27aeba1 implement document static mappings
DocumentMapping.Dynamic was ignored by everything but the marshalling
code.

Issue #235
2015-09-29 11:32:36 +02:00
Marty Schoch
c3a4fab911 Merge pull request #238 from ikawaha/ja-morph-analyzer
fix compliation with the latest changes to kagome
2015-09-28 17:05:46 -04:00
Marty Schoch
cddf90c0ee don't allow operations on empty doc id
fixes #239
2015-09-28 17:00:08 -04:00
ikawaha
89af7978a9 fix compliation with the latest changes to kagome 2015-09-28 15:53:08 +09:00
Marty Schoch
1c9feaf792 fix backwards compatibility when index meta does not specify
the index type
2015-09-25 09:57:09 -07:00
Patrick Mezard
f72172a902 doc: document indexed value/mappings interactions
This is not the final word on this but it would have helped me a lot
starting with bleve. I left out details about value processing and
custom parsers. I also ignored that named FieldMapping can directly
resolve value fields without having a parent SubDocumentMapping because
it did not appear in any example I read.

Let's consider this as a starting point for documentation improvements.
2015-09-23 19:57:38 +02:00
Silvan Jegen
650d48427d Refactor AddFieldValue method
Removing one level of nesting makes the method easier to read.
2015-09-21 21:14:15 +02:00
Silvan Jegen
3414701fca Simplify returns 2015-09-21 20:47:10 +02:00
Silvan Jegen
f95a4662b1 Use Error type correctly 2015-09-21 20:47:10 +02:00
Steve Yen
ad82b4b33e updated link to simple query syntax in README.md 2015-09-17 10:27:58 -07:00
Marty Schoch
5897997906 update utils for new bleve config 2015-09-16 17:18:46 -04:00
Marty Schoch
f81b2be334 major refactor of bleve configuration
see #221 for full details
2015-09-16 17:10:59 -04:00
Marty Schoch
17c64d37c7 add similar benchmarks from firestorm 2015-09-10 08:13:52 -04:00
Marty Schoch
d00bc91dc9 minor speed up in token frequency calculations
benchmark               old ns/op     new ns/op     delta
BenchmarkAnalysis-4     1599218       1540991       -3.64%

benchmark               old allocs     new allocs     delta
BenchmarkAnalysis-4     5353           5318           -0.65%

benchmark               old bytes     new bytes     delta
BenchmarkAnalysis-4     370495        362983        -2.03%
2015-09-04 18:57:39 -04:00
Marty Schoch
0fab4b4583 fix nested document mappings, logic was broken
part of #228
2015-09-02 19:07:41 -04:00
Marty Schoch
dbb93b75a4 refactoring to allow pluggable index encodings
this lays the foundation for supporting the new firestorm
indexing scheme.  i'm merging these changes ahead of
the rest of the firestorm branch so i can continue
to make changes to the analysis pipeline in parallel
2015-09-02 13:12:08 -04:00
Marty Schoch
4840aaaa5a make analysis queue size changeable 2015-09-02 11:55:30 -04:00
Marty Schoch
7ad7659ce5 add support for using null kvstore outside of bleve internals 2015-09-02 11:50:06 -04:00
Marty Schoch
07d37ca38a add important rocksdb config options 2015-09-02 11:49:42 -04:00
Marty Schoch
3e60ca24ec support using end key on forestdb iterator for term freq lookup
also additoanl forestdb configs
2015-08-18 16:22:02 -04:00
Marty Schoch
d3dda3d0ea fixup config parsing and add new options 2015-08-12 13:18:23 -04:00
Marty Schoch
01667dfff3 faster protobufs with gogo 2015-08-12 13:18:23 -04:00
Marty Schoch
a4b44e25ca Merge pull request #227 from donhcd/customAnalyzerPackage
Move custom_analyzer to custom_analyzer package
2015-08-11 17:36:57 -04:00
Donald Huang
767831d87c move custom_analyzer to custom_analyzer package 2015-08-11 21:22:03 +00:00
Marty Schoch
7df66b4857 fix broken benchmark cause by index row encoding change 2015-08-06 14:48:04 -04:00
Marty Schoch
e255ff52b3 Merge pull request #223 from donhcd/fixTextFieldGoString
Fix invalid fmt string using analysis.Analyzer
2015-08-03 14:04:20 -04:00
Donald Huang
587bed442d fix invalid fmt string using analysis.Analyzer 2015-08-03 17:53:10 +00:00
Marty Schoch
7456842b04 Merge branch 'tukdesk-fix/MaxVarintLen64' 2015-07-31 15:16:34 -04:00
Marty Schoch
9db850a53e Merge branch 'fix/MaxVarintLen64' of https://github.com/tukdesk/bleve into tukdesk-fix/MaxVarintLen64 2015-07-31 15:16:16 -04:00
Marty Schoch
f35e2e42df fix highlighting to work on fields containing arrays
fixes #170
2015-07-31 14:43:12 -04:00
Marty Schoch
3682c25467 update to correctly work with composite fields
also updated search results to return array positions
2015-07-31 11:16:11 -04:00
Marty Schoch
70b23cbe7f Merge branch 'tukdesk-feature/term_vector' 2015-07-29 14:33:15 -04:00
Marty Schoch
c1c4941dde Merge branch 'feature/term_vector' of https://github.com/tukdesk/bleve into tukdesk-feature/term_vector 2015-07-29 14:31:15 -04:00
Marty Schoch
652c188fe8 exclude metrics 2015-07-28 19:10:49 -04:00
Marty Schoch
bf8dcae76b removing build tags 2015-07-28 18:59:10 -04:00
Marty Schoch
1f4ef3da8b move elision filter after lowercase filter
this affects all languages using the elision filter
languages fr and it are updated now
languages ca and ga are still missing other components and
do not yet have an analyzer, but they should follow this lead
once they are ready

fixes #218
2015-07-21 10:43:53 -04:00
Marty Schoch
2a8f319689 added test case for query string containing only MUST NOT clause 2015-07-13 15:30:19 -04:00
Marty Schoch
548b17310b Merge branch 'nimishzynga-test' 2015-07-13 15:26:05 -04:00