0
0
Commit Graph

674 Commits

Author SHA1 Message Date
Marty Schoch
a1ce44fcd4 Merge pull request #284 from pmezard/add-indexmapping-fieldanalyzer
mapping_index: add IndexMapping.FieldAnalyzer()
2015-11-19 12:55:27 -05:00
Marty Schoch
46534d79ea Merge pull request #285 from pmezard/phrase-queries-need-term-vectors
mapping_field: document IncludeTermVectors
2015-11-19 12:53:38 -05:00
Patrick Mezard
1591ed1839 mapping_field: document IncludeTermVectors
And mention it is required for phrase and match phrase queries to
succeed.

Fix #280
2015-11-19 15:38:16 +01:00
Patrick Mezard
ff41dfec3d mapping_index: add IndexMapping.FieldAnalyzer()
It returns the name of the analyzer used on a field, which can be passed
to IndexMapping.AnalyzeText().

Fix #282
2015-11-19 11:20:32 +01:00
Marty Schoch
7dd52a5463 Merge pull request #279 from pmezard/expose-row-attributes
row: expose TermFrequencyRow term and freq fields
2015-11-17 12:24:16 -05:00
Patrick Mezard
e85c9c542e row: expose TermFrequencyRow term and freq fields
Rows content is an implementation detail of bleve index and may change
in the future. That said, they also contains information valuable to
assess the quality of the index or understand its performances. So, as
long as we agree that type asserting rows should only be done if you
know what you are doing and are ready to deal with future changes, I see
no reason to hide the row fields from external packages.

Fix #268
2015-11-17 17:21:26 +01:00
Marty Schoch
236cc156c5 Merge pull request #277 from marksamman/patch-1
Update NumericRangeQuery comments
2015-11-12 22:40:36 +00:00
Mark Samman
b3a1e21c12 Update NumericRangeQuery comments 2015-11-12 22:16:10 +01:00
Marty Schoch
7b22332b06 Merge pull request #271 from pmezard/docid-query
query_docid: add DocIDQuery to filter by document identifiers
2015-11-12 15:52:36 +00:00
Marty Schoch
bfbc2a6244 Merge pull request #273 from pmezard/document-token-map
token_map: document it along with stop_token_filter
2015-11-12 15:40:40 +00:00
Marty Schoch
eee4482964 Merge pull request #272 from kevgs/boltdb_optimize
BoltDB wrapper nano optimization which makes code a bit prettier too
2015-11-12 15:39:37 +00:00
Patrick Mezard
ff03874f19 token_map: document it along with stop_token_filter 2015-11-05 14:07:54 +01:00
Kosov Eugene
45e670b99b BoltDB wrapper nano optimization which makes code a bit prettier too 2015-11-05 00:27:28 +03:00
Patrick Mezard
19230b2f8a searcher_docid: catch DocIDReader.Close() possible error 2015-11-04 19:24:01 +01:00
Patrick Mezard
ff7234d893 query_docid: add DocIDQuery to filter by document identifiers 2015-11-04 18:41:16 +01:00
Marty Schoch
74c309a7c2 Merge pull request #270 from pmezard/fix-elision-filter
Fix elision filter
2015-11-04 07:36:59 -05:00
Patrick Mezard
eb26402924 elision_filter: correctly strip multi-bytes quotation marks 2015-11-04 10:59:10 +01:00
Patrick Mezard
bae2079eb2 token_filters: fix typo in right single quotation mark name 2015-11-04 10:29:56 +01:00
Marty Schoch
a873c76edc Merge pull request #265 from avsej/simplify-phrase-api
Simplify JSON API for phrase query
2015-11-02 15:42:41 -05:00
Sergey Avseyev
8609e7af08 Simplify JSON API for phrase query
New API looks like this:

    {"query":{"terms":["watered","down"],"field":"desc"}}

instead of

    {"query":{"terms":[{"term":"watered","field":"desc"},{"term":"down","field":"desc"}]}}

So that it eliminats accidental errors by supplying terms with different
fields, or different type of query
2015-11-02 21:08:11 +03:00
Marty Schoch
4791625b9b Merge pull request #262 from pmezard/index-and-tokenizer-doc-and-fix
Index and tokenizer doc and fix
2015-11-02 11:51:21 -05:00
Marty Schoch
30651065e9 fix panic on insufficiently sized buffer
adds test case to reproduce original problem
fixes #264
2015-10-30 18:25:38 -04:00
Patrick Mezard
0579d58263 mapping_test: fix TestMappingWithTokenizerDeps now patterns are required 2015-10-29 20:07:00 +01:00
Marty Schoch
2bd3ef4080 copy relevant k/v pairs before advancing underlying iterator 2015-10-28 12:23:54 -04:00
Marty Schoch
d1b07f4909 fix dump methods to properly copy keys and values 2015-10-28 12:06:44 -04:00
Patrick Mezard
f95f1d29a0 exception: fail if pattern is empty, name tokenizer in error 2015-10-27 18:53:03 +01:00
Patrick Mezard
54a85fa96a registry: improve error message upon forgotten "type" property
Registering a custom tokenizer while forgetting its "type" used to
return:

  error: unable to determine type

It now says:

  error: cannot resolve 'foo' tokenizer type: 'type' property is not defined
2015-10-27 18:53:03 +01:00
Patrick Mezard
8b17787a65 analysis: document "exception" tokenizer, and Tokenizer interface 2015-10-27 18:53:03 +01:00
Patrick Mezard
f2b3d5698e index: document TermFieldReader interface 2015-10-27 18:53:03 +01:00
Patrick Mezard
3df789d258 index: document empty strings behaviour when calling DocIDReader() 2015-10-27 18:53:03 +01:00
Marty Schoch
1a978a4591 fix go vet issues and cleanup reader/iterator 2015-10-26 16:41:58 -04:00
Marty Schoch
f0d282f5f8 add test case for seeing prefix iterators outside of range
similar to #256 except for prefix iterators
includes fix for boltdb and gtreap which had incorrect behavior
2015-10-26 16:14:29 -04:00
Marty Schoch
0ba164322b Merge pull request #261 from pmezard/improve-bleve-dump
bleve_dump: improve online help and error handling
2015-10-26 11:04:55 -04:00
Patrick Mezard
bc977048f7 bleve_dump: improve online help and error handling
I was expecting:

  $ bleve_dump -index /index -docID 123 -fields html

to print only the terms for "html" field in specified document. The
command now detects extra arguments and flag collisions.
2015-10-26 10:37:13 +01:00
Marty Schoch
f649998490 Merge pull request #257 from pmezard/pretty-print-queries
query: add DumpQuery to expand string query and format them as JSON
2015-10-23 09:28:36 -04:00
Patrick Mezard
c9619f0359 query: add DumpQuery to expand string query and format them as JSON
This is convenient to see either complicated queries build
programmatically, or to make sure the query parser does what it is
expected to do.

Note only queries made of bleve basic queries are supported. If we
wanted to support external queries, for instance string queries with an
alternative parser, I suggest to introduce some kind of:

type ExpandableQuery interface {
    Query
    Expand(*IndexMapping) (Query, error)
}

and type assert to that instead of *queryStringQuery.
2015-10-23 14:52:42 +02:00
Marty Schoch
89bc8c3a93 Merge pull request #253 from pmezard/document-index-interfaces
doc: document DocIDReader, and some Index bits
2015-10-20 14:47:35 -04:00
Patrick Mezard
5100e00f20 doc: DocIDReader.Advance() is no longer implementation dependent 2015-10-20 20:32:23 +02:00
Patrick Mezard
2fa334fc27 doc: talk about "documents" not "indexed or stored documents" 2015-10-20 20:24:24 +02:00
Patrick Mezard
b174c137fd doc: document DocIDReader, and some Index bits 2015-10-20 20:24:24 +02:00
Marty Schoch
74780b028e Merge pull request #256 from pmezard/fix-rangeiterator-seek
Fix rangeiterator seek
2015-10-20 14:08:57 -04:00
Patrick Mezard
da72d0c2b9 store_test: deduplicate store initialization 2015-10-20 19:21:01 +02:00
Patrick Mezard
873f483804 gtreap: RangeIterator.Seek should not move before start 2015-10-20 19:12:30 +02:00
Patrick Mezard
5d7628ba3b boltdb: fix RangeIterator outside of range seeks
Two issues:
- Seeking before i.start and iterating returned keys before i.start
- Seeking after the store last key did not invalidate the iterator and
  could cause infinite loops.
2015-10-20 19:09:51 +02:00
Patrick Mezard
aada2e7333 store_test: test RangeIterator.Seek on goleveldb 2015-10-20 19:09:38 +02:00
Marty Schoch
6cc21346dc fix errcheck issues 2015-10-19 14:27:03 -04:00
Marty Schoch
817c317c90 Merge branch 'master' into newkvstore 2015-10-19 12:04:07 -04:00
Marty Schoch
faceecf87b make row buffer size constant/configurable
also handle case where it is insufficiently sized
2015-10-19 12:03:38 -04:00
Marty Schoch
f0ee9a3c66 removed commented code and unused functions 2015-10-19 11:13:03 -04:00
Marty Schoch
c9471d5739 Merge pull request #244 from kevgs/master
reducing allocation count
2015-10-16 15:51:30 -04:00