bleve

Author	SHA1	Message	Date
Marty Schoch	5aa9e95468	major refactor of index/search API: index id's are now opaque (until finally returned to top-level user) - the TermFieldDoc's returned by TermFieldReader no longer contain doc id - instead they return an opaque IndexInternalID - items returned are still in the "natural index order" - but that is no longer guaranteed to be "doc id order" - correct behavior requires that they all follow the same order - but not any particular order - new API FinalizeDocID which converts index internal ID's to public string ID - APIs used internally which previously took doc id now take IndexInternalID - that is DocumentFieldTerms() and DocumentFieldTermsForFields() - however, APIs that are used externally do not reflect this change - that is Document() - DocumentIDReader follows the same changes, but this is less obvious - behavior clarified, used to iterate doc ids, BUT NOT in doc id order - method STILL available to iterate doc ids in range - but again, you won't get them in any meaningful order - new method to iterate actual doc ids from list of possible ids - this was introduced to make the DocIDSearcher continue working; searchers now work with the new opaque index internal doc ids - they return new DocumentMatchInternal (which does not have string ID); scorers also work with these opaque index internal doc ids - they return DocumentMatchInternal (which does not have string ID); collectors now also perform a final step of converting the final result - they STILL return traditional DocumentMatch (with string ID) - but they now also require an IndexReader (so that they can do the conversion)	2016-07-31 13:46:18 -04:00
Marty Schoch	47ee69ae82	term field reader supports optionally omitting 3 details: at the time you create the term field reader, you can specify that you don't need the term freq, the norm, or the term vectors; in that case, the index implementation can choose to not return them in its subsequently returned values. this is advisory only, some simple implementations may ignore this and continue to return the values anyway (as the current impl of upside_down does today). this change will allow future index implementations the opportunity to do less work when it isn't required	2016-07-30 10:26:42 -04:00
Steve Yen	4822cff63a	optimize Advance() with pre-allocated in-out param: This perf-related change helps the code and API reach more similarity with the Next() methods, which now take a pre-allocated param.	2016-07-29 14:15:00 -07:00
Steve Yen	3c82086805	optimize upside_down reader & 64-bit struct alignments The UpsideDownCouchTermFieldReader.Next() only needs the doc ID from the key, so this change provides a specialized parseKDoc() method for that optimization. Additionally, fields in various structs are more 64-bit aligned, in an attempt to reduce the invocations of runtime.typedmemmove() and runtime.heapBitsBulkBarrier(), which the go compiler seems to automatically insert to transparently handle misaligned data.	2016-07-23 10:37:40 -07:00
Steve Yen	5094d2d097	optimize moss PrefixIterator Previously, the PrefixIterator() for moss was implemented by comparing the prefix bytes on every Next(). With this optimization, the next larger endKeyExclusive is computed at the iterator's initialization, which allows us to avoid all those prefix comparisons.	2016-07-21 18:33:34 -07:00
Steve Yen	5271a0f62b	optimize termFieldVectorsFromTermVectors when empty	2016-07-21 11:46:14 -07:00
Steve Yen	cbb174b074	optimize moss iterator Next() done/k/v maintenance	2016-07-21 11:10:49 -07:00
Steve Yen	b744148449	optimization to actually reuse the TermFrequencyRow	2016-07-21 11:10:49 -07:00
Steve Yen	6d7fa0b964	optimize moss iterator checkDone()	2016-07-21 11:10:49 -07:00
Steve Yen	39d3e2f028	optimize upside_down reader Next() with TermFieldDoc reuse This optimization changes the index.TermFieldReader.Next() interface API, adding an optional, pre-allocated *TermFieldDoc parameter, which can help prevent garbage creation.	2016-07-21 11:10:49 -07:00
Steve Yen	2498ccc913	optimize upside_down reader Next() to reuse TermFrequencyRow Before this change, upside down's reader would alloc a new TermFrequencyRow on every Next(), which would be immediately transformed into an index.TermFieldDoc{}. This change reuses a pre-allocated TermFrequencyRow that's a field in the reader.	2016-07-21 11:10:49 -07:00
Steve Yen	68af6aef62	optimize upside_down reader Next() when 0-length term field vectors From some bleve-query perf profiling, term field vectors appeared to be alloc'ed, which was unnecessary as term field vectors are disabled in the bleve-blast/bleve-query tests.	2016-07-21 11:10:49 -07:00
Marty Schoch	5934a185f3	Merge pull request #398 from slavikm/master Make facets much faster	2016-07-21 09:12:28 -04:00
slavikm	fc990bc2d1	Remove the field IDs from outside of the index	2016-07-19 20:42:45 -07:00
slavikm	ce64c17be1	Do field cache only once per search	2016-07-17 16:29:17 -07:00
slavikm	9a9b630a6d	Make facets much faster	2016-07-17 15:31:35 -07:00
Steve Yen	80623f4a8a	MB-20101 - moss KV fix Get() of 0-length vals The moss KV store adapter's Get() implementation was incorrectly transforming a 0-length val (e.g., []byte{}) into a nil val.	2016-07-15 14:41:30 -07:00
Marty Schoch	bd2a23fb6d	remove firestorm index scheme: firestorm was an experiment; we learned a lot, but it did not result in a usable index scheme	2016-06-26 07:51:41 -04:00
Mark Mindenhall	c3c827aded	Add boltdb config test	2016-06-14 13:36:40 -06:00
Mark Mindenhall	d369bd5c3c	Add bucket fill percent option for boltdb	2016-06-13 18:47:38 -06:00
Marty Schoch	1be5699c54	Merge pull request #381 from MachineShop-IOT/master Compact for boltdb (workaround for #374)	2016-06-08 00:01:20 -04:00
Steve Yen	4e531ae11b	configurable mossStoreOptions and DeferredSort defaults to true	2016-06-07 17:38:43 -07:00
Mark Mindenhall	09fcc69516	rename defaultBatchSize to defaultCompactBatchSize	2016-06-01 14:25:57 -06:00
Mark Mindenhall	b5a4378a46	Cleanup godoc comments in PR	2016-06-01 13:59:57 -06:00
Mark Mindenhall	fecf7ab5c4	Compact for boltdb (workaround for #374 )	2016-06-01 13:16:43 -06:00
Marty Schoch	92cf2a8974	Merge pull request #376 from MachineShop-IOT/master Remove DictionaryTerm with count 0 during compact (workaround for #374)	2016-06-01 13:39:30 -04:00
Steve Yen	bf318b489b	enable mossStore as configurable lower-level store Also, bumped moss vendor SHA to latest moss with mossStore.	2016-05-26 13:33:22 -07:00
Mark Mindenhall	04351eb8f1	Move creation of iterator within transaction	2016-05-26 12:29:49 -06:00
Mark Mindenhall	686b20be4f	Remove DictionaryTerm with count 0 during compact (workaround for #374 )	2016-05-26 11:04:53 -06:00
Mark Mindenhall	3aa1d72233	Add compact method to goleveldb store	2016-05-17 16:58:17 -06:00
Marty Schoch	73b514fa4f	do not put +/-Inf or NaN values into the stats map	2016-04-15 13:39:30 -04:00
Marty Schoch	b8a2fbb887	fix data race in bleve batch reuse: Currently bleve batch is built by user goroutine, then read by bleve goroutine. This is still safe when used correctly. However, Reset() will modify the map, which is now a data race. This fix is to simply make batch.Reset() alloc new maps. This provides a data-access pattern that can be used safely. Also, this thread argues that creating a new map may be faster than trying to reuse an existing one: https://groups.google.com/d/msg/golang-nuts/UvUm3LA1u8g/jGv_FobNpN0J Separate but related, I have opted to remove the "unsafe batch" checking that we did. This was always limited anyway, and now users of Go 1.6 are just as likely to get a panic from the runtime for concurrent map access anyway. So, the price paid by us (additional mutex) is not worth it. fixes #360 and #260	2016-04-08 15:32:13 -04:00
Marty Schoch	2a703376ea	fix ineffectual assignments	2016-04-02 22:42:56 -04:00
Marty Schoch	7892882519	fix typos	2016-04-02 21:59:30 -04:00
Marty Schoch	194ee82c80	gofmt simplifications	2016-04-02 21:54:33 -04:00
Marty Schoch	639fb1ab89	remove NativeMergeOperator from core, it requires unsafe	2016-03-24 12:06:43 -04:00
Marty Schoch	724684a4f1	additional firestorm fixes for 64-bit alignment part of #359	2016-03-20 11:02:13 -04:00
Marty Schoch	3dc64de478	moved fields requiring 64-bit alignment to start of struct: several data structures had a pointer at the start of the struct. on some 32-bit systems, this causes the remaining fields to no longer be aligned on 64-bit boundaries. the fix identified by @pmezard is to put the counters first in the struct, which guarantees correct alignment. fixes #359	2016-03-20 10:38:28 -04:00
Steve Yen	be2800a8e4	MB-18715 - moss Merge() didn't bump bufUsed correctly And, also allocate more memory for both the partial and full merges.	2016-03-15 17:09:40 -07:00
Steve Yen	c1597842d0	moss lowerLevelUpdate didn't handle batches of size 1	2016-03-11 15:47:23 -08:00
Steve Yen	f1dac8b497	moss defaults to non-nil options.Log	2016-03-09 10:15:11 -08:00
Steve Yen	1d63c55f7c	parse mossLowerLevelMaxBatchSize only when lower-level-store exists	2016-03-09 10:09:15 -08:00
Steve Yen	76b9365928	added moss RegistryCollectionOptions The moss RegistryCollectionOptions allows applications to register moss-related callback API functions and other advanced feature usage at process initialization time. For example, this could be used for moss's OnError(), OnEvent() and logging callback options.	2016-03-09 09:40:29 -08:00
Marty Schoch	d7292ed891	add support for gathering stats via map for easier consumption	2016-03-07 18:37:46 -05:00
Marty Schoch	e51f4d5450	changing async test strategy, was failing in go 1.6	2016-03-07 09:39:20 -05:00
Marty Schoch	23a323bc9d	add support for numPlainTextBytesIndexed metric	2016-03-05 14:05:08 -05:00
Marty Schoch	81780f97d0	add term search stats	2016-03-05 07:50:25 -05:00
Marty Schoch	147debaa12	expose metrics and moss stats wrapping underlying stats as well	2016-03-04 13:43:39 -05:00
Steve Yen	f6d1bd2c87	moss option MaxPreMergerBatches renamed	2016-03-03 11:18:30 -08:00
Steve Yen	7d67d89a9c	MB-18441 - moss lower-level iterator starts positioned on current The iterator starts off positioned so that Current() is correct, so invoking Next() right off the bat was incorrect.	2016-03-01 21:45:48 -08:00


344 Commits