Commit Graph

160 Commits

Author SHA1 Message Date
Marty Schoch
b8a2fbb887 fix data race in bleve batch reuse
Currently a bleve batch is built by a user goroutine,
then read by a bleve goroutine.
This is still safe when used correctly.
However, Reset() will modify the map, which is now a data race.

This fix is to simply make batch.Reset() alloc new maps.
This provides a data-access pattern that can be used safely.
Also, this thread argues that creating a new map may be faster
than trying to reuse an existing one:

https://groups.google.com/d/msg/golang-nuts/UvUm3LA1u8g/jGv_FobNpN0J

Separately, but related, I have opted to remove the "unsafe batch"
checking that we did.  This was always limited anyway, and now
users of Go 1.6 are just as likely to get a panic from the
runtime for concurrent map access.  So, the price we paid
(an additional mutex) is not worth it.

fixes #360 and #260
2016-04-08 15:32:13 -04:00
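To illustrate the data-access pattern this fix enables, here is a minimal Go sketch (not bleve's actual Batch type): Reset() allocates new maps instead of clearing the existing ones, so a goroutine still holding the old maps never observes a concurrent write.

```go
package main

import "fmt"

// batch is an illustration of the idea in this commit, not bleve's
// implementation: Reset() allocates fresh maps rather than mutating the
// ones a reader goroutine may still hold.
type batch struct {
	ops     map[string][]byte   // queued index operations (hypothetical field)
	deletes map[string]struct{} // queued deletions (hypothetical field)
}

func newBatch() *batch {
	return &batch{
		ops:     map[string][]byte{},
		deletes: map[string]struct{}{},
	}
}

// Reset discards accumulated operations by allocating new maps; any
// goroutine that captured the previous maps can keep reading them safely.
func (b *batch) Reset() {
	b.ops = map[string][]byte{}
	b.deletes = map[string]struct{}{}
}

func main() {
	b := newBatch()
	b.ops["doc1"] = []byte("hello")
	old := b.ops // a reader may still reference the old map
	b.Reset()    // does not mutate old; it just points b.ops at a new map
	fmt.Println(len(old), len(b.ops)) // 1 0
}
```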
Marty Schoch
7892882519 fix typos 2016-04-02 21:59:30 -04:00
Marty Schoch
194ee82c80 gofmt simplifications 2016-04-02 21:54:33 -04:00
Marty Schoch
3dc64de478 moved fields requiring 64-bit alignment to start of struct
Several data structures had a pointer at the start of the struct;
on some 32-bit systems, this causes the remaining fields to no longer
be aligned on 64-bit boundaries.

The fix, identified by @pmezard, is to put the counters first in the
struct, which guarantees correct alignment.

fixes #359
2016-03-20 10:38:28 -04:00
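A minimal sketch of the alignment rule at play, using hypothetical field names rather than bleve's actual structs: Go's sync/atomic only guarantees that the first word of an allocated struct is 64-bit aligned on 32-bit platforms, so the uint64 counters go first and pointer-sized fields follow.

```go
package main

import (
	"fmt"
	"sync/atomic"
	"unsafe"
)

// stats sketches the fix: 64-bit counters used with sync/atomic sit at
// the start of the struct. On some 32-bit platforms only the first word
// of an allocated struct is guaranteed to be 64-bit aligned, so a
// pointer-sized field placed first would leave the counters misaligned
// and atomic.AddUint64 could fault.
type stats struct {
	updates uint64 // 64-bit counters first: guaranteed aligned
	deletes uint64
	name    string // pointer-sized fields afterwards
}

func main() {
	s := &stats{name: "example"}
	atomic.AddUint64(&s.updates, 1)
	fmt.Println(s.updates, unsafe.Offsetof(s.updates)) // 1 0 (offset 0)
}
```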
Steve Yen
be2800a8e4 MB-18715 - moss Merge() didn't bump bufUsed correctly
Also, allocate more memory for both the partial and full merges.
2016-03-15 17:09:40 -07:00
Marty Schoch
d7292ed891 add support for gathering stats via map for easier consumption 2016-03-07 18:37:46 -05:00
Marty Schoch
23a323bc9d add support for numPlainTextBytesIndexed metric 2016-03-05 14:05:08 -05:00
Marty Schoch
81780f97d0 add term search stats 2016-03-05 07:50:25 -05:00
Steve Yen
a29dd25a48 upside_down dict row value size accounts for large uvarints
This is somewhat unlikely, but if a term is (incredibly) popular, its
uvarint count value representation might go beyond 8 bytes.

Some KVStore implementations (like forestdb) provide a BatchEx cgo
optimization that depends on proper preallocated counting, so this
change provides a proper worst-case estimate based on the max uvarint size
of 10 bytes instead of the previously incorrect 8 bytes.
2016-02-22 11:52:51 -08:00
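For reference, encoding/binary makes the worst case explicit: a uvarint carries 7 payload bits per byte, so a full 64-bit count can need up to binary.MaxVarintLen64 (10) bytes. A small sketch:

```go
package main

import (
	"encoding/binary"
	"fmt"
	"math"
)

func main() {
	// Sizing buffers for 8 bytes under-allocates for very large counts;
	// the worst case for a 64-bit value is 10 bytes.
	buf := make([]byte, binary.MaxVarintLen64) // 10 bytes
	n := binary.PutUvarint(buf, math.MaxUint64)
	fmt.Println(binary.MaxVarintLen64, n) // 10 10
}
```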
Marty Schoch
40c95513b7 add support for including kvstore stats 2016-02-05 12:26:19 -05:00
Marty Schoch
c5dea9e882 fix accessing store via Advanced() method which was broken 2016-02-02 11:54:18 -05:00
Marty Schoch
fc34a97875 copy locations on merge for more safe/predictable behavior
fixes #328
2016-01-19 14:21:48 -05:00
Steve Yen
035d9d0e40 unneeded cast and parens 2016-01-17 00:16:05 -08:00
Marty Schoch
1335eb2a7b Merge pull request #322 from steveyen/WIP-perf-20160113
KVReader.MultiGet and KVWriter.NewBatchEx API's
2016-01-15 14:28:59 -05:00
Silvan Jegen
d326898f7b Remove unneeded brackets 2016-01-14 16:41:41 +01:00
Steve Yen
6849e538be upside_down and firestorm use new NewBatchEx() API
With this change, the upside_down batchRows() and firestorm
batchRows() now use the new KVWriter.NewBatchEx() API, which can
improve performance by reducing the number of cgo hops.
2016-01-13 23:08:20 -08:00
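The cgo-hop-reduction idea can be sketched roughly as follows; this is a hypothetical illustration of the concept only (batchOptions, batchEx, and their fields are invented here, not bleve's actual KVWriter.NewBatchEx signature): the caller counts its key/value bytes up front so the store can allocate one arena and submit the whole batch in a single crossing.

```go
package main

import "fmt"

// batchOptions is a hypothetical up-front accounting of the batch so the
// KV engine can preallocate once instead of once per mutation.
type batchOptions struct {
	TotalBytes int // counted ahead of time by the indexer
	NumSets    int
	NumDeletes int
}

// batchEx hands out slices from one preallocated arena, so an entire
// batch can cross the cgo boundary in a single call.
type batchEx struct {
	buf  []byte
	used int
}

func newBatchEx(o batchOptions) *batchEx {
	return &batchEx{buf: make([]byte, o.TotalBytes)}
}

// alloc returns the next n bytes of the shared arena; no per-row allocation.
func (b *batchEx) alloc(n int) []byte {
	s := b.buf[b.used : b.used+n]
	b.used += n
	return s
}

func main() {
	b := newBatchEx(batchOptions{TotalBytes: 64})
	k := b.alloc(4)
	copy(k, "key1")
	fmt.Println(string(k), b.used) // key1 4
}
```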
Steve Yen
fe39b3fd13 avoid fieldTermFreqs loop if no composite fields 2016-01-13 14:45:04 -08:00
Marty Schoch
af25e724f6 Merge branch 'master' of https://github.com/slavikm/bleve into slavikm-master 2016-01-13 16:10:59 -05:00
Steve Yen
0e72b949b3 upside_down batchRows() takes array of arrays
In order to spend less time in append(), this change in upside_down
(similar to another recent performance change in firestorm) builds up
an array of arrays as the eventual input to batchRows().
2016-01-11 18:11:21 -08:00
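A rough sketch of the array-of-arrays shape (illustrative names, not bleve's actual types): each analysis result keeps its own slice of rows, and batchRows-style code iterates the outer slice instead of appending everything into one growing slice.

```go
package main

import "fmt"

type row struct{ key string }

// batchRows accepts the per-document slices as-is and walks the outer
// slice, avoiding the repeated growth and copying of one merged slice.
func batchRows(rowGroups [][]row) int {
	total := 0
	for _, rows := range rowGroups {
		for range rows {
			total++ // process each row without ever merging the slices
		}
	}
	return total
}

func main() {
	groups := [][]row{
		{{key: "doc1/f1"}, {key: "doc1/f2"}},
		{{key: "doc2/f1"}},
	}
	fmt.Println(batchRows(groups)) // 3
}
```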
slavikm
680be52f87 Implemented boolean field support 2016-01-11 17:18:03 -08:00
Steve Yen
7ce7d98cba upside_down merge dictionary deltas before using batch.Merge()
This change performs more dictionary delta incr/decr math in
batchRows() instead of in the KVStore ExecuteBatch() machinery.
2016-01-11 16:52:07 -08:00
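The idea can be sketched with a plain delta map (an illustration only, not bleve's dictionary row encoding): sum the per-term increments and decrements in Go first, then issue one merge per term.

```go
package main

import "fmt"

func main() {
	// +1 for every term occurrence added, -1 for every one removed;
	// pre-summing means one merge operation per dictionary key instead
	// of one per individual change.
	deltas := map[string]int64{}
	for _, op := range []struct {
		term  string
		delta int64
	}{{"cat", 1}, {"cat", 1}, {"dog", 1}, {"cat", -1}} {
		deltas[op.term] += op.delta
	}
	for term, d := range deltas {
		if d != 0 {
			fmt.Printf("merge %q by %+d\n", term, d) // one merge per term
		}
	}
}
```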
Steve Yen
94273d5fa9 upside_down process internal rows earlier
With this change, internal rows are processed while we're waiting for
backIndex rows to be retrieved.
2016-01-11 16:25:35 -08:00
Steve Yen
bb5cd8f3d6 upside_down merge backIndexRow concurrently
Previously, the code would gather all the backIndexRows before
processing them.  This change instead merges the backIndexRows
concurrently on the theory that we might as well make progress on
compute & processing tasks while waiting for the rest of the back
index rows to be fetched from the KVStore.
2016-01-10 18:50:42 -08:00
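A minimal sketch of the receive-and-merge-as-you-go shape, with illustrative types rather than bleve's actual backIndexRow machinery: the per-row work happens inside the receive loop, so CPU-side merging overlaps with reads still in flight.

```go
package main

import "fmt"

type backIndexRow struct{ docID string }

// fetch streams rows as they become available (a stand-in for KVStore reads).
func fetch(ids []string) <-chan backIndexRow {
	out := make(chan backIndexRow)
	go func() {
		defer close(out)
		for _, id := range ids {
			out <- backIndexRow{docID: id}
		}
	}()
	return out
}

func main() {
	merged := 0
	for row := range fetch([]string{"a", "b", "c"}) {
		_ = row // merge/process each row immediately while later fetches proceed
		merged++
	}
	fmt.Println("merged", merged)
}
```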
Steve Yen
c3b5246b0c upside_down track analysis time tighter; and comments 2016-01-10 15:36:54 -08:00
Steve Yen
d3dd40d334 upside_down retrieves backindex concurrently with analysis
Start backindex reading concurrently with analysis to try to utilize
more I/O bandwidth.

The analysis time vs indexing time stats tracking is also now "off",
since there's now concurrency between those activities.

One tradeoff is that the lock area in upside_down Batch() is increased
as part of this change.
2016-01-10 15:18:28 -08:00
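Sketched in miniature (illustrative only, not upside_down's Batch() code), the overlap looks like a goroutine doing the KVStore read while the caller runs analysis, joined by a channel.

```go
package main

import "fmt"

func main() {
	backIndexDone := make(chan []string, 1)
	go func() {
		// stand-in for reading back-index rows from the KVStore
		backIndexDone <- []string{"row1", "row2"}
	}()

	// stand-in for CPU-bound document analysis happening meanwhile
	analyzed := []string{"tokenA", "tokenB", "tokenC"}

	rows := <-backIndexDone // join: both halves are now available
	fmt.Println(len(analyzed), len(rows)) // 3 2
}
```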
Steve Yen
860de28a28 fix memory leak by closing batches in batchRows() 2016-01-07 17:59:42 -08:00
Steve Yen
846912d083 upside_down udc.termVectorsFromTokenFreq rows append optimization 2016-01-07 00:48:34 -08:00
Steve Yen
8b980bd2ef firestorm avoid extra goroutine, similar to upside_down 2016-01-07 00:43:27 -08:00
Steve Yen
fbd0e7bfe9 upside_down backIndexTermEntries precalloc'ed capacity 2016-01-07 00:23:25 -08:00
Steve Yen
4eee8821f9 upside_down storeField/indexField append to provided arrays
Taking another optimization from firestorm, upside_down's
storeField()/indexField() funcs now also append() to passed-in arrays
rather than always allocating their own arrays.
2016-01-07 00:13:46 -08:00
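A small sketch of the append-to-provided-slice pattern with invented names (not upside_down's actual storeField/indexField signatures): the caller owns one buffer and each helper appends into it, so capacity is reused across fields.

```go
package main

import "fmt"

type row struct{ field, term string }

// indexField appends into the caller's slice instead of allocating its
// own, so one backing array (and its grown capacity) serves all fields.
func indexField(rows []row, field string, terms []string) []row {
	for _, t := range terms {
		rows = append(rows, row{field: field, term: t})
	}
	return rows // caller keeps the (possibly re-allocated) slice
}

func main() {
	rows := make([]row, 0, 16) // caller-owned buffer, reused per document
	rows = indexField(rows, "title", []string{"quick", "fox"})
	rows = indexField(rows, "body", []string{"lazy", "dog"})
	fmt.Println(len(rows), cap(rows)) // 4 16
}
```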
Steve Yen
1af2927967 upside_down gets analysis perf rows optimizations from firestorm 2016-01-06 23:53:13 -08:00
Steve Yen
82b8b3468e upside_down analysis converts to docIDBytes once 2016-01-06 23:38:02 -08:00
Steve Yen
89d17f01ef analyze locations only if includeTermVectors enabled
With this change, TermLocations are computed and maintained only if
includeTermVectors is enabled, for higher performance.
2016-01-05 12:46:46 -08:00
Marty Schoch
8efbd556a3 fix indexing bug with data coming from arrays
fixes #295
2015-12-21 14:59:32 -05:00
Marty Schoch
a73a178923 fix incorrect prefix search behavior
avoids double incrementing of end term when reading term dict
fixes #293
2015-12-04 14:07:16 -05:00
Patrick Mezard
e85c9c542e row: expose TermFrequencyRow term and freq fields
Row content is an implementation detail of the bleve index and may change
in the future. That said, rows also contain information valuable for
assessing the quality of the index or understanding its performance. So, as
long as we agree that type-asserting rows should only be done if you
know what you are doing and are ready to deal with future changes, I see
no reason to hide the row fields from external packages.

Fix #268
2015-11-17 17:21:26 +01:00
Marty Schoch
30651065e9 fix panic on insufficiently sized buffer
adds test case to reproduce original problem
fixes #264
2015-10-30 18:25:38 -04:00
Marty Schoch
2bd3ef4080 copy relevant k/v pairs before advancing underlying iterator 2015-10-28 12:23:54 -04:00
Marty Schoch
d1b07f4909 fix dump methods to properly copy keys and values 2015-10-28 12:06:44 -04:00
Marty Schoch
6cc21346dc fix errcheck issues 2015-10-19 14:27:03 -04:00
Marty Schoch
817c317c90 Merge branch 'master' into newkvstore 2015-10-19 12:04:07 -04:00
Marty Schoch
faceecf87b make row buffer size constant/configurable
also handle case where it is insufficiently sized
2015-10-19 12:03:38 -04:00
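A tiny sketch of the insufficiently-sized-buffer handling, with an invented constant name (not bleve's actual configuration): use the preallocated buffer when it fits, and allocate a larger one only for oversized rows.

```go
package main

import "fmt"

// defaultRowBufferSize is a hypothetical configurable default.
const defaultRowBufferSize = 1024

// encodeKey writes into the provided buffer, growing it only when a
// particular key is larger than the preallocated size.
func encodeKey(buf []byte, key string) []byte {
	if len(key) > len(buf) {
		buf = make([]byte, len(key)) // fall back for the oversized case
	}
	n := copy(buf, key)
	return buf[:n]
}

func main() {
	buf := make([]byte, defaultRowBufferSize)
	small := encodeKey(buf, "t/title/quick")
	fmt.Println(len(small), cap(buf) >= defaultRowBufferSize) // 13 true
}
```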
Marty Schoch
c9471d5739 Merge pull request #244 from kevgs/master
reducing allocation count
2015-10-16 15:51:30 -04:00
Marty Schoch
e6d0fc8d95 Merge pull request #247 from pmezard/remove-update-goroutine
upside_down: no need for a goroutine to enqueue AnalysisWork
2015-10-16 10:15:55 -04:00
Marty Schoch
4c6bc23043 rewrite to keep using same buffer when possible 2015-10-13 14:04:56 -07:00
Marty Schoch
8de860bf12 2 more places that used old Key() 2015-10-13 12:35:08 -07:00
Marty Schoch
5f594d1acc Merge branch 'master' into newkvstore 2015-10-12 18:07:04 -07:00
Marty Schoch
08572e4925 move literals outside loop for more predictable test results 2015-10-12 18:06:38 -07:00
Patrick Mezard
8c928539ee upside_down: no need for a goroutine to enqueue AnalysisWork
It boils down to:
1. client sends some work and a notification channel to a single worker,
   then waits.
2. worker processes the work
3. worker sends the result to the client using the notification channel

I do not see any problem with this, even with unbuffered channels.
2015-10-12 10:42:14 +02:00
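The three steps above map onto a very small Go sketch (illustrative types, not bleve's AnalysisWork): the client sends the work plus a reply channel and blocks on it, while a single worker goroutine processes and replies. No extra enqueue goroutine is needed, even with unbuffered channels.

```go
package main

import "fmt"

// work pairs a unit of work with the channel the worker should reply on.
type work struct {
	doc   string
	reply chan string
}

// worker processes items and notifies the client on its reply channel
// (steps 2 and 3 of the pattern described above).
func worker(queue <-chan work) {
	for w := range queue {
		w.reply <- "analyzed:" + w.doc
	}
}

func main() {
	queue := make(chan work)
	go worker(queue)

	w := work{doc: "doc1", reply: make(chan string)}
	queue <- w             // step 1: send work + notification channel
	fmt.Println(<-w.reply) // client waits for the result
}
```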
Marty Schoch
95e06538f3 fix benchmarks for the x kvstores 2015-10-09 11:09:42 -04:00