Rather than appending all received rows into a flat []IndexRow during
the result-gathering loop, this change collects the analysis result
rows into a [][]IndexRow, which avoids extra copying.
As part of this, firestorm batchRows() now takes the [][]IndexRow as
its input.
The new analyzeField() helper func is used for both regular fields and
for composite fields.
With this change, all analysis is done up front, for both regular
fields and composite fields.
After analysis, this change counts up the total row capacity needed and
extends the AnalysisResult.Rows in one shot, as opposed to the previous
approach of dynamically growing the array with repeated append() calls.
Also, in this change, the TermFreqRow for _id is added first, which
seems more correct.
This uses the "backing array" technique to allocate many TermFreqRows
up front in firestorm.indexField(), instead of the previous one-by-one,
as-needed TermFreqRow allocation approach.
Results from the null-firestorm bleve-blast micro-benchmark show this
change producing a ~0.5 MB/sec improvement.
Previously, firestorm.Batch() would notify the lookuper goroutine on a
document-by-document basis. If the lookuper input channel became full,
that would block the firestorm.Batch() operation.
With this change, lookuper is notified once, with a "batch" that is an
[]*InFlightItem.
This change also reuses that same []*InFlightItem to invoke the
compensator.MutateBatch().
This also has the advantage of converting the docIDs from string to
[]byte just once, outside of the lock used by the compensator.
A micro-benchmark of this change with null-firestorm bleve-blast does
not show a large impact, neither degradation nor improvement.
After this change, with null kvstore micro-benchmark...
GOMAXPROCS=8 ./bleve-blast -source=../../tmp/enwiki.txt \
-count=100000 -numAnalyzers=8 -numIndexers=8 \
-config=../../configs/null-firestorm.json -batch=100
Then the TermFreqRow key and value methods disappear as large boxes
from the CPU profile graphs.
benchmark           old ns/op     new ns/op     delta
BenchmarkBatch-4    16950972      16377194      -3.38%

benchmark           old allocs    new allocs    delta
BenchmarkBatch-4    136164        136161        -0.00%

benchmark           old bytes     new bytes     delta
BenchmarkBatch-4    7168872       7109691       -0.83%

benchmark           old ns/op     new ns/op     delta
BenchmarkBatch-4    20738739      17047158      -17.80%

benchmark           old allocs    new allocs    delta
BenchmarkBatch-4    136423        136160        -0.19%

benchmark           old bytes     new bytes     delta
BenchmarkBatch-4    20277781      7168772       -64.65%