bleve

Author	SHA1	Message	Date
Sreekanth Sivasankaran	debbcd7d47	adding maxsegment size limit checks	2018-03-13 17:35:54 +05:30
Steve Yen	dbfc5e9130	scorch zap reuse interim freq/norm/loc slices	2018-03-12 10:04:11 -07:00
Steve Yen	07901910e2	scorch zap reuse roaring Bitmap in prepareDicts() slice growth In this change, if the postings/postingsLocs slices need to be grown, then copy over and reuse any of the preallocated roaring Bitmap's from the old slice.	2018-03-12 09:19:38 -07:00
Steve Yen	b1f3969521	scorch zap reuse roaring Bitmap in postings lists	2018-03-12 09:18:11 -07:00
Steve Yen	cad88096ca	scorch zap reuse roaring Bitmap during merge	2018-03-12 09:17:37 -07:00
Steve Yen	c4ceffe584	scorch zap sync Pool for interim data	2018-03-12 09:17:37 -07:00
Steve Yen	531800c479	scorch zap use roaring Add() instead of AddInt() This change invokes Add() directly as AddInt() is a convenience wrapper around Add().	2018-03-12 09:17:37 -07:00
Sreekanth Sivasankaran	f9545bef2f	Merge pull request #800 from blevesearch/numsnapshots_config making NumSnapshotsToKeep configurable	2018-03-12 20:59:03 +05:30
Sreekanth Sivasankaran	90aa91105a	handling only int, float64 values	2018-03-12 20:24:51 +05:30
Steve Yen	6df6a036d8	Merge pull request #817 from steveyen/zap-no-longer-uses-mem-segment scorch zap no longer uses mem segment	2018-03-12 07:54:10 -07:00
Steve Yen	2a20a36e15	scorch zap optimimze to avoid bitmaps for 1-hit posting lists This commit avoids creating roaring.Bitmap's (which would have just a single entry) when a postings list/iterator represents a single "1-hit" encoding.	2018-03-10 06:33:09 -08:00
Steve Yen	5abf7b7a19	scorch zap remove mem.Segment usage from persist / build.go	2018-03-09 15:23:58 -08:00
Steve Yen	eade78be2f	scorch zap unit tests no longer use mem.Segment	2018-03-09 15:23:58 -08:00
Steve Yen	e82774ad20	scorch zap AnalysisResultsToSegmentBase() AnalysisResultsToSegmentBase() allows analysis results to be directly converted into a zap-encoded SegmentBase, which can then be introduced onto the root, avoiding the creation of mem.Segment data structures. This leads to some reduction of garbage memory allocations. The grouping and sorting and shaping of the postings list information is taken from the mem.Segment codepaths. The encoding of stored fields reuses functions from zap's merger, which has the largest savings of garbage memory avoidance. And, the encoding of tf/loc chunks, postings & dictionary information also follows the approach used by zap's merger, which also has some savings of garbage memory avoidance. In future changes, the mem.Segment dependencies will be removed from zap, which should result in a smaller codebase.	2018-03-09 15:22:30 -08:00
Steve Yen	3884cf4d12	scorch zap writePostings() helper func refactored out	2018-03-09 13:29:28 -08:00
Sreekanth Sivasankaran	b04909d3ee	adding the integer parser utility	2018-03-09 11:05:17 +05:30
Steve Yen	25beba615d	scorch mem processDocument reuses fieldLens/docMap arrays This change produces less garbage by switching from a map[uint16]'s to array's for the fieldLens and docMap, and then reusing those arrays across multiple processDocument() calls.	2018-03-08 13:04:51 -08:00
Steve Yen	eac9808990	scorch zap optimize FST val encoding for terms with 1 hit NOTE: this is a scorch zap file format change / bump to version 4. In this optimization, the uint64 val stored in the vellum FST (term dictionary) now may either be a uint64 postingsOffset (same as before this change) or a uint64 encoding of the docNum + norm (in the case where a term appears in just a single doc).	2018-03-08 09:19:54 -08:00
Steve Yen	1e2bb14f13	added TestRoaringSizes()	2018-03-07 10:53:24 -08:00
Steve Yen	0ec4a1935a	Merge pull request #808 from steveyen/more-scorch-optimizing err fix and more scorch optimizing	2018-03-07 10:39:20 -08:00
Abhinav Dangeti	06be1ad72e	Merge pull request #806 from abhinavdangeti/master Fixing the scorch search request memory estimate	2018-03-07 10:11:24 -08:00
Steve Yen	2b5da7a819	go fmt	2018-03-07 09:12:55 -08:00
Steve Yen	59eb70d020	scorch zap remove unused chunkedIntCoder field	2018-03-07 09:11:10 -08:00
Steve Yen	79f28b7c93	scorch fix persistDocValues() err return	2018-03-07 09:11:10 -08:00
Steve Yen	8c0f402d4b	scorch zap optimize processDocument() loc inner loop	2018-03-07 09:11:10 -08:00
Steve Yen	15242af465	Merge pull request #805 from steveyen/optimize-scorch-mem-processField Optimize scorch processField() inner loop and writeRoaringWithLen()	2018-03-07 09:09:57 -08:00
Sreekanth Sivasankaran	73ed8e248d	fixing the indentation issues. looks like it happened during the web based conflict resolution..	2018-03-07 18:34:54 +05:30
Sreekanth Sivasankaran	e0369a3553	Merge branch 'master' into compaction_bytes_stats	2018-03-07 14:47:33 +05:30
Sreekanth Sivasankaran	2a9739ee1b	naming change, interface removal	2018-03-07 14:43:33 +05:30
abhinavdangeti	5c721226cf	Fixing the scorch search request memory estimate Do not re-account for certain referenced data in the zap structures. New estimates: ESTIMATE BENCHMEM TermQuery 11396 12437 MatchQuery 12244 12951 DisjunctionQuery (Term queries) 20644 20709	2018-03-06 16:03:10 -08:00
Steve Yen	8841d79d26	scorch optimize mem processField inner-loop	2018-03-06 15:26:54 -08:00
Steve Yen	dde6c2e01b	scorch zap optimize writeRoaringWithLen() Before this change, writeRoaringWithLen() would leverage a reused bytes.Buffer (#A) and invoke the roaring.WriteTo() API. But, it turns out the roaring.WriteTo() API has a suboptimal implementation, in that underneath-the-hood it converts the roaring bitmap to a byte buffer (using roaring.ToBytes()), and then calls Write(). But, that Write() turns out to be an additional memcpy into the provided bytes.Buffer (#A). By directly invoking roaring.ToBytes(), this change to writeRoaringWithLen() avoids the extra memory allocation and memcpy.	2018-03-06 14:59:20 -08:00
Steve Yen	b62ca996f6	scorch zap optimize chunkedIntCoder.Add() calls to use multiple vals This change leverages the ability for the chunkedIntCoder.Add() method to accept multiple input param values (via the '...' param signature), meaning there are fewer Add() invocations.	2018-03-06 14:11:41 -08:00
abhinavdangeti	38b6c522b0	Address build breakage after rebase Removed attribute: iterator of type Posting	2018-03-06 14:00:54 -08:00
abhinavdangeti	7e36109b3c	MB-28162: Provide API to estimate memory needed to run a search query This API (unexported) will estimate the amount of memory needed to execute a search query over an index before the collector begins data collection. Sample estimates for certain queries: {Size: 10, BenchmarkUpsidedownSearchOverhead} ESTIMATE BENCHMEM TermQuery 4616 4796 MatchQuery 5210 5405 DisjunctionQuery (Match queries) 7700 8447 DisjunctionQuery (Term queries) 6514 6591 ConjunctionQuery (Match queries) 7524 8175 Nested disjunction query (disjunction of disjunctions) 10306 10708 …	2018-03-06 13:53:42 -08:00
Steve Yen	5b86da85f3	scorch zap optimize postings itr with tf/loc reader/decoder reuse	2018-03-06 13:30:59 -08:00
Steve Yen	530a3d24cf	scorch zap optimize merge by byte copying freq/norm/loc's This change adds a zap PostingsIterator.nextBytes() method, which is similar to Next(), but instead of returning a Posting instance, nextBytes() returns the encoded freq/norm and location byte slices. The zap merge code then provides those byte slices directly to the intCoder's via a new method, intCoder.AddBytes(), thereby avoiding having to encode many uvarint's.	2018-03-06 13:30:59 -08:00
Steve Yen	655268bec8	scorch zap postings iterator nextDocNum() helper method Refactored out a nextDocNum() helper method from Next() that future optimizations can use.	2018-03-06 07:55:26 -08:00
Sreekanth Sivasankaran	fa5de8e09a	making NumSnapshotsToKeep configurable	2018-03-06 16:22:11 +05:30
Steve Yen	502e64c256	scorch zap Posting doesn't use iterator field	2018-03-05 16:33:13 -08:00
Steve Yen	8f8fd511b7	scorch zap access freqs[offset] outside loop	2018-03-05 12:02:33 -08:00
Steve Yen	a338386a03	scorch build optimize freq/loc slice capacity	2018-03-05 12:02:33 -08:00
Steve Yen	856778ad7b	scorch zap build prealloc docNumbers capacity	2018-03-05 12:02:33 -08:00
Steve Yen	8c0881eab2	scorch zap build reuses mem postingsList/Iterator structs	2018-03-05 12:02:33 -08:00
Steve Yen	85761c6a57	go fmt	2018-03-05 12:02:33 -08:00
Steve Yen	d44c5ad568	scorch stats MaxBatchIntroTime bug fix and more timing stats Added timing stats for in-mem zap merging and file-based zap merging.	2018-03-05 12:02:33 -08:00
Sreekanth Sivasankaran	395b0a312d	adding UTs	2018-03-05 17:02:58 +05:30
Sreekanth Sivasankaran	dec265c481	adding compaction_written_bytes/sec stats to scorch	2018-03-05 16:32:57 +05:30
Steve Yen	884da6f93a	scorch optimize mem processDocument() norm calculation This change moves the norm calculation outside of the inner loop.	2018-03-03 11:58:30 -08:00
Steve Yen	6ae799052a	scorch mem optimize processDocument() stored field	2018-03-03 11:52:33 -08:00

1 2 3 4 5 ...

719 Commits