bleve

Author	SHA1	Message	Date
Steve Yen	d5de1d3da1	metrics implements BatchEx correctly	2016-01-21 11:00:41 -08:00
Marty Schoch	1335eb2a7b	Merge pull request #322 from steveyen/WIP-perf-20160113 KVReader.MultiGet and KVWriter.NewBatchEx API's	2016-01-15 14:28:59 -05:00
opennota	8517feb1c6	Fix some typos	2016-01-15 05:46:27 +07:00
Steve Yen	d94ccf2d74	added KVWriter.NewBatchEx() method	2016-01-13 16:19:04 -08:00
Steve Yen	fb048f6c64	added KVReader.MultiGet() method	2016-01-13 15:12:10 -08:00
Steve Yen	70105477cf	added Close() method to KVBatch interface	2016-01-07 17:54:21 -08:00
Marty Schoch	ff11f83842	properly handle errors inside metrics kvstore reporting	2015-11-24 12:52:03 -05:00
Kosov Eugene	45e670b99b	BoltDB wrapper nano optimization which makes code a bit prettier too	2015-11-05 00:27:28 +03:00
Marty Schoch	1a978a4591	fix go vet issues and cleanup reader/iterator	2015-10-26 16:41:58 -04:00
Marty Schoch	f0d282f5f8	add test case for seeing prefix iterators outside of range similar to #256 except for prefix iterators includes fix for boltdb and gtreap which had incorrect behavior	2015-10-26 16:14:29 -04:00
Patrick Mezard	da72d0c2b9	store_test: deduplicate store initialization	2015-10-20 19:21:01 +02:00
Patrick Mezard	873f483804	gtreap: RangeIterator.Seek should not move before start	2015-10-20 19:12:30 +02:00
Patrick Mezard	5d7628ba3b	boltdb: fix RangeIterator outside of range seeks Two issues: - Seeking before i.start and iterating returned keys before i.start - Seeking after the store last key did not invalidate the iterator and could cause infinite loops.	2015-10-20 19:09:51 +02:00
Patrick Mezard	aada2e7333	store_test: test RangeIterator.Seek on goleveldb	2015-10-20 19:09:38 +02:00
Marty Schoch	f0ee9a3c66	removed commented code and unused functions	2015-10-19 11:13:03 -04:00
Marty Schoch	0f05d1d3ca	Merge branch 'master' into newkvstore	2015-10-09 10:33:41 -04:00
Patrick Mezard	9d5407be13	boltdb: add "nosync" option to force boltdb.DB.NoSync=true Use this option when rebuilding indexes from scratch. In my small case (~17000 json documents), it reduces indexing from 520s to 250s. I did not add any test, short of forced indexing termination it only has performance effects, which are hard to test. And unknown options are currently ignored. Issue #240	2015-10-03 14:26:48 +02:00
Marty Schoch	d06b526cbf	more refactoring	2015-09-28 16:50:27 -04:00
Marty Schoch	900f1b4a67	major kvstore interface and impl overhaul clarified the interface contract	2015-09-23 11:25:47 -07:00
Marty Schoch	f81b2be334	major refactor of bleve configuration see #221 for full details	2015-09-16 17:10:59 -04:00
Marty Schoch	7ad7659ce5	add support for using null kvstore outside of bleve internals	2015-09-02 11:50:06 -04:00
Marty Schoch	07d37ca38a	add important rocksdb config options	2015-09-02 11:49:42 -04:00
Marty Schoch	3e60ca24ec	support using end key on forestdb iterator for term freq lookup also additoanl forestdb configs	2015-08-18 16:22:02 -04:00
Marty Schoch	d3dda3d0ea	fixup config parsing and add new options	2015-08-12 13:18:23 -04:00
Marty Schoch	bf8dcae76b	removing build tags	2015-07-28 18:59:10 -04:00
Marty Schoch	17ef48f82a	switching back to the canonical goleveldb repo	2015-07-08 12:21:17 -06:00
Marty Schoch	bf80f4628e	fix bug in curent goleveldb (must copy during iteration) also changed over to mschoch fork of goleveldb (temporary) the change to my fork is pending some read-only issues described here: https://github.com/syndtr/goleveldb/issues/111 hopefully we can find a path forward, and get that addressed upstream	2015-07-06 18:00:05 -04:00
Marty Schoch	7be7ecdf8e	fix batch indexing bug, incremented docCount before commit fixes #211	2015-06-08 14:14:05 -04:00
Marty Schoch	a52d3b5c07	put in hack to allow boltdb reader isolation test to pass in boltdb, long readers MAY block a writer. in particular if the write requires additional allocation, it must acquire a lock already held by the reader. in general this is not a problem for bleve (though it can affect performance in some cases), but it is a problem for the reader isolation test. this commit adds a hack to try and avoid the need for additional allocation closes #208	2015-05-21 11:39:59 -04:00
Steve Yen	e98ae8ab71	update metrics store to latest kvstore api	2015-04-27 11:01:53 -07:00
Marty Schoch	16f538d7b7	close documents returned by iterator before losing their reference fixes #194	2015-04-24 17:48:21 -04:00
Marty Schoch	b54a59139c	change forestdb imports to couchbase not couchbaselabs	2015-04-24 17:35:01 -04:00
Marty Schoch	452fea6a24	adding initial impl of rocksdb kv store	2015-04-24 17:19:44 -04:00
Marty Schoch	a9c07acbfa	refactor of kvstore api to support native merge in rocksdb refactor to share code in emulated batch refactor to share code in emulated merge refactor index kvstore benchmarks to share more code refactor index kvstore benchmarks to be more repeatable	2015-04-24 17:13:50 -04:00
indraniel	a62320a50e	+ fix goleveldb's BytesSafeAfterClose() on reader - it should be set to false	2015-04-10 15:45:22 -05:00
Marty Schoch	5f66bd84c7	fix issues identified by errcheck	2015-04-10 14:59:05 -04:00
indraniel	54ab493b3e	+ correctly copy bytes from the goleveldb store - this is part of a recent bleve KVStore API change. See the following two google group threads for more details: * [help adding goleveldb as an alternative Key/Value store for bleve][1] * [bleve search performance improvement][2] [1]: https://groups.google.com/forum/#!topic/bleve/aHZ8gmihLiY [2]: https://groups.google.com/forum/#!topic/bleve/aTyqsSnbhik	2015-04-10 11:25:23 -05:00
indraniel	81bef38cce	Revert "+ make copies of the []bytes returned by goleveldb" This reverts commit cb8c1741289a0f00b30733e0d52d9d81d1199603. This commit is no longer desired. The KV store API has been changed to better address this issue. For more details, see the google group conversation thread at: https://groups.google.com/forum/#!topic/bleve/aHZ8gmihLiY	2015-04-10 11:12:44 -05:00
indraniel	3a70401835	+ make copies of the []bytes returned by goleveldb - The byte strings returned by goleveldb aren't necessarily safe. See the following google group thread: https://groups.google.com/forum/#!topic/bleve/aHZ8gmihLiY This code change is based on the gist created here: https://groups.google.com/forum/#!topic/bleve/aHZ8gmihLiY	2015-04-10 11:08:02 -05:00
indraniel	a0a2a61050	+ keep 'get' consistent with levigo implementation - this change keeps the method behavior consistent with the levigo/leveldb implementation. - don't issue an err if a key isn't found	2015-04-10 11:08:02 -05:00
indraniel	5e55fa2866	+ keep 'getWithSnapshot' consistent with levigo implementation - this change keeps the method behavior consistent with the levigo/leveldb implementation. - the leveldb store_test.go and goleveldb store_test.go are now identical.	2015-04-10 11:08:02 -05:00
indraniel	caa19e6c36	+ initial stub of goleveldb package - This is a first-pass introduction. Things may not be working correctly yet.	2015-04-10 11:08:02 -05:00
Marty Schoch	ab24772bf0	fix issues identified by errcheck part of #169	2015-04-07 16:34:29 -04:00
Marty Schoch	56c4a09de1	fix issues identified by errcheck part of #169	2015-04-07 15:39:56 -04:00
Marty Schoch	93e01a803e	fix issues identified by errcheck part of #169	2015-04-07 14:52:00 -04:00
Marty Schoch	dd921d31e3	undoing `f92ab131e4` we now guarantee bytes were copied earlier in the chain the kv store is NOT responsible for making an additional copy closes #181	2015-04-07 11:12:28 -04:00
Marty Schoch	443c0252e0	fix another metrics BytesSafeAfterClose() loop closes #184	2015-04-03 21:17:23 -04:00
Steve Yen	efc39a6857	fix metrics BytesSafeAfterClose() loop fixes issue 184	2015-04-03 16:36:32 -07:00
Marty Schoch	867110e03b	major improvements to index row encoding improvements uncovered some issues with how k/v data was copied or not. to address this, kv abstraction layer now lets impl specify if the bytes returned are safe to use after a reader (or writer since writers are also readers) are closed See index/store/KVReader - BytesSafeAfterClose() bool false is the safe value if you're not sure it will cause index impls to copy the data Some kv impls already have created a copy a the C-api barrier in which case they can safely return true. Overall this yields ~25% speedup for searches with leveldb. It yields ~10% speedup for boltdb. Returning stored fields is now slower with boltdb, as previously we were returning unsafe bytes.	2015-04-03 16:50:48 -04:00
Steve Yen	dbf50b7f29	KVStore gtreap allows only 1 writer at a time	2015-03-26 16:40:18 -07:00
Steve Yen	f92ab131e4	KVStore gtreap implementation copies value bytes	2015-03-26 14:46:37 -07:00
Steve Yen	78453dab7d	metrics KVStore now tracks last 100 errors	2015-03-19 18:41:16 -07:00
Steve Yen	38ee9be353	added some batch size 1000 microbenchmarks	2015-01-30 15:58:39 -08:00
Steve Yen	7d6a6aeaa8	single append for inmem KVStore batch	2015-01-29 11:14:08 -08:00
Steve Yen	5a30d36b17	cznicb KVStore uses Put() for faster read-modify-write	2015-01-29 11:02:01 -08:00
Steve Yen	b054cddf76	gtreap KVStore does 1 append for batch Set/Delete	2015-01-29 10:49:39 -08:00
Steve Yen	05d222f490	cznicb KVStore batch uses <2 appends per Set/Delete	2015-01-29 10:22:13 -08:00
Steve Yen	c5c59e61f4	make leveldb faster with non-zero sized batch	2015-01-29 10:20:26 -08:00
Steve Yen	1c1774d4ad	throw away data even faster in null KVStore	2015-01-29 10:17:21 -08:00
Steve Yen	782ad94e01	added debug tag for metrics KVStore	2015-01-16 11:18:40 -08:00
Marty Schoch	eebc8e7825	more debuging around forestdb snapshots	2015-01-16 14:18:28 -05:00
Marty Schoch	ba978ea27e	improving log messages	2015-01-16 14:07:47 -05:00
Marty Schoch	09fe749913	default to autocompaction for forestdb	2015-01-16 13:35:43 -05:00
Steve Yen	12dc2aff93	add go1.4 build tag to cznicb KVStore This is because github.com/cznic/b depends on sync.Pool.	2015-01-15 15:54:25 -08:00
Steve Yen	11ee0209ad	no leading zeros for metrics CSV output	2015-01-15 15:09:53 -08:00
Steve Yen	202191201c	added WriteCSV() to metrics KVStore	2015-01-15 14:11:15 -08:00
Steve Yen	9be4e217bc	metrics KVStore tracks perf metrics on a wrapped KVStore	2015-01-15 11:42:41 -08:00
Steve Yen	ea0a8657f3	added cznicb in-memory kvstore (no reader isolation)	2015-01-13 17:35:28 -08:00
Marty Schoch	362d240b09	added configurable options to leveldb	2015-01-13 16:24:51 -05:00
Steve Yen	d6e6f655c9	initialize forestdb config if provided	2015-01-13 12:03:24 -08:00
Steve Yen	1fa80ffc40	pass config to forestdb Open()	2015-01-13 11:04:02 -08:00
Steve Yen	3a00a968f2	close levigo's read & write options	2015-01-12 18:42:19 -08:00
Steve Yen	c20726bb93	close levigo.Options when db is closed	2015-01-12 18:42:19 -08:00
Steve Yen	603c3af8bb	added gtreap in-memory, copy-on-write KVStore	2015-01-12 11:26:21 -08:00
Steve Yen	ae3600aeea	expose forestdb rollback methods	2015-01-06 18:59:02 -08:00
Steve Yen	5467e0a385	forestdb registered name fixed	2015-01-06 17:36:05 -08:00
Marty Schoch	38bdcbeb62	update to new forestdb iterator api	2014-12-27 13:15:14 -08:00
Silvan Jegen	ef18dfe4cd	Fix typos in comments and strings	2014-12-18 18:43:12 +01:00
Silvan Jegen	412049d63c	Remove unneeded import statements	2014-11-29 14:25:24 +01:00
Marty Schoch	6c7237ade9	added test for null kvstore	2014-11-26 15:50:57 -05:00
Marty Schoch	8ad0f64459	upgrade to current forestdb api	2014-11-25 21:52:35 -05:00
Marty Schoch	d5c1f4a9ab	refactored store tests	2014-11-25 21:52:23 -05:00
Marty Schoch	47bc7caec3	added getRollbackID() and rollbackTo() to the ForestDB store	2014-11-04 08:34:49 -05:00
Marty Schoch	3f83149ed3	adding back the forestdb kv store impl	2014-10-31 09:42:32 -04:00
Marty Schoch	c7443fe52b	refactored API a bit more things can return error now in a couple of places we had to swallow errors because they didn't fit the existing API. in these case and proactively in a few others we now return error as well. also the batch API has been updated to allow performing set/delete internal within the batch	2014-10-31 09:40:23 -04:00
Marty Schoch	97902e2619	text analysis now moved out of index write lock onto goroutine 1. text analysis is now done before the write lock is acquired 2. there is now a pool of analysis workers 3. the size of this pool is configurable 4. this allows for documents in a batch to be analyzed concurrently as a part of benchmarking these changes i've also introduce a new null storage implementation. this should never be used, as it does not actualy build an index. it does however let us go through all the normal indexing machinery, without incuring any indexing I/O. this is very helpful in measuring improvements made to the text analsysis pipeline, which are often overshadowed by indexing times in benchmarks actually building an index.	2014-09-24 08:13:14 -04:00
Marty Schoch	198ca1ad4d	major refactor of kvstore/index internals, see below In the index/store package introduce KVReader creates snapshot all read operations consistent from this snapshot must close to release introduce KVWriter only one writer active access to all operations allows for consisten read-modify-write must close to release introduce AssociativeMerge operation on batch allows efficient read-modify-write for associative operations used to consolidate updates to the term summary rows saves 1 set and 1 get op per shared instance of term in field In the index package introduced an IndexReader exposes a consisten snapshot of the index for searching At top level All searches now operate on a consisten snapshot of the index	2014-09-12 17:21:35 -04:00
Marty Schoch	2294b24b9d	remove forestdb for now not any benfefit in maintaining this for the time being	2014-09-12 16:55:11 -04:00
Marty Schoch	e21935f850	another round of golint cleanup	2014-09-03 19:16:46 -04:00
Marty Schoch	e1b77956d4	more golint cleanups	2014-09-03 18:47:02 -04:00
Marty Schoch	45e1b2dfc6	removing gouchstore store impl this implementation didn't really adhere to the contract and now that we have boltdb we have a better pure go impl	2014-09-02 13:56:35 -04:00
Marty Schoch	7a7eb2e94c	add newline between license and package this avoids cluttering godocs with the license	2014-09-02 10:54:50 -04:00
Marty Schoch	1161361bea	rename imports from couchbaselabs to blevesearch	2014-08-28 15:38:57 -04:00
Marty Schoch	ef59abe4c9	added build tag 'leveldb' to enable this kv store by default we now use the pure go boltdb kv store it is less tested at this point but appears to work test pass, and moves us closer to the goal of being able to just "go get" bleve	2014-08-25 15:18:24 -04:00
Marty Schoch	8bcf6adb60	changed close of read only tx to Rollback from Commit i was seeing deadlocks before this change using Rollback to close read only tx is what the built-in View() impl does, so i think its safe	2014-08-25 15:11:21 -04:00
Marty Schoch	d67ee483ba	change default bucket name to bleve	2014-08-25 15:11:04 -04:00
Marty Schoch	e7a8a1fbe6	fixing test	2014-08-25 12:34:16 -04:00
deoxxa	a993fa4f74	add boltdb storage type	2014-08-24 18:37:56 +10:00
Marty Schoch	27f001bc14	overhauled top-level New/Open API New is now used to create new indexes Open is used to open existing indexes calls to Open no longer specify a mapping because the mapping is serialized and stored along with the index	2014-08-20 16:58:20 -04:00
Marty Schoch	216767953c	introduced a config option to disable creating indexes if they don't already exist closes #23 and closes #24	2014-07-30 14:29:26 -04:00

1 2 3 4

159 Commits