Commit Graph

1439 Commits

Author SHA1 Message Date
Marty Schoch 00722aa299 disable http unit test which relied on debug functionality 2017-12-11 15:38:44 -05:00
Marty Schoch d7eb223e14 remove bolt segment format
upcoming breaking changes and no desire to maintain
2017-12-11 10:20:26 -05:00
Marty Schoch eada7b209b fix test issue identified by sreekanth 2017-12-11 10:16:56 -05:00
Marty Schoch 8280859bb8 handle read-only and in-mem only cases 2017-12-11 09:07:01 -05:00
Marty Schoch e8cc7ac0bf add new fields command to zap cmd-line util 2017-12-11 09:05:50 -05:00
Marty Schoch 690cd39921 add crazy slow but functional DocumentVisitFieldTerms 2017-12-10 08:55:59 -05:00
Marty Schoch dc0adc8827 add fsync 2017-12-09 20:52:01 -05:00
Marty Schoch e0d9828cd0 add more detail to the readme 2017-12-09 14:42:36 -05:00
Marty Schoch 414899618b switch from bolt format to zap in the persister 2017-12-09 14:28:50 -05:00
Marty Schoch 9781d9b089 add initial version of zap file format 2017-12-09 14:28:33 -05:00
Marty Schoch ff2e6b98e4 added empty segment 2017-12-09 12:43:02 -05:00
Marty Schoch e470105635 fix issues identified by errcheck 2017-12-06 18:36:14 -05:00
Marty Schoch adac4f41db initial version of scorch which persists index to disk 2017-12-06 18:33:47 -05:00
Marty Schoch b1346b4c8a add readme describing our use of bolt as a segment format 2017-12-05 16:09:00 -05:00
Marty Schoch 898a6b1e85 fix errcheck issues 2017-12-05 13:32:57 -05:00
Marty Schoch ece27ef215 adding initial version of bolt persisted segment 2017-12-05 13:05:12 -05:00
Marty Schoch f6be841668 add test for postings list count method 2017-12-05 13:01:36 -05:00
Marty Schoch 30e9d6daa5 add better testing of array positions 2017-12-05 12:54:44 -05:00
Marty Schoch 8d9d45115f add test of location field 2017-12-05 12:20:06 -05:00
Marty Schoch 8f0350865b add test for segment fields method 2017-12-05 12:17:56 -05:00
Marty Schoch 7a6b5483f2 add validation that all locations were seen 2017-12-05 11:58:05 -05:00
Marty Schoch e08fdab54a remove todo item 2017-12-05 10:13:27 -05:00
Marty Schoch 87e2627551 added dictionary tests to mem segment 2017-12-05 09:49:41 -05:00
Marty Schoch ed067f45dd added Close() method to Segment 2017-12-05 09:31:02 -05:00
Marty Schoch 22ffc8940e update segment API to return error in key places 2017-12-04 18:06:06 -05:00
Marty Schoch b74cf4b081 add copyright header to all new files in scorch 2017-12-01 15:42:50 -05:00
Marty Schoch 89aa02cf5b fix highlighting of composite fields
updated log statements for refactored names
2017-12-01 15:12:08 -05:00
Marty Schoch cff14f1212 fix crash in DocNumbers when segment is empty 2017-12-01 09:50:27 -05:00
Marty Schoch eb256f78bc switch to constant referring to id field id 0
this avoids potentially mutating something that is intended
to be immutable
2017-12-01 09:30:07 -05:00
Marty Schoch 7c964de8bf switch to binary search for finding segment from global doc num
added unit tests for this function specifically
2017-12-01 09:26:51 -05:00
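
The binary search described in the commit above could look roughly like the sketch below. It is not the code from that commit: the function name `segmentForDocNum` and the `offsets` layout (the global doc number of the first document in each segment, ascending, starting at 0) are assumptions made for illustration.

```go
package main

import "sort"

// segmentForDocNum maps a global document number to the index of the
// segment that holds it and to the document's segment-local number.
//
// Assumed (hypothetical) layout: offsets[i] is the global doc number of
// the first document in segment i, sorted ascending, with offsets[0] == 0.
// Returns (-1, 0) when no segment can contain docNum.
func segmentForDocNum(offsets []uint64, docNum uint64) (int, uint64) {
	// Find the first segment whose starting offset is past docNum;
	// the segment just before it is the one that contains docNum.
	i := sort.Search(len(offsets), func(i int) bool {
		return offsets[i] > docNum
	})
	if i == 0 {
		// Empty slice, or offsets[0] > docNum (violates the assumption).
		return -1, 0
	}
	seg := i - 1
	return seg, docNum - offsets[seg]
}
```

Searching for the first offset greater than the doc number also skips empty segments, since an empty segment shares its starting offset with the segment after it. The sketch has no upper bound check: a doc number past the end of the last segment is reported as belonging to the last segment.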
Marty Schoch c2047dcdf9 refactor doc id reader creation to share more code
fix issue identified by steve
2017-12-01 08:54:39 -05:00
Marty Schoch bcd4bdc3d1 added initial bolt thought to README 2017-12-01 07:27:04 -05:00
Marty Schoch 395458ce83 refactor to make mem segment contents exported 2017-12-01 07:26:47 -05:00
Marty Schoch f521d80835 Merge pull request #645 from steveyen/scorch
scorch InternalID() handles case of unknown docId
2017-12-01 07:21:26 -05:00
Steve Yen 398dcb19b3 scorch introducer uses the roaring.Or(x, y) API
Instead of cloning an input bitmap, the roaring.Or(x, y)
implementation fills a brand new result bitmap, which should allow
for more efficient packing and memory utilization.
2017-11-30 10:37:10 -08:00
Steve Yen 67986d41bf scorch InternalID() handles case of unknown docId 2017-11-30 08:36:01 -08:00
Marty Schoch 848aca4639 fix issues identified by errcheck 2017-11-29 13:34:15 -05:00
Marty Schoch 23f6dc1cc6 working in-memory version 2017-11-29 11:33:35 -05:00
Marty Schoch 6eea5b78da Merge pull request #631 from dvrkps/patch-1
travis: update go versions
2017-09-12 09:10:15 -04:00
Davor Kapsa f0503355da travis: update go versions 2017-09-12 10:56:33 +02:00
Marty Schoch c048833fcd added stringer method to phrase part
a failing test was producing unhelpful pointer addresses as
the only debug output.  this changes the output to print
the terms and locations as readable text

part of #629
2017-09-01 09:16:08 -04:00
Marty Schoch 930c06dfec rewrote logic to be more obvious
found during code walkthrough on 8/24/2017
2017-08-25 09:30:16 -07:00
Marty Schoch b7a51dae2a Merge pull request #625 from steveyen/master
remove unused Document.Number property
2017-08-24 17:08:20 -07:00
Steve Yen 546700b2de fix comment typo 2017-08-24 16:25:10 -07:00
Steve Yen 87115cbfb7 remove unused Document.Number property 2017-08-24 16:21:26 -07:00
Marty Schoch 82a101aedd Merge pull request #623 from mschoch/fix-race-518
fix data race in doc id search
2017-08-08 08:17:03 -04:00
Marty Schoch cea119449e fix data race in doc id search
the implementation of the doc id search requires that the list
of ids be sorted.  however, when doing a multisearch across
many indexes at once, the list of doc ids in the query is shared.
deeper in the implementation, the search of each shard attempts
to sort this list, resulting in a data race.

this is one example of a potentially larger problem, however
it has been decided to fix this data race, even though larger
issues of data ownership may remain unresolved.

this fix makes a copy of the list of doc ids, just prior to
sorting the list.  subsequently, all use of the list is on the
copy that was made, not the original.

fixes #518
2017-08-07 15:11:35 -04:00
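
The copy-before-sort fix described in the commit above can be illustrated with a minimal sketch. The helper names `sortedCopy` and `containsID` are hypothetical and do not come from the repository; the sketch only shows why sorting a private copy avoids writing to a slice that several goroutines share.

```go
package main

import "sort"

// sortedCopy returns a sorted copy of ids. The caller's slice is never
// written to, so it can be shared safely across concurrent searches.
func sortedCopy(ids []string) []string {
	cp := make([]string, len(ids))
	copy(cp, ids)
	sort.Strings(cp) // sorts the private copy only
	return cp
}

// containsID reports whether id is present in a sorted list of ids.
func containsID(sorted []string, id string) bool {
	i := sort.SearchStrings(sorted, id)
	return i < len(sorted) && sorted[i] == id
}
```

Each per-index search would call `sortedCopy` once and use only the returned slice afterwards. Sorting the shared slice in place from several goroutines is the kind of concurrent write that Go's race detector reports.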
Marty Schoch 174f8ed44a Merge pull request #615 from ethantkoenig/fix/camel_case
Fix token start/end/position values in camelCase tokenizer
2017-06-28 13:18:15 -04:00
Ethan Koenig 0433f05d9c Fix test 2017-06-22 18:56:28 -04:00
Ethan Koenig 8994ad2e00 Fix token start/end/position values in camelCase tokenizer 2017-06-22 17:42:39 -04:00