Marty Schoch
d7eb223e14
remove bolt segment format
...
upcomning breaking changes and no desire to maintain
2017-12-11 10:20:26 -05:00
Marty Schoch
eada7b209b
fix test issue identified by sreekanth
2017-12-11 10:16:56 -05:00
Marty Schoch
8280859bb8
handle read-only and in-mem only cases
2017-12-11 09:07:01 -05:00
Marty Schoch
e8cc7ac0bf
add new fields command to zap cmd-line util
2017-12-11 09:05:50 -05:00
Marty Schoch
690cd39921
add crazy slow but functional DocumentVisitFieldTerms
2017-12-10 08:55:59 -05:00
Marty Schoch
dc0adc8827
add fsync
2017-12-09 20:52:01 -05:00
Marty Schoch
e0d9828cd0
add more detail to the readme
2017-12-09 14:42:36 -05:00
Marty Schoch
414899618b
switch from bolt format to zap in the persister
2017-12-09 14:28:50 -05:00
Marty Schoch
9781d9b089
add initial version of zap file format
2017-12-09 14:28:33 -05:00
Marty Schoch
ff2e6b98e4
added empty segment
2017-12-09 12:43:02 -05:00
Marty Schoch
e470105635
fix issues identified by errcheck
2017-12-06 18:36:14 -05:00
Marty Schoch
adac4f41db
initial version of scorch which persists index to disk
2017-12-06 18:33:47 -05:00
Marty Schoch
b1346b4c8a
add readme describing our use of bolt as a segment format
2017-12-05 16:09:00 -05:00
Marty Schoch
898a6b1e85
fix errcheck issues
2017-12-05 13:32:57 -05:00
Marty Schoch
ece27ef215
adding initial version of bolt persisted segment
2017-12-05 13:05:12 -05:00
Marty Schoch
f6be841668
add test for postings list count method
2017-12-05 13:01:36 -05:00
Marty Schoch
30e9d6daa5
add better testing of array positions
2017-12-05 12:54:44 -05:00
Marty Schoch
8d9d45115f
add test of location field
2017-12-05 12:20:06 -05:00
Marty Schoch
8f0350865b
add test for segment fields method
2017-12-05 12:17:56 -05:00
Marty Schoch
7a6b5483f2
add validation that all locations were seen
2017-12-05 11:58:05 -05:00
Marty Schoch
e08fdab54a
remove todo item
2017-12-05 10:13:27 -05:00
Marty Schoch
87e2627551
added dictionary tests to mem segment
2017-12-05 09:49:41 -05:00
Marty Schoch
ed067f45dd
added Close() method to Segment
2017-12-05 09:31:02 -05:00
Marty Schoch
22ffc8940e
update segment API to return error in key places
2017-12-04 18:06:06 -05:00
Marty Schoch
b74cf4b081
add copyright header to all new files in scorch
2017-12-01 15:42:50 -05:00
Marty Schoch
89aa02cf5b
fix highlighting of composite fields
...
updated log statements for refactored names
2017-12-01 15:12:08 -05:00
Marty Schoch
cff14f1212
fix crash in DocNumbers when segment is empty
2017-12-01 09:50:27 -05:00
Marty Schoch
eb256f78bc
switch to constant referring to id field id 0
...
this avoids potentially mutating something that is intended
to be immutable
2017-12-01 09:30:07 -05:00
Marty Schoch
7c964de8bf
switch to binary search for finding segment from global doc num
...
added unit tests for this function specifically
2017-12-01 09:26:51 -05:00
Marty Schoch
c2047dcdf9
refactor doc id reader creation to share more code
...
fix issue identified by steve
2017-12-01 08:54:39 -05:00
Marty Schoch
bcd4bdc3d1
added initial bolt thought to README
2017-12-01 07:27:04 -05:00
Marty Schoch
395458ce83
refactor to make mem segment contents exported
2017-12-01 07:26:47 -05:00
Marty Schoch
f521d80835
Merge pull request #645 from steveyen/scorch
...
scorch InternalID() handles case of unknown docId
2017-12-01 07:21:26 -05:00
Steve Yen
398dcb19b3
scorch introducer uses the roaring.Or(x, y) API
...
Instead of cloning an input bitmap, the roaring.Or(x, y)
implementation fills a brand new result bitmap, which should be allow
for more efficient packing and memory utilization.
2017-11-30 10:37:10 -08:00
Steve Yen
67986d41bf
scorch InternalID() handles case of unknown docId
2017-11-30 08:36:01 -08:00
Marty Schoch
848aca4639
fix issues identified by errcheck
2017-11-29 13:34:15 -05:00
Marty Schoch
23f6dc1cc6
working in-memory version
2017-11-29 11:33:35 -05:00
Joachim Schwarm
4ddc50e86d
typo in documentation
2017-11-21 16:35:07 +01:00
Marty Schoch
6eea5b78da
Merge pull request #631 from dvrkps/patch-1
...
travis: update go versions
2017-09-12 09:10:15 -04:00
Davor Kapsa
f0503355da
travis: update go versions
2017-09-12 10:56:33 +02:00
Marty Schoch
c048833fcd
added stringer method to phrase part
...
a failing test was producing unhelpful pointer addresses as
the only debug output. this changes the output to print
the terms and locations as readable text
part of #629
2017-09-01 09:16:08 -04:00
Marty Schoch
930c06dfec
rewrote logic to be more obvious
...
found during code walkthrough on 8/24/2017
2017-08-25 09:30:16 -07:00
Marty Schoch
b7a51dae2a
Merge pull request #625 from steveyen/master
...
remove unused Document.Number property
2017-08-24 17:08:20 -07:00
Steve Yen
546700b2de
fix comment typo
2017-08-24 16:25:10 -07:00
Steve Yen
87115cbfb7
remove unused Document.Number property
2017-08-24 16:21:26 -07:00
Marty Schoch
82a101aedd
Merge pull request #623 from mschoch/fix-race-518
...
fix data race in doc id search
2017-08-08 08:17:03 -04:00
Marty Schoch
cea119449e
fix data race in doc id search
...
the implementation of the doc id search requires that the list
of ids be sorted. however, when doing a multisearch across
many indexes at once, the list of doc ids in the query is shared.
deeper in the implementation, the search of each shard attempts
to sort this list, resulting in a data race.
this is one example of a potentially larger problem, however
it has been decided to fix this data race, even though larger
issues of data owernship may remain unresolved.
this fix makes a copy of the list of doc ids, just prior to
sorting the list. subsequently, all use of the list is on the
copy that was made, not the original.
fixes #518
2017-08-07 15:11:35 -04:00
Andrey Khomenko
dc9f994d95
Update index.go
2017-07-20 12:06:45 -04:00
Marty Schoch
174f8ed44a
Merge pull request #615 from ethantkoenig/fix/camel_case
...
Fix token start/end/position values in camelCase tokenizer
2017-06-28 13:18:15 -04:00
Ethan Koenig
0433f05d9c
Fix test
2017-06-22 18:56:28 -04:00