Commit Graph

29 Commits

Author SHA1 Message Date
Steve Yen 7a19e6fd7e scorch zap replace locsBitmap w/ 1 bit from freq-norm varint encoding
This is attempt #2 of the optimization that replaces the locsBitmap,
without any changes from the original commit attempt.  A commit that
follows this one contains the actual fix.

See also...
- commit 621b58dd83 (the 1st attempt)
- commit 49a4ee60ba (the revert)

-------------
The original commit message body from 621b58 was...

NOTE: this is a zap file format change.

The separate "postings locations" roaring Bitmap that encoded whether
a posting has locations info is now replaced by the least significant
bit in the freq varint encoded in the freq-norm chunkedIntCoder.

encode/decodeFreqHasLocs() are added as helper functions.
2018-03-23 12:50:24 -07:00
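
The bit packing described in commit 7a19e6fd7e above can be illustrated with a minimal sketch. The helper names encodeFreqHasLocs and decodeFreqHasLocs come from the commit message, but the signatures and bodies below are assumptions, not the code from the commit. The sketch shifts the frequency left by one bit, stores the "has locations" flag in the least significant bit, and round-trips the result through a standard uvarint.

```go
package main

import (
	"encoding/binary"
	"fmt"
)

// encodeFreqHasLocs packs a term frequency and a "has locations" flag
// into one uint64: the flag occupies the least significant bit.
// (Signature is an assumption made for this sketch.)
func encodeFreqHasLocs(freq uint64, hasLocs bool) uint64 {
	rv := freq << 1
	if hasLocs {
		rv |= 1
	}
	return rv
}

// decodeFreqHasLocs reverses encodeFreqHasLocs.
func decodeFreqHasLocs(v uint64) (freq uint64, hasLocs bool) {
	return v >> 1, v&1 != 0
}

func main() {
	// Round-trip the packed value through a varint, as an int coder might.
	buf := make([]byte, binary.MaxVarintLen64)
	n := binary.PutUvarint(buf, encodeFreqHasLocs(3, true))

	v, _ := binary.Uvarint(buf[:n])
	freq, hasLocs := decodeFreqHasLocs(v)
	fmt.Println(freq, hasLocs)
}
```

Because the flag rides along in a value that is already written for every posting, no separate bitmap needs to be stored or loaded; the cost is one bit of range in the frequency.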
Steve Yen 49a4ee60ba Revert "scorch zap replace locsBitmap w/ 1 bit from freq-norm varint encoding"
Testing with the cbft application led to cbft process exits...

  AsyncError exit()... error reading location field: EOF --
  main.initBleveOptions.func1() at init_bleve.go:85

This reverts commit 621b58dd83.
2018-03-23 10:01:30 -07:00
Steve Yen 67f75005c4 fix cmd/bleve help string for internal command 2018-03-22 17:43:07 -07:00
Steve Yen 621b58dd83 scorch zap replace locsBitmap w/ 1 bit from freq-norm varint encoding
NOTE: this is a zap file format change.

The separate "postings locations" roaring Bitmap that encoded whether
a posting has locations info is now replaced by the least significant
bit in the freq varint encoded in the freq-norm chunkedIntCoder.

encode/decodeFreqHasLocs() are added as helper functions.
2018-03-22 17:43:07 -07:00
Steve Yen 6b78dd4184 fix cmd/bleve scorch ascii cmd help text
Initially, there was a typo with an extra space char, but then I
realized some copy-paste mistakes also needed correcting.
2018-03-22 06:48:42 -07:00
Marty Schoch e9b228bcdd improve command-line tool for zap
correctly handle/print additional loc bitmap address
this fixes bitmap length that is output
instantiate roaring bitmap and print it out
removed some unnecessary debug logging

updated dict command to print 1-hit encoded vals
this makes dict command usable for seeing which
doc ids are in a segment and their corresponding doc number
2018-03-19 14:57:30 -04:00
Sreekanth Sivasankaran 19318194fa moving to new offset slice format 2018-03-13 14:06:48 +05:30
Sreekanth Sivasankaran aaccf59191 docValue space savings
merging the doc value length and loc
slices into a single offset slice, as that
is enough to compute the starting offset and
length of the doc values data for a given
document inside a docValue chunk.
2018-03-12 15:36:46 +05:30
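
As a rough illustration of why one offset slice is enough in commit aaccf59191 above, the sketch below assumes each entry holds the cumulative end offset of a document's data within the chunk. That layout, and the function name docValueRange, are assumptions for illustration; the commit's actual representation may differ.

```go
package main

import "fmt"

// docValueRange returns the start offset and length of the i-th
// document's data, assuming offsets[i] is the cumulative END offset of
// that document's data inside the chunk (a hypothetical layout).
func docValueRange(offsets []uint64, i int) (start, length uint64) {
	if i > 0 {
		start = offsets[i-1]
	}
	return start, offsets[i] - start
}

func main() {
	// Three documents whose data is 4, 0 and 6 bytes long.
	offsets := []uint64{4, 4, 10}
	for i := range offsets {
		start, length := docValueRange(offsets, i)
		fmt.Println(i, start, length)
	}
}
```

Under this assumption, a document with no doc values simply repeats the previous offset, so its length comes out as zero without a separate length slice.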
Steve Yen eac9808990 scorch zap optimize FST val encoding for terms with 1 hit
NOTE: this is a scorch zap file format change / bump to version 4.

In this optimization, the uint64 val stored in the vellum FST (term
dictionary) now may either be a uint64 postingsOffset (same as before
this change) or a uint64 encoding of the docNum + norm (in the case
where a term appears in just a single doc).
2018-03-08 09:19:54 -08:00
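
The dual-purpose uint64 described in commit eac9808990 above can be sketched as a tagged value. The bit layout here (top bit as the 1-hit flag, 31 bits each for norm and docNum) and the names oneHitFlag, encode1Hit and decodeFSTVal are assumptions chosen for illustration, not necessarily what the commit uses.

```go
package main

import "fmt"

const (
	// oneHitFlag marks a value as a 1-hit encoding (hypothetical layout).
	oneHitFlag = uint64(1) << 63
	// mask31 keeps the low 31 bits of a field.
	mask31 = uint64(1)<<31 - 1
)

// encode1Hit packs docNum and norm bits into a single flagged uint64.
func encode1Hit(docNum, normBits uint64) uint64 {
	return oneHitFlag | (normBits&mask31)<<31 | docNum&mask31
}

// decodeFSTVal reports whether v is a 1-hit encoding. If so it returns
// docNum and normBits; otherwise v is treated as a postings offset.
func decodeFSTVal(v uint64) (is1Hit bool, docNum, normBits, postingsOffset uint64) {
	if v&oneHitFlag != 0 {
		return true, v & mask31, (v >> 31) & mask31, 0
	}
	return false, 0, 0, v
}

func main() {
	fmt.Println(decodeFSTVal(encode1Hit(5, 9)))
	fmt.Println(decodeFSTVal(1024)) // plain postings offset
}
```

The design trade-off in a scheme like this is that a term appearing in a single document needs no postings list at all, at the price of reserving part of the uint64 range for the tag.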
Steve Yen c09e2a08ca scorch zap chunkedContentCoder reuses chunk metadata slice memory
And, renamed the chunk MetaData.DocID field to DocNum for naming
correctness, where much of this commit is the mechanical effect of
that rename.
2018-02-05 07:39:16 -08:00
Tom Kralidis 637fad78a5 fix minor typo 2018-01-06 21:04:03 -05:00
Marty Schoch 57a075afdb improving command-line tool for scorch 2018-01-05 11:50:07 -05:00
Marty Schoch c691cd2bb5 refactor scorch/zap command-line tools under bleve
zap command-line tool added to main bleve command-line tool
this required physical relocation due to the vendoring used
only on the bleve command-line tool (unforeseen limitation)

a new scorch command-line tool has also been introduced
and for the same reasons it is physically stored under
the top-level bleve command-line tool as well
2018-01-05 10:17:18 -05:00
R. S. Doiel 1d38f4791d added hyphen in query sort by option 2017-05-18 11:27:51 -07:00
R. S. Doiel c1db96946a Added -sortby, -b to query bleve command 2017-05-18 11:16:46 -07:00
Michael Lustfield c26af21050 Added name to copyright notice 2017-03-28 12:17:26 -05:00
Marty Schoch 9ca14d29f1 use cobra command Annotations to identify mutating commands
some downstream applications need to know which sub-commands
may alter the index.  this new function allows them to be
identified, while not prescribing any particular behavior.
2017-02-15 15:37:26 -05:00
Marty Schoch c937e971af bump manifest to latest cobra and pflag 2017-02-15 15:36:47 -05:00
Marty Schoch 1061636c9d vendor mousetrap
our policy has been to vendor the libraries required for the
command-line portion of the bleve command-line tool.  this
included spf13/cobra etc, unfortunately we missed this library
as it is only required for windows users.
2016-10-24 13:54:29 -04:00
Marty Schoch 9174872ba2 add the bleve check tool
bleve check was a consistency checking tool originally developed
as a part of cbft.  currently it checks that the term dictionary
counts match the number of postings for the term.  in the future
additional checks could be added.  this tool has been back
ported to bleve as we've now adopted a single common tool for
both cbft and bleve.
2016-10-22 06:11:50 -04:00
Marty Schoch 2f48d7fb02 fix misspellings 2016-10-02 12:11:15 -04:00
Marty Schoch 2332455bd2 nicer formatting of license header 2016-10-02 10:13:14 -04:00
Marty Schoch f90856b8d3 BREAKING CHANGE - rename upside_down to upsidedown 2016-09-30 12:36:38 -04:00
Marty Schoch 9ec2ddd757 initial refactor of query into separate package 2016-09-29 14:54:16 -04:00
Marty Schoch 3cf7e00b50 remove binary accidentally committed to repo
update .gitignore to prevent this in the future
2016-09-27 13:05:50 -04:00
Marty Schoch 0236043f65 rewrite links suitable for blevesearch website 2016-09-21 12:58:18 -04:00
Marty Schoch 0d52d2f8ea add build tag to ignore gendocs by default 2016-09-20 13:58:59 -04:00
Marty Schoch 81e676de79 improved usage and added utility to generate markdown docs 2016-09-20 13:42:45 -04:00
Marty Schoch c87cf35ace migrated all bleve utils into single bleve command
used spf13/cobra to make it awesome
and attempting to vendor this new dep
2016-09-14 11:52:29 -04:00