Silvan Jegen
a8ec7f7af2
Add tests for the Kagome tokenizer
2014-09-13 17:45:22 +02:00
Silvan Jegen
ebf100c097
Add the Kagome tokenizer for Japanese
2014-09-13 17:45:19 +02:00
Marty Schoch
198ca1ad4d
major refactor of kvstore/index internals, see below
...
In the index/store package
introduce KVReader
creates snapshot
all read operations consistent from this snapshot
must close to release
introduce KVWriter
only one writer active
access to all operations
allows for consisten read-modify-write
must close to release
introduce AssociativeMerge operation on batch
allows efficient read-modify-write
for associative operations
used to consolidate updates to the term summary rows
saves 1 set and 1 get op per shared instance of term in field
In the index package
introduced an IndexReader
exposes a consisten snapshot of the index for searching
At top level
All searches now operate on a consisten snapshot of the index
2014-09-12 17:21:35 -04:00
Marty Schoch
7819deb447
added boltdb benchmark, same as others
2014-09-12 16:55:50 -04:00
Marty Schoch
2294b24b9d
remove forestdb for now
...
not any benfefit in maintaining this for the time being
2014-09-12 16:55:11 -04:00
Marty Schoch
8c16d68c00
include cjk analyzer in default config
2014-09-11 10:44:14 -04:00
Marty Schoch
1a1cf32a86
introducing cjk_bigram filter and cjk analyzer
...
closes #34
2014-09-11 10:39:05 -04:00
Marty Schoch
cb5ccd2b1d
fix whitespace tokenizer
...
previously would fail to split ascii running into ideographic
2014-09-11 10:38:02 -04:00
Marty Schoch
8debf26cb7
changed many components to not have defaults
...
many of these defaults were arbitrary, and not having
defaults lets us more easily flag them for configuration
added a shingle filter
introduce new toke type for shingles
2014-09-09 18:15:14 -04:00
Marty Schoch
8dd8fb8910
fix compilation
2014-09-07 14:13:32 -04:00
Marty Schoch
6b4c86b35a
changed whitespace tokenizer to work better on cjk input
...
now it will return each cjk character as a separate token
this will pair well with a cjk bigram filter for indexing
2014-09-07 14:11:01 -04:00
Marty Schoch
933d99c576
rename the configurable token map from standard to custom
...
this makes it consistent with the "custom" analyzer
which operates similarly
also, added it to the config.go so its registerd and
available for use
2014-09-07 14:09:38 -04:00
Marty Schoch
22911888c4
refactor registry package and bleve_registry utility
2014-09-07 14:07:42 -04:00
Marty Schoch
9e78643bad
icu tokenier uses brk status to set token type
...
part of #34
2014-09-07 10:24:02 -04:00
Marty Schoch
44df73d317
apply doc fix patch from rakoo
...
closes #95
2014-09-07 09:09:47 -04:00
Marty Schoch
f87a22e24c
added json struct tag to http doc count response
2014-09-05 12:16:26 -04:00
Marty Schoch
b1dd4215fc
added features to readme
2014-09-04 15:09:19 -04:00
Marty Schoch
f384f9dead
added link to wiki search to readme
2014-09-04 14:43:25 -04:00
Marty Schoch
d90697f725
added features to readme
2014-09-04 14:31:26 -04:00
Marty Schoch
afdb5f057f
added convenience method to add field to highlight request
2014-09-04 10:13:13 -04:00
Marty Schoch
9d2187706e
another round of golint
2014-09-03 19:53:59 -04:00
Marty Schoch
8b9255f52f
even more golint cleanups
2014-09-03 19:32:27 -04:00
Marty Schoch
e21935f850
another round of golint cleanup
2014-09-03 19:16:46 -04:00
Marty Schoch
e1b77956d4
more golint cleanups
2014-09-03 18:47:02 -04:00
Marty Schoch
377ae090d0
additional golint issues resolved
2014-09-03 18:17:26 -04:00
Marty Schoch
d534b0836b
converted ALL_CAPS constants to CamelCase
2014-09-03 17:48:40 -04:00
Marty Schoch
53b25195d6
further refactoring of index mappings
2014-09-03 16:40:10 -04:00
Marty Schoch
7fbd44224d
get correct field first, then use it for looking up related
2014-09-03 16:09:51 -04:00
Marty Schoch
8e6c8e5644
continued refactoring of the mapping code
...
also renamed some constant that didnt follow go convetions
2014-09-03 13:02:10 -04:00
Marty Schoch
a151bda2ad
moved some logic from mapping_index to mapping_document
...
part of #92
2014-09-03 10:51:21 -04:00
Marty Schoch
28980c4da1
fix issues identified by go lint
2014-09-02 17:40:46 -04:00
Marty Schoch
d75d836c09
change another variable capitalization
2014-09-02 14:22:21 -04:00
Marty Schoch
bbc6fadf69
changed error constants to camel case
...
all caps constants are not idiomatic go
2014-09-02 14:14:05 -04:00
Marty Schoch
f6a3831687
remove some unused vars
2014-09-02 13:58:27 -04:00
Marty Schoch
45e1b2dfc6
removing gouchstore store impl
...
this implementation didn't really adhere to the contract
and now that we have boltdb we have a better pure go impl
2014-09-02 13:56:35 -04:00
Marty Schoch
7a7eb2e94c
add newline between license and package
...
this avoids cluttering godocs with the license
2014-09-02 10:54:50 -04:00
Marty Schoch
a1f0c02cab
remove flag that is no longer used
2014-09-02 10:27:38 -04:00
Marty Schoch
3bc165d77b
renamed/moved examples/bleve_index_json to utils/bleve_index
2014-09-01 16:14:29 -04:00
Marty Schoch
7c0ea53ea2
added utility bleve_create
2014-09-01 14:54:47 -04:00
Marty Schoch
5d435bd022
moved bleve_query from examples to utils
2014-09-01 14:45:02 -04:00
Marty Schoch
ac6176f14c
removed old example from gitignore
2014-09-01 14:37:22 -04:00
Marty Schoch
68a332bc5a
fix broken test crashing
2014-09-01 14:36:46 -04:00
Marty Schoch
97f6b3980b
moved beer-sample and associated data to its own repo
2014-09-01 14:27:52 -04:00
Marty Schoch
831b82832e
added ability to pring mapping
2014-09-01 14:17:05 -04:00
Marty Schoch
f28d00de87
fix so index mapping properly registers analysis on deserialization
2014-09-01 14:16:31 -04:00
Marty Schoch
1dcd06e412
add ability to define custom analysis as part of index mapping
...
now, as part of your index mapping you can create custom
analysis components. these custome analysis components
are serialized as part of the mapping, and reused
as you would expect on subsequent accesses.
2014-09-01 13:55:23 -04:00
Marty Schoch
2ee7289bc8
major refactor of search package
...
this started initially to relocate highlighting into
a self contained package, which would then also use
the registry
however, it turned into a much larger refactor in
order to avoid cyclic imports
now facets, searchers, scorers and collectors
are also broken out into subpackages of search
2014-09-01 11:15:38 -04:00
Marty Schoch
209f808722
improve go docs at the top level
...
part of #79
2014-08-31 10:55:22 -04:00
Marty Schoch
862205f184
fix deadlock
...
Search() would attempt to reacquire mutex when invoking Document()
should instead call index.Document(), read mutex is already
acquired
closes #87
2014-08-30 14:49:16 -04:00
Marty Schoch
ac0c2b05fe
reproduce bug #87
2014-08-30 14:49:02 -04:00