baddd80069
implement review suggestions
2021-10-25 18:29:12 +02:00
430e9b13d3
add csv builder tests
2021-10-25 10:26:43 +02:00
0f86d6b28f
implement csv serialization
2021-10-25 10:26:42 +02:00
86ead92ed5
infer primary key on sorted fields
2021-10-12 11:15:11 +02:00
9a266a531b
test correct primary key inference
2021-10-12 11:08:53 +02:00
f65153ad64
stop casting integer docids to string
2021-09-28 18:35:54 +02:00
aa6c5df0bc
Implement documents format
...
document reader transform
remove update format
support document sequences
fix document transform
clean transform
improve error handling
add documents! macro
fix transform bug
fix tests
remove csv dependency
Add comments on the transform process
replace search cli
fmt
review edits
fix http ui
fix clippy warnings
Revert "fix clippy warnings"
This reverts commit a1ce3cd96e603633dbf43e9e0b12b2453c9c5620.
fix review comments
remove smallvec in transform loop
review edits
2021-09-21 16:58:33 +02:00
9961b78b06
Drop sorter before creating a new one
2021-09-08 13:30:26 +02:00
1d314328f0
Plug new indexer
2021-09-01 16:48:36 +02:00
838ed1cd32
Use a u16 field id instead of one byte
2021-07-06 11:58:03 +02:00
0cca2ea24f
Return a MissingDocumentId when a document doesn't have one
2021-06-22 11:22:33 +02:00
daef43f504
Rename FieldsDistribution into FieldDistribution
2021-06-21 15:57:41 +02:00
d08cfda796
convert the field_distribution to a BTreeMap and avoid counting the same documents twice
2021-06-17 18:31:54 +02:00
969adaefdf
rename fields_distribution to field_distribution
2021-06-17 15:16:20 +02:00
9716fb3b36
format the whole project
2021-06-16 18:33:33 +02:00
713acc408b
Introduce the primary key to the Settings builder structure
2021-06-16 11:03:36 +02:00
28c004aa2c
Prefer using constant for the database names
2021-06-15 11:13:04 +02:00
312c2d1d8e
Use the Error enum everywhere in the project
2021-06-14 16:58:38 +02:00
65b1d09d55
Move the obkv merging functions into the merge_function module
2021-06-14 16:48:51 +02:00
ff9414a6ba
Use the output of the compute_primary_key_pair function
2021-06-14 16:48:50 +02:00
57898d8a90
fix silent deserialize error
2021-06-03 10:42:55 +02:00
eeb0c70ea2
meilisearch compatible primary key inference
2021-05-06 22:42:32 +02:00
f8d0f5265f
fix(update): fields distribution after documents merge
2021-05-04 22:12:20 +03:00
2658c5c545
feat(index): update fields distribution in clear & delete operations
...
fixes after review
bump the version of the tokenizer
implement a first version of the stop_words
The front must provide a BTreeSet containing the stop words
The stop_words are set to None if an empty Set is provided
add the stop-words in the http-ui interface
Use maplit in the test
and remove all the useless drop(rtxn) at the end of all tests
Integrate the stop_words in the querytree
remove the stop_words from the querytree except if it was a prefix or a typo
more fixes after review
2021-04-01 19:12:35 +03:00
27c7ab6e00
feat(index): store fields distribution in index
2021-04-01 18:35:19 +03:00
e08b6b3ec7
add primary key to fields_id_map when not present
2021-03-01 16:10:16 +01:00
89ce4e74fe
Do not change the primary key type when we serialize documents
2021-02-15 21:24:36 +01:00
e8639517da
Change the project to become a workspace with milli as a default-member
2021-02-12 16:15:09 +01:00