f156d7dd3b
Stop reindexing already indexed documents
2022-08-02 10:31:20 +02:00
e1bc610d27
Merge #595
...
595: Update version for next release (v0.32.0) r=ManyTheFish a=curquiza
In order to release on `main` (for v0.29.0, not v0.28.1)
<img width="1014" alt="Capture d’écran 2022-07-21 à 13 20 35" src="https://user-images.githubusercontent.com/20380692/180178725-381fbdf1-c0fb-4fa9-9954-452aec5a1574.png ">
Co-authored-by: Clémentine Urquizar <clementine@meilisearch.com >
2022-07-21 11:07:42 +00:00
d5e9b7305b
Update version for next release (v0.32.0)
2022-07-21 13:20:02 +04:00
941af58239
Merge #561
...
561: Enriched documents batch reader r=curquiza a=Kerollmops
~This PR is based on #555 and must be rebased on main after it has been merged to ease the review.~
This PR contains the work in #555 and can be merged on main as soon as reviewed and approved.
- [x] Create an `EnrichedDocumentsBatchReader` that contains the external documents id.
- [x] Extract the primary key name and make it accessible in the `EnrichedDocumentsBatchReader`.
- [x] Use the external id from the `EnrichedDocumentsBatchReader` in the `Transform::read_documents`.
- [x] Remove the `update_primary_key` from the _transform.rs_ file.
- [x] Really generate the auto-generated documents ids.
- [x] Insert the (auto-generated) document ids in the document while processing it in `Transform::read_documents`.
Co-authored-by: Kerollmops <clement@meilisearch.com >
2022-07-21 07:08:50 +00:00
41a0ce07cb
Add a code comment, as suggested in PR review
...
Co-authored-by: Many the fish <many@meilisearch.com >
2022-07-20 16:20:35 +02:00
5704235521
Merge #584
...
584: Chores: Enhance smart-crop code comments r=curquiza a=ManyTheFish
Enhance explanation around smart crop algorithms
Co-authored-by: ManyTheFish <many@meilisearch.com >
Co-authored-by: Many the fish <many@meilisearch.com >
2022-07-19 07:08:14 +00:00
f6415b679f
Merge #588
...
588: Fix name of "release_date" facet in movies benchmarks r=ManyTheFish a=loiclec
## What does this PR do?
The `movies.json` file in the benchmark datasets contains a filterable field called "release_date", but the indexing benchmarks wrongly called the field "released_date" instead. This PR fixes that.
Co-authored-by: Loïc Lecrenier <loic@meilisearch.com >
2022-07-18 15:51:09 +00:00
2d79720f5d
Update milli/src/search/matches/mod.rs
2022-07-18 17:48:04 +02:00
8ddb4e750b
Update milli/src/search/matches/mod.rs
2022-07-18 17:47:39 +02:00
a277daa1f2
Update milli/src/search/matches/mod.rs
2022-07-18 17:47:13 +02:00
fb794c6b5e
Update milli/src/search/matches/mod.rs
2022-07-18 17:46:00 +02:00
1237cfc249
Update milli/src/search/matches/mod.rs
2022-07-18 17:45:37 +02:00
d7fd5c58cd
Update milli/src/search/matches/mod.rs
2022-07-18 17:45:06 +02:00
fc9f3f31e7
Change DocumentsBatchReader to access cursor and index at same time
...
Otherwise it is not possible to iterate over all documents while
using the fields index at the same time.
2022-07-18 16:08:14 +02:00
ab1571cdec
Simplify Transform::read_documents, enabled by enriched documents reader
2022-07-18 12:45:47 +02:00
8270e2b768
Fix name of "release_date" facet in movies benchmarks
2022-07-18 10:34:12 +02:00
e261ef64d7
Update milli/src/search/matches/mod.rs
...
Co-authored-by: Clément Renault <clement@meilisearch.com >
2022-07-18 10:18:51 +02:00
1da4ab5918
Update milli/src/search/matches/mod.rs
...
Co-authored-by: Clément Renault <clement@meilisearch.com >
2022-07-18 10:18:03 +02:00
448114cc1c
Fix the benchmarks with the new indexation API
2022-07-12 15:22:09 +02:00
25e768f31c
Fix another issue with the nested primary key selector
2022-07-12 15:14:07 +02:00
192793ee38
Add some tests to check for the nested documents ids
2022-07-12 15:14:07 +02:00
a892a4a79c
Introduce a function to extend from a JSON array of objects
2022-07-12 15:14:06 +02:00
dc61105554
Fix the nested document id fetching function
2022-07-12 15:14:06 +02:00
2eec290424
Check the validity of the latitute and longitude numbers
2022-07-12 15:14:06 +02:00
5d149d631f
Remove tests for a function that no more exists
2022-07-12 15:14:06 +02:00
0bbcc7b180
Expose the DocumentId
struct to be sure to inject the generated ids
2022-07-12 15:14:06 +02:00
d1a4da9812
Generate a real UUIDv4 when ids are auto-generated
2022-07-12 15:14:06 +02:00
c8ebf0de47
Rename the validate function as an enriching function
2022-07-12 15:14:06 +02:00
905af2a2e9
Use the primary key and external id in the transform
2022-07-12 15:14:05 +02:00
742543091e
Constify the default primary key name
2022-07-12 14:55:52 +02:00
5f1bfb73ee
Extract the primary key name and make it accessible
2022-07-12 14:55:52 +02:00
6a0a0ae94f
Make the Transform read from an EnrichedDocumentsBatchReader
2022-07-12 14:55:52 +02:00
ea852200bb
Fix the format used for a geo deleting benchmark
2022-07-12 14:55:52 +02:00
dc3f092d07
Do not leak an internal grenad Error
2022-07-12 14:55:52 +02:00
8ebf5eed0d
Make the nested primary key work
2022-07-12 14:55:52 +02:00
19eb3b4708
Make sur that we do not accept floats as documents ids
2022-07-12 14:55:52 +02:00
2ceeb51c37
Support the auto-generated ids when validating documents
2022-07-12 14:55:51 +02:00
399eec5c01
Fix the indexation tests
2022-07-12 14:55:51 +02:00
fcfc4caf8c
Move the Object type in the lib.rs file and use it everywhere
2022-07-12 14:55:51 +02:00
0146175fe6
Introduce the validate_documents_batch function
2022-07-12 14:55:51 +02:00
cefffde9af
Improve the .gitignore of the fuzz crate
2022-07-12 14:55:51 +02:00
bdc4263883
Introduce the validate_documents_batch function
2022-07-12 14:55:51 +02:00
a97d4d63b9
Fix the benchmarks
2022-07-12 14:55:50 +02:00
f29114f94a
Fix http-ui to fit with the new DocumentsBatchBuilder/Reader structs
2022-07-12 14:52:56 +02:00
a4ceef9624
Fix the cli for the new DocumentsBatchBuilder/Reader structs
2022-07-12 14:52:56 +02:00
6d0498df24
Fix the fuzz tests
2022-07-12 14:52:56 +02:00
e8297ad27e
Fix the tests for the new DocumentsBatchBuilder/Reader
2022-07-12 14:52:56 +02:00
419ce3966c
Rework the DocumentsBatchBuilder/Reader to use grenad
2022-07-12 14:52:55 +02:00
eb63af1f10
Update grenad to 0.4.2
2022-07-12 14:52:55 +02:00
048e174efb
Do not allocate when parsing CSV headers
2022-07-12 14:52:55 +02:00