a81165f0d8
Merge remote-tracking branch 'origin/main' into search-refactor
2023-04-07 10:15:55 +02:00
130d2061bd
Fix indexing of word_position_docid and fid
2023-04-06 17:50:39 +02:00
66ddee4390
Fix word_position_docids indexing
2023-04-06 17:50:39 +02:00
e58426109a
Fix panics and issues in exactness graph ranking rule
2023-04-06 17:50:39 +02:00
996619b22a
Increase position by 8 on hard separator when building query terms
2023-04-06 17:50:39 +02:00
efea1e5837
Fix facet normalization
2023-03-29 12:02:24 +02:00
9b2653427d
Split position DB into fid and relative position DB
2023-03-23 09:22:01 +01:00
2f8eb4f54a
last PR fixes
2023-03-09 15:34:36 +01:00
5deea631ea
fix clippy too many arguments
2023-03-09 11:19:13 +01:00
b4b859ec8c
Fix typos
2023-03-09 10:58:35 +01:00
24c0775c67
Change indexing threshold
2023-03-08 12:36:04 +01:00
3092cf0448
Fix clippy errors
2023-03-08 10:53:42 +01:00
da48506f15
Rerun extraction when language detection might have failed
2023-03-07 18:35:26 +01:00
bbecab8948
fix clippy
2023-02-21 10:18:44 +01:00
8aa808d51b
Merge branch 'main' into enhance-language-detection
2023-02-20 18:14:34 +01:00
18796d6e6a
Consider null as a valid geo object
2023-02-20 13:45:51 +01:00
d8207356f4
Skip script,language insertion if language is undetected
2023-01-31 11:28:05 +01:00
fd60a39f1c
Format code
2023-01-31 11:28:05 +01:00
d97fb6117e
Extract and index data
2023-01-31 11:28:05 +01:00
d1fc42b53a
Use compatibility decomposition normalizer in facets
2023-01-18 15:02:13 +01:00
8d0ace2d64
Avoid creating a MatchingWord for words that exceed the length limit
2022-11-28 10:20:13 +01:00
ac3baafbe8
Truncate facet values that are too long before indexing them
2022-11-17 11:29:42 +01:00
d95d02cb8a
Fix Facet Indexing bugs
...
1. Handle keys with variable length correctly
This fixes https://github.com/meilisearch/meilisearch/issues/3042 and
is easily reproducible with the updated fuzz tests, which now generate
keys with variable lengths.
2. Prevent adding facets to the database if their encoded value does
not satisfy `valid_lmdb_key`.
This fixes an indexing failure when a document had a filterable
attribute containing a value longer than ~500 bytes.
2022-11-17 11:29:42 +01:00
70465aa5ce
Execute cargo fmt
2022-11-04 08:59:58 +09:00
3009981d31
Fix clippy errors
...
Add clippy job
Add clippy job to CI
2022-11-04 08:58:14 +09:00
c7322f704c
Fix cargo clippy errors
...
Don't apply clippy to tests for now
Fix clippy warnings of filter-parser package
parent 8352febd646ec4bcf56a44161e5c4dce0e55111f
author unvalley <38400669+unvalley@users.noreply.github.com> 1666325847 +0900
committer unvalley <kirohi.code@gmail.com> 1666791316 +0900
Update .github/workflows/rust.yml
Co-authored-by: Clémentine Urquizar - curqui <clementine@meilisearch.com>
Allow clippy lint too_many_arguments
Allow clippy lint needless_collect
Allow clippy lint too_many_arguments and type_complexity
Fix for clippy warnings comparison_chains
Fix for clippy warnings vec_init_then_push
Allow clippy lint should_implement_trait
Allow clippy lint drop_non_drop
Fix lifetime clippy warnings in filter-parser
Execute cargo fmt
Fix clippy remaining warnings
Fix clippy remaining warnings again and allow lint on each place
2022-10-27 01:04:23 +09:00
54c0cf93fe
Merge remote-tracking branch 'origin/main' into facet-levels-refactor
2022-10-26 15:13:34 +02:00
a034a1e628
Move StrRefCodec and ByteSliceRefCodec to their own files
2022-10-26 13:47:46 +02:00
51961e1064
Polish some details
2022-10-26 13:47:04 +02:00
b1ab09196c
Remove outdated TODOs
2022-10-26 13:47:04 +02:00
9026867d17
Give same interface to bulk and incremental facet indexing types
...
+ cargo fmt, oops, sorry for the bad history :(
2022-10-26 13:47:04 +02:00
485a72306d
Refactor facet-related codecs
2022-10-26 13:47:04 +02:00
afdf87f6f7
Fix bugs in asc/desc criterion and facet indexing
2022-10-26 13:47:04 +02:00
a7201ece04
cargo fmt
2022-10-26 13:47:04 +02:00
61252248fb
Fix some facet indexing bugs
2022-10-26 13:47:04 +02:00
85824ee203
Try to make facet indexing incremental
2022-10-26 13:47:04 +02:00
39a4a0a362
Reintroduce filter range search and facet extractors
2022-10-26 13:46:14 +02:00
c3f49f766d
Prepare refactor of facets database
2022-10-26 13:46:14 +02:00
6b2fe94192
Fixes for clippy bringing us down to 18 remaining issues.
...
This brings us a step closer to enforcing clippy on each build.
2022-10-25 20:49:02 +02:00
d76d0cb1bf
Merge branch 'main' into word-pair-proximity-docids-refactor
2022-10-24 15:23:00 +02:00
a7de4f5b85
Don't add swapped word pairs to the word_pair_proximity_docids db
2022-10-18 10:37:34 +02:00
bdeb47305e
Change encoding of word_pair_proximity DB to (proximity, word1, word2)
...
Same for word_prefix_pair_proximity
2022-10-18 10:37:34 +02:00
beb987d3d1
Fixing piles of clippy errors.
...
Most of these are calling clone when the struct supports Copy.
Many are using & and &mut on `self` when the function they are called
from already has an immutable or mutable borrow so this isn't needed.
I tried to stay away from actual changes or places where I'd have to
name fresh variables.
2022-10-13 22:02:54 +02:00
762e320c35
Add proximity calculation for the same word
2022-10-07 12:59:12 +02:00
00c02d00f3
Add missing logging timer to extractors
2022-09-30 22:17:06 +05:30
3794962330
Use an unstable algorithm for grenad::Sorter when possible
2022-09-13 14:49:53 +02:00
fe3973a51c
Make sure that long words are correctly skipped
2022-09-07 15:03:32 +02:00
306593144d
Refactor word prefix pair proximity indexation
2022-08-17 11:59:00 +02:00
07003704a8
Merge branch 'filter/field-exist'
2022-07-21 14:51:41 +02:00
1506683705
Avoid using too much memory when indexing facet-exists-docids
2022-07-19 14:42:35 +02:00