119 Commits

Author SHA1 Message Date
Clément Renault
54dacb362d Use different algorithms for different documents ratios 2020-01-14 17:51:08 +01:00
Clément Renault
6edb460bea Try with an exponential search 2020-01-14 16:52:24 +01:00
Clément Renault
40dab80dfa Change the way we filter the documents 2020-01-14 14:18:01 +01:00
Clément Renault
681711fced Fix query ids to be usize 2020-01-14 13:12:42 +01:00
Clément Renault
21c1473e0c Introduce the distance data 2020-01-14 11:38:04 +01:00
Clément Renault
8acbdcbbad wip: Make the new query tree work with the criteria 2020-01-13 14:36:06 +01:00
Clément Renault
da8abebfa2 Introduce the query words mapping along with the query tree 2020-01-13 13:29:47 +01:00
Clément Renault
4f7a7ea0bb Faster intersection group by 2020-01-09 16:30:03 +01:00
Clément Renault
d6c9ba8f08 Store the postings lists 2020-01-09 15:04:53 +01:00
Clément Renault
ec8916bf54 Change the debug outputs 2020-01-09 12:05:39 +01:00
Clément Renault
81c573ec92 Add the raw document IDs to the postings lists 2020-01-08 15:30:43 +01:00
Clément Renault
9420edadf4 Introduce the Postings type to decorrelate the DocumentIds 2020-01-08 14:48:23 +01:00
Clément Renault
d724a7659e Introduce a query tree context struct 2020-01-08 13:37:22 +01:00
Clément Renault
887c212b49 Add more logs about the docids construction 2020-01-08 13:22:42 +01:00
Clément Renault
07937ed6d7 Use the prefix caches 2020-01-08 13:14:07 +01:00
Clément Renault
a262c67ec3 limit the search in the FST 2020-01-08 13:06:12 +01:00
Clément Renault
13ca30c4d8 WIP: Made the query tree traversing support prefix search 2020-01-08 12:02:58 +01:00
Clément Renault
fbcec2975d wip: Impl a basic tree traversing 2020-01-07 18:24:13 +01:00
Clément Renault
6e1f4af833 wip: Create a tree from query but need to show synonyms 2020-01-07 18:24:13 +01:00
Clément Renault
856c5c4214 Fix group offset computing 2019-12-31 14:24:10 +01:00
Clément Renault
670e80c151 Use the cached postings lists in the query system 2019-12-31 13:32:36 +01:00
Clément Renault
eed07c724f Add more logging for postings lists fetching by word 2019-12-31 13:32:36 +01:00
Clément Renault
99d35fb940 Introduce a first version of a number of candidates reducer
It works by ignoring the postings lists associated with documents that the previous words did not return
2019-12-31 13:32:36 +01:00
Clément Renault
106b886873 Cache the prefix postings lists 2019-12-30 18:01:32 +01:00
Clément Renault
928876b553 Introduce the postings lists caching stores
Currently not used
2019-12-30 18:01:27 +01:00
Clément Renault
58836d89aa Rename the PrefixCache into PrefixDocumentsCache 2019-12-30 15:42:09 +01:00
Clément Renault
1a5a104f13 Display proximity evaluation number of calls 2019-12-30 15:42:09 +01:00
Clément Renault
064cfa4755 Add more debug, where are those 100ms 2019-12-30 15:42:08 +01:00
Clément Renault
ed6172aa94 Add a time measurement of the criterion loop 2019-12-30 15:42:08 +01:00
Clément Renault
8c140f6bcd Increase the disk usage limit 2019-12-30 15:42:08 +01:00
Clément Renault
1e1f0fcaf5 Introduce a basic cache system for first letters 2019-12-30 15:42:08 +01:00
Clément Renault
d21352a109 Change the time measurement of the FST 2019-12-30 15:42:08 +01:00
Clément Renault
4be11f961b Use an ugly trick to avoid cloning the FST 2019-12-30 15:42:07 +01:00
Clément Renault
1163f390b3 Restrict FST search to the first letter of the word 2019-12-30 15:42:07 +01:00
Clément Renault
691e2a3c1d Fix a blocking channel, appearing like a deadlock 2019-12-30 15:28:28 +01:00
Clément Renault
04bb49989f Add more debug timings 2019-12-20 14:18:48 +01:00
Clément Renault
d12ff15ee3 Set the indexes info in the create_index function 2019-12-19 10:38:56 +01:00
Clément Renault
40c0b14d1c Reintroduce searchable attributes and reordering 2019-12-13 14:38:25 +01:00
Clément Renault
a4dd033ccf Rename raw_matches into bare_matches 2019-12-13 14:38:25 +01:00
Clément Renault
48e8778881 Clean up the modules declarations 2019-12-13 14:38:25 +01:00
Clément Renault
4be23efe66 Remove the AttrCount type
Could probably be reintroduced later
2019-12-13 14:38:25 +01:00
Clément Renault
7d67750865 Reintroduce exactness for one word document field 2019-12-13 14:38:25 +01:00
Clément Renault
746e6e170c Make the test pass again 2019-12-13 14:38:24 +01:00
Clément Renault
d93e35cace Introduce ContextMut and Context structs 2019-12-13 14:38:24 +01:00
Clément Renault
d75339a271 Prefer summing the attribute 2019-12-13 14:38:24 +01:00
Clément Renault
86ee0cbd6e Introduce bucket_sort_with_distinct function 2019-12-13 14:38:24 +01:00
Clément Renault
248ccfc0d8 Update the criteria to the new ones 2019-12-13 14:38:24 +01:00
Clément Renault
ea148575cf Remove the raw_query functions 2019-12-13 14:38:23 +01:00
Clément Renault
efc2be0b7b Bump the sdset dependency to 0.3.6 2019-12-13 14:38:23 +01:00
Clément Renault
8d71112dcb Rewrite the phrase query postings lists
This simplified the multiword_rewrite_matches function a little bit.
2019-12-13 14:38:23 +01:00