Commit Graph

11823 Commits

Author SHA1 Message Date
Clément Renault
67577a3760 It is an error to merge docid word positions 2020-10-04 17:31:12 +02:00
Clément Renault
ce8e56ee18 Rewrite the indexer to use one MTBL by database
This allows us to avoid prefixing keys and appending into LMDB databases
2020-10-04 17:04:33 +02:00
Clément Renault
770f29fd05 Bump the oxidized-mtbl dependency 2020-10-04 17:04:33 +02:00
Clément Renault
acd2a63879 Introduce a simple FST based chinese word segmenter 2020-10-04 17:04:33 +02:00
Clément Renault
6cc6addc2f Increase the CboRoaringBitmapCodec threshold 2020-10-02 17:06:17 +02:00
Clément Renault
e41a3822a6 Add a simple test for the CboRoaringBitmapCodec 2020-10-02 16:52:36 +02:00
Clément Renault
c4b0c57059 Reduce the default indexer max-memory parameter 2020-10-02 16:47:41 +02:00
Kerollmops
007e647462 Introduce the Mdfs Iterator that explore the proximity graph using a mana DFS 2020-10-02 16:46:07 +02:00
Kerollmops
d4e80407e5 Introduce the mana depth first search algorithm 2020-10-02 16:46:07 +02:00
Kerollmops
f6a8096720 Rename the quartile as percentiles 25th, 50th and 75th 2020-10-02 16:46:07 +02:00
Kerollmops
891e0188dd Introduce the database-stats infos subcommand 2020-10-02 16:46:07 +02:00
Kerollmops
079742b4d3 Clean up the stats and size of database infos subcommands 2020-10-02 16:46:06 +02:00
Kerollmops
d0c73564b1 Use the CboRoaringBitmapCodec for the word pair proximity docids 2020-10-02 16:46:06 +02:00
Kerollmops
5a6a698e1d Introduce the CboRoaringBitmapCodec 2020-10-02 16:46:06 +02:00
Kerollmops
4eda149ffa Rename the BoRoaringBitmap codec 2020-10-02 16:46:06 +02:00
Clément Renault
ac84db2506 Move the words pairs proximities average into the stats infos subcommand 2020-10-02 16:46:06 +02:00
Kerollmops
30755e31e7 Introduce the words pairs proximities stats info subcommand 2020-10-02 16:46:06 +02:00
Clément Renault
bc35c9a598 Introduce the size_of_database infos subcommand 2020-10-02 16:46:05 +02:00
gorogoroumaru
f4d918d22a Merge branch 'master' into issue943 2020-10-02 21:01:31 +09:00
gorogoroumaru
5ef3a01b6c Merge branch 'issue943' of https://github.com/gorogoroumaru/MeiliSearch into issue943 2020-10-02 20:01:13 +09:00
gorogoroumaru
5a98f1f076 sort facetsDistribution attribute 2020-10-02 20:00:55 +09:00
bors[bot]
4398f2c023 Merge #982
982: fix backups r=MarinPostma a=LegendreM

* pluralize variable `backup_folder` -> `backups_folder`
* change env case `MEILI_backup_folder` -> `MEILI_BACKUPS_FOLDER`
* add miliseconds to backup ID to reduce colisions

Co-authored-by: many <maxime@meilisearch.com>
v0.15.0rc2 v0.15.0
2020-09-30 17:02:34 +00:00
Kerollmops
c6b883289c Remove the unused fetch_keywords function 2020-09-30 15:41:23 +02:00
many
afc3b0915b fix backups
* pluralize variable `backup_folder` -> `backups_folder`
* change env case `MEILI_backup_folder` -> `MEILI_BACKUPS_FOLDER`
* add miliseconds to backup ID to reduce colisions
* fix forgoten stats synchronization
2020-09-30 13:20:40 +02:00
Kerollmops
58237bd67f Introduce the average-number-of-document-by-word-pair-proximity infos subcommand 2020-09-29 18:32:48 +02:00
Kerollmops
991be8950e Rename the subcommand into average-number-of-positions-by-word-by-doc 2020-09-29 18:15:44 +02:00
Kerollmops
54370e228a Search for documents with longer proximities until we find enough 2020-09-29 17:37:14 +02:00
Kerollmops
f277ea134f Simplify some search function by reducing the number of parameters 2020-09-29 16:08:58 +02:00
Kerollmops
68f4af7d2e Improve the display of the number of processed documents 2020-09-29 16:08:58 +02:00
Kerollmops
59a127d022 Improve the indexing process
We now store the words pairs proximity in a cache and only compute the
shortest proximity between pairs of words in a document.
2020-09-29 15:09:18 +02:00
bors[bot]
f313de98c8 Merge #980
980: bump meilisearch to v0.15.0 r=Kerollmops a=MarinPostma



Co-authored-by: mpostma <postma.marin@protonmail.com>
v0.15.0rc0
2020-09-28 15:09:26 +00:00
mpostma
03d4651077 bump meilisearch 2020-09-28 16:56:05 +02:00
Kerollmops
6ddb3e722c Depth-first search cache the docids unions 2020-09-28 16:55:21 +02:00
Kerollmops
a3821a0b33 Introduce the depth_first_search path resolution function 2020-09-28 16:34:12 +02:00
bors[bot]
32f6a9a457 Merge #976
976: Revert 944 r=MarinPostma a=MarinPostma

revert #944 
@bidoubiwa  @curquiza @eskombro, this was a misunderstanding from our side. Doing this would in fact be an error, and would prevent us to do this: https://github.com/meilisearch/MeiliSearch/issues/945#issuecomment-685526678, which is what we are really after. We are resetting this to its default behaviour before it goes in prodution. Sorry for the confusion.

Co-authored-by: mpostma <postma.marin@protonmail.com>
2020-09-28 13:38:46 +00:00
bors[bot]
099a0802fc Merge #916
916: Considere an empty query search as a placeholder search r=MarinPostma a=qdequele

Fix #856; Relative tracking issue: #729

Co-authored-by: qdequele <quentin@meilisearch.com>
2020-09-28 13:13:47 +00:00
bors[bot]
e258e0b2c2 Merge #887
887: backup r=Kerollmops a=LegendreM

[Tracking Issue](https://github.com/meilisearch/MeiliSearch/issues/840)
[Documentation PR](https://github.com/meilisearch/documentation/pull/468)
[Other relevant issue](https://github.com/meilisearch/MeiliSearch/issues/884)

Co-authored-by: many <maxime@meilisearch.com>
2020-09-28 12:47:08 +00:00
many
c254320860 Implement backups
* trigger backup importation via http route
* follow backup advancement with status route
* import backup via a command line
* let user choose batch size of documents to import (command lines)

closes #884
closes #840
2020-09-28 14:40:06 +02:00
qdequele
51fd849852 cargo fmt 2020-09-28 14:23:32 +02:00
qdequele
ab170ce4fd add test 2020-09-28 14:19:45 +02:00
qdequele
90226dc8a9 Considere an empty query search as a placeholder search #916 2020-09-28 14:19:45 +02:00
Kerollmops
51c237f9d8 Fix the benchmarks compilation 2020-09-28 13:39:17 +02:00
Clément Renault
d8354f6f02 Fix the word_docids capacity limit detection 2020-09-27 11:52:05 +02:00
bors[bot]
63868b2600 Merge #977
977: update pest dependency r=Kerollmops a=MarinPostma

update pest dependency to official repo

Co-authored-by: mpostma <postma.marin@protonmail.com>
2020-09-25 19:35:25 +00:00
mpostma
22d439f682 update pest dependency 2020-09-24 18:36:38 +02:00
bors[bot]
394f2abd49 Merge #971
971: Meili tests r=MarinPostma a=MarinPostma

#869 

Co-authored-by: mpostma <postma.marin@protonmail.com>
2020-09-24 16:06:35 +00:00
mpostma
030bcd8b05 Revert "facet count more tests"
This reverts commit 954f572e79.
2020-09-24 16:40:18 +02:00
mpostma
d8d29d3615 Revert "fix facet count bug"
This reverts commit 733c02dd7c.
2020-09-24 16:39:42 +02:00
Clément Renault
25b2853b70 Move the words pairs proximities compute into the write document function 2020-09-23 15:02:40 +02:00
Clément Renault
ed05999f63 Replace the arc cache by a simple linked hash map 2020-09-23 14:50:52 +02:00