meilisearch

mirror of https://github.com/meilisearch/meilisearch.git synced 2025-10-25 04:56:28 +00:00

Author	SHA1	Message	Date
ManyTheFish	3b3fa38f27	Put the restrict list in a sub-struct	2023-11-28 18:37:57 +01:00
ManyTheFish	d6c2ee15a9	Filter on attributes before computing the docids when attribute restriction is on	2023-11-28 14:55:29 +01:00
ManyTheFish	dc07790133	Add test reproducing #4232	2023-11-27 11:39:11 +01:00
meili-bors[bot]	b11f85a635	Merge #4205 4205: Prevent search hang on the processing index r=Kerollmops a=dureuill Fixes #4206, an issue originally [reported on Discord](https://discord.com/channels/1006923006964154428/1148983671026618579/1148983671026618579) where having parallel search requests on more indexes than the index cache capacity would cause search requests on the currently updating index to hang until the index is done updating. ## Test setup - Create 20 empty indexes by sending settings to them - repeatedly send placeholder search requests to each of the indexes in a loop - Create another index and send a significant batch of documents to index. - Attempt to perform a search request on that last index. - Before this PR, the search request hangs while the index update task is processing - After this PR, the search request respond immediately even while the index update task is processing ## Changes - When getting the handle to an index for some potentially long running batches of tasks, save it in the index scheduler. - Drop the handle from the index-scheduler when the task is done so that we don't leak indexes. - When getting an index from outside the task queue processor, check if there is such an handle matching the requested index. If so, skip the cache entirely and clone the handle. Co-authored-by: Louis Dureuil <louis.dureuil@xinra.net> Co-authored-by: Louis Dureuil <louis@meilisearch.com> v1.5.0 v1.5.0-rc.3	2023-11-13 10:36:01 +00:00
Louis Dureuil	a2d6dc8571	Fix typo, remove caching for the change of index	2023-11-13 10:44:36 +01:00
meili-bors[bot]	ee1701157f	Merge #4204 4204: Throw error when the vector search is sent with the wrong size r=Kerollmops a=dureuill # Pull Request ## Related issue Fixes #4201 Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2023-11-13 09:43:20 +00:00
Louis Dureuil	8c649d8061	Throw error when the vector search is sent with the wrong size	2023-11-13 09:57:42 +01:00
Louis Dureuil	492fc086f0	cargo fmt	2023-11-12 21:53:11 +01:00
Louis Dureuil	a2d0c73b41	Save the currently updating index so that the search can access it at all times	2023-11-10 10:52:03 +01:00
meili-bors[bot]	54f0ee1ed2	Merge #4167 4167: Introduce the `meilitool` command line interface r=Kerollmops a=Kerollmops This PR introduces a small tool to help the Cloud team: - Clear the tasks queue by removing all the tasks - Dump a Meilisearch database without having to enqueue the task - Access this `meilitool` binary from the Docker Image ## TODO - [x] Modify the Docker File to ship with this new tool (`@curquiza,` could you review that, please?) - [x] Clear the tasks queue by removing all the tasks - [x] Add more logs to explain what is happening - [x] Clear the `update_files` folder - [x] Dump a Meilisearch database without having to enqueue the task - [x] Add more logs to explain what is happening - [x] Introduce a flag to skip dumping enqueued and processing tasks. - [x] Dump the instance uid. - [x] Dump the keys. - [x] Dump the tasks with the update files. - [x] Dump the index documents and settings. - [ ] ~Dump the experimental features~ Co-authored-by: Clément Renault <clement@meilisearch.com> v1.5.0-rc.2	2023-10-31 14:05:22 +00:00
Clément Renault	ce5647e730	Fix Dockerfile WORKDIR path prototype-meilitool-3	2023-10-30 17:27:59 +01:00
Clément Renault	b57b818b67	Don't use the last version of clap prototype-meilitool-2	2023-10-30 16:57:31 +01:00
Clément Renault	f7ea94e5f4	Modify the Dockerfile to compile meilisearch and meilitool	2023-10-30 16:32:17 +01:00
Clément Renault	53382bb1b8	Introduce a new flag to skip dumping enqueued/processing tasks	2023-10-30 14:32:10 +01:00
Clément Renault	5b004a2583	Add more logs to the dump exporter	2023-10-30 14:31:55 +01:00
Clément Renault	13416ccbf7	Introduce a new meilitool to help the cloud team	2023-10-30 14:30:20 +01:00
meili-bors[bot]	2614e7d9ca	Merge #4174 4174: Fix warnings r=dureuill a=irevoire Fix all the warnings found in the CI: https://github.com/meilisearch/meilisearch/actions/runs/6622576021/job/17988323623 Co-authored-by: Tamo <tamo@meilisearch.com> v1.5.0-rc.1	2023-10-30 10:12:54 +00:00
Tamo	e7244aa485	fix warnings	2023-10-30 11:00:46 +01:00
meili-bors[bot]	9cacc82307	Merge #4169 4169: update charabia r=curquiza a=ManyTheFish Update Charabia to v0.8.5 and add the new khmer tokenizer Co-authored-by: ManyTheFish <many@meilisearch.com>	2023-10-26 17:21:30 +00:00
ManyTheFish	4c6fddb1cb	update charabia	2023-10-26 17:01:10 +02:00
meili-bors[bot]	ca52021079	Merge #4154 4154: Update version for the next release (v1.5.0) in Cargo.toml r=curquiza a=meili-bot ⚠️ This PR is automatically generated. Check the new version is the expected one and Cargo.lock has been updated before merging. Co-authored-by: curquiza <curquiza@users.noreply.github.com>	2023-10-23 12:00:50 +00:00
curquiza	ee6f79d60b	Update version for the next release (v1.5.0) in Cargo.toml	2023-10-23 11:49:07 +00:00
meili-bors[bot]	e4c24ca6a3	Merge #4151 4151: Bring back changes from v1.4.2 into `release-v1.5.0` r=dureuill a=curquiza This will bring the fixes in v1.4.2 for v1.5.0 release Co-authored-by: curquiza <curquiza@users.noreply.github.com> Co-authored-by: Vivek Kumar <vivek.26@outlook.com> Co-authored-by: Louis Dureuil <louis.dureuil@gmail.com> v1.5.0-rc.0	2023-10-23 10:11:11 +00:00
Louis Dureuil	2bae9550c8	Add explanatory comment	2023-10-23 12:06:28 +02:00
Vivek Kumar	32c78ac8b1	add/update tests when search with distinct attribute & pagination with no ranking	2023-10-23 12:06:27 +02:00
Vivek Kumar	5fe7c4545a	compute all candidates correctly when skipping	2023-10-23 12:02:45 +02:00
curquiza	2042229927	Update version for the next release (v1.4.2) in Cargo.toml	2023-10-23 12:02:45 +02:00
meili-bors[bot]	eae9eab181	Merge #4126 4126: Make the experimental route /metrics activable via HTTP r=dureuill a=braddotcoffee # Pull Request ## Related issue Closes #4086 ## What does this PR do? - [x] Make `/metrics` available via HTTP as described in #4086 - [x] The users can still launch Meilisearch using the `--experimental-enable-metrics` flag. - [x] If the flag `--experimental-enable-metrics` is activated, a call to the `GET /experimental-features` route right after the launch will show `"metrics": true` even if the user has not called the `PATCH /experimental-features` route yet. - [x] Even if the --experimental-enable-metrics flag is present at launch, calling the `PATCH /experimental-features` route with `"metrics": false` disables the experimental feature. - [x] Update the spec - I was unable to find docs in this repository to update about the `/experimental-features` endpoint. I'll happily update if you point me in the right direction! ## PR checklist Please check if your PR fulfills the following requirements: - [x] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)? - [x] Have you read the contributing guidelines? - [x] Have you made sure that the title is accurate and descriptive of the changes? Co-authored-by: bwbonanno <bradfordbonanno@gmail.com> Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2023-10-23 08:51:37 +00:00
Louis Dureuil	cf8dad1ca0	index_scheduler.features() is no longer fallible	2023-10-23 10:38:56 +02:00
bwbonanno	dd619913da	Use RwLock to never persist cli state to db	2023-10-19 12:45:57 -07:00
meili-bors[bot]	9b55ff16e9	Merge #4134 4134: Bump rustix from 0.36.15 to 0.36.16 r=Kerollmops a=dependabot[bot] Bumps [rustix](https://github.com/bytecodealliance/rustix) from 0.36.15 to 0.36.16. <details> <summary>Commits</summary> <ul> <li><a href="`6534992521`"><code>6534992</code></a> chore: Release rustix version 0.36.16</li> <li><a href="`4928cf7a38`"><code>4928cf7</code></a> Disable riscv64 testing.</li> <li><a href="`8cc159c4c3`"><code>8cc159c</code></a> Fix the <code>test_ttyname_ok</code> test when /dev/stdin is inaccessable. (<a href="https://redirect.github.com/bytecodealliance/rustix/issues/821">#821</a>)</li> <li><a href="`6dc7ba9478`"><code>6dc7ba9</code></a> Downgrade dependencies and disable tests to compile under Rust 1.48.</li> <li><a href="`ded8986e7e`"><code>ded8986</code></a> Disable MIPS in CI. (<a href="https://redirect.github.com/bytecodealliance/rustix/issues/793">#793</a>)</li> <li><a href="`739f9c3ba0`"><code>739f9c3</code></a> Fixes for <code>Dir</code> on macOS, FreeBSD, and WASI.</li> <li><a href="`87481a97f4`"><code>87481a9</code></a> Merge pull request from GHSA-c827-hfw6-qwvm</li> <li>See full diff in <a href="https://github.com/bytecodealliance/rustix/compare/v0.36.15...v0.36.16">compare view</a></li> </ul> </details> <br /> [![Dependabot compatibility score](https://dependabot-badges.githubapp.com/badges/compatibility_score?dependency-name=rustix&package-manager=cargo&previous-version=0.36.15&new-version=0.36.16)](https://docs.github.com/en/github/managing-security-vulnerabilities/about-dependabot-security-updates#about-compatibility-scores) Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting ``@dependabot` rebase`. [//]: # (dependabot-automerge-start) [//]: # (dependabot-automerge-end) --- <details> <summary>Dependabot commands and options</summary> <br /> You can trigger Dependabot actions by commenting on this PR: - ``@dependabot` rebase` will rebase this PR - ``@dependabot` recreate` will recreate this PR, overwriting any edits that have been made to it - ``@dependabot` merge` will merge this PR after your CI passes on it - ``@dependabot` squash and merge` will squash and merge this PR after your CI passes on it - ``@dependabot` cancel merge` will cancel a previously requested merge and block automerging - ``@dependabot` reopen` will reopen this PR if it is closed - ``@dependabot` close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - ``@dependabot` show <dependency name> ignore conditions` will show all of the ignore conditions of the specified dependency - ``@dependabot` ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - ``@dependabot` ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - ``@dependabot` ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) You can disable automated security fix PRs for this repo from the [Security Alerts page](https://github.com/meilisearch/meilisearch/network/alerts). </details> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-10-19 08:01:36 +00:00
dependabot[bot]	e761db582f	Bump rustix from 0.36.15 to 0.36.16 Bumps [rustix](https://github.com/bytecodealliance/rustix) from 0.36.15 to 0.36.16. - [Release notes](https://github.com/bytecodealliance/rustix/releases) - [Commits](https://github.com/bytecodealliance/rustix/compare/v0.36.15...v0.36.16) --- updated-dependencies: - dependency-name: rustix dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2023-10-18 18:42:12 +00:00
bwbonanno	d8c649b3cd	Return recoverable error if we fail to retrieve metrics state	2023-10-18 08:28:24 -07:00
meili-bors[bot]	5e0485d8dd	Merge #4131 4131: Reduce proximity range from 7 to 3 r=Kerollmops a=ManyTheFish ## Summary This PR aims to reduce the impact of the proximity databases on the indexing time and on the database size by reducing the maximum distance between two words to be indexed in the proximity database. ## Stats ### Impact on database size and indexing time ![Impact on datasets](https://github.com/meilisearch/meilisearch/assets/6482087/28ed3d96-bdde-41c1-bdac-e90c1b1dbb23) ### Impact on search relevancy <details> \| dataset_name \| host_name \| Relevancy rate (Precision) \| completion_rate 25.00% \| completion_rate 50.00% \| completion_rate 75.00% \| completion_rate 100.00% \| \|--------------\|------------------\|------------------------------------\|-----------------\|-----------------\|-----------------\|-----------------\| \| FBIS \| 1_4_0 \| percentile-10 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FBIS \| 1_4_0 \| percentile-25 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FBIS \| 1_4_0 \| percentile-50 \| 0.00% \| 0.00% \| 5.00% \| 5.56% \| \| FBIS \| 1_4_0 \| percentile-75 \| 0.00% \| 12.50% \| 35.00% \| 45.00% \| \| FBIS \| 1_4_0 \| percentile-90 \| 20.00% \| 40.00% \| \| 100.00% \| \| FBIS \| 1_4_0 \| average \| 5.78% \| 11.16% \| 21.90% \| 26.29% \| \| FBIS \| reduce_proximity \| percentile-10 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FBIS \| reduce_proximity \| percentile-25 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FBIS \| reduce_proximity \| percentile-50 \| 0.00% \| 0.00% \| 5.00% \| 5.56% \| \| FBIS \| reduce_proximity \| percentile-75 \| 0.00% \| 15.00% \| 35.00% \| 40.00% \| \| FBIS \| reduce_proximity \| percentile-90 \| 20.00% \| 40.00% \| 85.00% \| 100.00% \| \| FBIS \| reduce_proximity \| average \| 5.55% \| 11.34% \| 21.75% \| 26.14% \| \| FR94 \| 1_4_0 \| percentile-10 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FR94 \| 1_4_0 \| percentile-25 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FR94 \| 1_4_0 \| percentile-50 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FR94 \| 1_4_0 \| percentile-75 \| 0.00% \| 5.00% \| 15.00% \| 42.11% \| \| FR94 \| 1_4_0 \| percentile-90 \| 15.00% \| 54.55% \| 100.00% \| 100.00% \| \| FR94 \| 1_4_0 \| average \| 5.95% \| 12.07% \| 18.70% \| 25.57% \| \| FR94 \| reduce_proximity \| percentile-10 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FR94 \| reduce_proximity \| percentile-25 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FR94 \| reduce_proximity \| percentile-50 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FR94 \| reduce_proximity \| percentile-75 \| 0.00% \| 5.00% \| 15.00% \| 42.11% \| \| FR94 \| reduce_proximity \| percentile-90 \| 15.00% \| 54.55% \| 100.00% \| 100.00% \| \| FR94 \| reduce_proximity \| average \| 5.79% \| 12.00% \| 18.70% \| 25.53% \| \| FT \| 1_4_0 \| percentile-10 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FT \| 1_4_0 \| percentile-25 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FT \| 1_4_0 \| percentile-50 \| 0.00% \| 0.00% \| 5.00% \| 10.00% \| \| FT \| 1_4_0 \| percentile-75 \| 0.00% \| 15.00% \| 30.00% \| 40.00% \| \| FT \| 1_4_0 \| percentile-90 \| 20.00% \| 50.00% \| 65.00% \| 100.00% \| \| FT \| 1_4_0 \| average \| 5.08% \| 12.58% \| 20.00% \| 25.49% \| \| FT \| reduce_proximity \| percentile-10 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FT \| reduce_proximity \| percentile-25 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| FT \| reduce_proximity \| percentile-50 \| 0.00% \| 0.00% \| 5.00% \| 10.00% \| \| FT \| reduce_proximity \| percentile-75 \| 0.00% \| 15.00% \| 30.00% \| 40.00% \| \| FT \| reduce_proximity \| percentile-90 \| 10.00% \| 45.00% \| 60.00% \| 100.00% \| \| FT \| reduce_proximity \| average \| 5.01% \| 12.64% \| 20.10% \| 25.53% \| \| LAT \| 1_4_0 \| percentile-10 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| LAT \| 1_4_0 \| percentile-25 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| LAT \| 1_4_0 \| percentile-50 \| 0.00% \| 0.00% \| 5.00% \| 5.00% \| \| LAT \| 1_4_0 \| percentile-75 \| 5.00% \| 15.00% \| 30.00% \| 30.00% \| \| LAT \| 1_4_0 \| percentile-90 \| 15.00% \| 45.00% \| 60.00% \| 80.00% \| \| LAT \| 1_4_0 \| average \| 4.80% \| 11.80% \| 17.88% \| 21.62% \| \| LAT \| reduce_proximity \| percentile-10 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| LAT \| reduce_proximity \| percentile-25 \| 0.00% \| 0.00% \| 0.00% \| 0.00% \| \| LAT \| reduce_proximity \| percentile-50 \| 0.00% \| 0.00% \| 5.00% \| 5.00% \| \| LAT \| reduce_proximity \| percentile-75 \| 0.00% \| 11.11% \| 25.00% \| 35.00% \| \| LAT \| reduce_proximity \| percentile-90 \| 15.00% \| 45.00% \| 55.00% \| 80.00% \| \| LAT \| reduce_proximity \| average \| 4.43% \| 11.23% \| 17.32% \| 21.45% \| </details> ### Impact on Search time \| dataset_name \| host_name \| 25.00% \| 50.00% \| 75.00% \| 100.00% \| Average \| \|--------------\|------------------\|------------:\|------------:\|------------:\|------------:\|-------------\| \| FBIS \| 1_4_0 \| 3.45 \| 7.446666667 \| 9.773489933 \| 9.620300752 \| 7.572614338 \| \| FBIS \| reduce_proximity \| 2.983333333 \| 5.316666667 \| 6.911073826 \| 7.637218045 \| 5.712072968 \| \| FR94 \| 1_4_0 \| 2.236666667 \| 4.45 \| 5.523489933 \| 4.560150376 \| 4.192576744 \| \| FR94 \| reduce_proximity \| 2.09 \| 3.991666667 \| 4.981543624 \| 4.266917293 \| 3.832531896 \| \| FT \| 1_4_0 \| 5.956666667 \| 9.656666667 \| 13.86912752 \| 10.83270677 \| 10.0787919 \| \| FT \| reduce_proximity \| 4.51 \| 5.981666667 \| 7.701342282 \| 6.766917293 \| 6.23998156 \| \| LAT \| 1_4_0 \| 5.856666667 \| 9.233333333 \| 12.98322148 \| 10.78759398 \| 9.715203865 \| \| LAT \| reduce_proximity \| 6.91 \| 6.706666667 \| 8.463087248 \| 8.265037594 \| 7.586197877 \| ## Technical approach - Ensure the MAX_DISTANCE constant is used everywhere needed - Reduce the MAX_DISTANCE from 8 to 4 ## Related TBD Co-authored-by: ManyTheFish <many@meilisearch.com>	2023-10-18 14:56:08 +00:00
ManyTheFish	27eec21415	Fix tests	2023-10-18 16:03:22 +02:00
bwbonanno	2b3adef796	Use index_scheduler from configured app_data in middleware	2023-10-17 08:17:13 -07:00
bwbonanno	956cfc5487	Add runtime check to metrics middleware	2023-10-16 13:48:57 -07:00
bwbonanno	12fc878640	Merge remote-tracking branch 'origin/main' into enable-metrics-http	2023-10-16 13:48:01 -07:00
meili-bors[bot]	0a2e8b92a9	Merge #4129 4129: Add webinar banner in README r=curquiza a=curquiza Co-authored-by: curquiza <clementine@meilisearch.com>	2023-10-16 17:35:48 +00:00
meili-bors[bot]	c7a3f80de6	Merge #4073 4073: Simplify Puffin report exports r=ManyTheFish a=Kerollmops This PR changes how we export Puffin reports by directly writing them to disk when the `exportPuffinReports` [experimental feature is enabled](https://www.meilisearch.com/docs/learn/experimental/overview) on the `/experimental-features` route. It also adds more puffing logging to the deletion phase and grenad helpers. The puffin reports are identified by the date and time at which they are exported. ## Todo List - [x] Change the CLI flag to be an API experimental option. - [x] Create [a PRD for this experimental feature (private)](https://www.notion.so/meilisearch/Export-Puffin-Reports-091df151e71c4edfb7d72f4bf995b3ea). - [x] Create and complete [a product discussion](https://github.com/meilisearch/product/discussions/693) (copy/paste PROFILING markdown?). - [x] Update the _PROFILING.md_ markdown file instructions. - [x] Change the debug logs of the processing operation (visible in puffin viewer). Co-authored-by: Clément Renault <clement@meilisearch.com> Co-authored-by: Kerollmops <clement@meilisearch.com>	2023-10-16 15:48:15 +00:00
curquiza	029d4de043	Add webinar banner in README	2023-10-16 14:38:10 +02:00
meili-bors[bot]	549f1bcccf	Merge #4125 4125: Rename benchmark CI file to find it easily in the manifest list r=Kerollmops a=curquiza Co-authored-by: curquiza <clementine@meilisearch.com>	2023-10-16 11:38:28 +00:00
bwbonanno	689ec7c7ad	Make the experimental route /metrics activable via HTTP	2023-10-13 22:12:54 +00:00
Clément Renault	3655d4bdca	Move the puffin file export logic into the run function	2023-10-13 13:11:30 +02:00
Clément Renault	055ca3935b	Update index-scheduler/src/batch.rs Co-authored-by: Tamo <tamo@meilisearch.com>	2023-10-13 13:11:30 +02:00
Kerollmops	1b8871a585	Make cargo insta happy	2023-10-13 13:11:30 +02:00
Kerollmops	bf8fac6676	Fix the tests	2023-10-13 13:11:30 +02:00
Kerollmops	f2a9e1ebbb	Improve the debugging experience in the puffin reports	2023-10-13 13:11:30 +02:00
Kerollmops	c45c6cf54c	Update the PROFILING.md file	2023-10-13 13:11:30 +02:00
Kerollmops	513e61e9a3	Remove the experimental CLI flag	2023-10-13 13:11:29 +02:00

1 2 3 4 5 ...

8594 Commits