meilisearch

mirror of https://github.com/meilisearch/meilisearch.git synced 2025-10-25 13:06:27 +00:00

Author	SHA1	Message	Date
meili-bors[bot]	5046ffdf54	Merge #4512 4512: Revert "Revert "Merge remote-tracking branch 'origin/main' into release-v1.7.1"" r=Kerollmops a=irevoire Reverts meilisearch/meilisearch#4510 This PR was supposed to be merged on `release-v1.7.1` not main 🤦 Co-authored-by: Tamo <irevoire@protonmail.ch>	2024-03-20 09:14:43 +00:00
Tamo	c5322df519	Revert "Revert "Merge remote-tracking branch 'origin/main' into release-v1.7.1""	2024-03-20 10:08:28 +01:00
meili-bors[bot]	c495c8eb33	Merge #4510 4510: Revert "Merge remote-tracking branch 'origin/main' into release-v1.7.1" r=Kerollmops a=irevoire In https://github.com/meilisearch/meilisearch/pull/4502 we merged main into release-v1.7.1 instead of a temporary branch thus we now need to revert this merge commit. This reverts commit `bd74cce86a`, reversing changes made to `d2f77e88bd`. Co-authored-by: Tamo <tamo@meilisearch.com>	2024-03-19 16:02:24 +00:00
Tamo	567194b925	Revert "Merge remote-tracking branch 'origin/main' into release-v1.7.1" This reverts commit `bd74cce86a`, reversing changes made to `d2f77e88bd`.	2024-03-19 16:56:21 +01:00
meili-bors[bot]	5233534dc0	Merge #4477 4477: Add documentation for benchmarks r=dureuill a=dureuill See [CONTRIBUTING.md](https://github.com/meilisearch/meilisearch/blob/benchmark-docs/CONTRIBUTING.md#logging) Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-19 13:23:48 +00:00
meili-bors[bot]	fced2ff9ab	Merge #4502 4502: Release v1.7.1 r=dureuill a=Kerollmops Bring the v1.7.1 changes back to main. Co-authored-by: Clément Renault <clement@meilisearch.com> Co-authored-by: Kerollmops <Kerollmops@users.noreply.github.com> Co-authored-by: meili-bors[bot] <89034592+meili-bors[bot]@users.noreply.github.com>	2024-03-19 12:41:28 +00:00
Clément Renault	bd74cce86a	Merge remote-tracking branch 'origin/main' into release-v1.7.1	2024-03-19 13:39:17 +01:00
meili-bors[bot]	f85c80d059	Merge #4503 4503: Add settings diff indexing benchmarks r=dureuill a=ManyTheFish Add several benchmarks targetting settings diff-indexing enhancements Co-authored-by: ManyTheFish <many@meilisearch.com>	2024-03-19 10:35:46 +00:00
Louis Dureuil	2a92c04100	Adding new assets	2024-03-19 11:31:32 +01:00
ManyTheFish	e8516f00c4	move settings workload in root workload directory	2024-03-19 10:41:30 +01:00
ManyTheFish	29e71eedc7	Add benchmarks	2024-03-18 18:31:28 +01:00
meili-bors[bot]	10d053cd2f	Merge #4500 4500: Don't display dimensions as 0 when it is not set r=ManyTheFish a=dureuill Fixes regression in embedders where `dimensions: 0` was displayed when it hadn't be set for the `openAi` source. Was breaking a PHP SDK integration test: `cbaecb8c55/tests/Settings/EmbeddersTest.php (L28)` Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-18 15:21:24 +00:00
Louis Dureuil	a302e258bd	Don't display dimensions as 0 when it is not set	2024-03-18 16:10:12 +01:00
meili-bors[bot]	29840473b4	Merge #4499 4499: Fix milli link in contributing doc r=curquiza a=mohsen-alizadeh # Pull Request ## Related issue Fixes #4498 ## What does this PR do? The milli link in CONTRIBUTING.md targeted the archived milli repository. it has to be changed to target to the milli crate in the main repo ## PR checklist Please check if your PR fulfills the following requirements: - [X] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)? - [X] Have you read the contributing guidelines? - [X] Have you made sure that the title is accurate and descriptive of the changes? Thank you so much for contributing to Meilisearch! Co-authored-by: Mohsen Alizadeh <mohsen@alizadeh.us> Co-authored-by: Clémentine U. - curqui <clementine@meilisearch.com>	2024-03-18 14:39:26 +00:00
Clémentine U. - curqui	f4037c1a95	Update CONTRIBUTING.md Co-authored-by: Clément Renault <renault.cle@gmail.com>	2024-03-18 15:39:01 +01:00
Mohsen Alizadeh	13cc62728b	Fix milli link in contributing doc	2024-03-17 19:29:42 -07:00
meili-bors[bot]	f84bcb09e1	Merge #4491 4491: chore: remove repetitive words r=curquiza a=shuangcui # Pull Request ## Related issue Fixes #<issue_number> ## What does this PR do? - ... ## PR checklist Please check if your PR fulfills the following requirements: - [ ] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)? - [ ] Have you read the contributing guidelines? - [ ] Have you made sure that the title is accurate and descriptive of the changes? Thank you so much for contributing to Meilisearch! Co-authored-by: shuangcui <fliter@qq.com>	2024-03-14 17:44:01 +00:00
shuangcui	5c95b5c933	chore: remove repetitive words Signed-off-by: shuangcui <fliter@qq.com>	2024-03-14 21:28:55 +08:00
meili-bors[bot]	0b7bebeeb6	Merge #4483 4483: Workflows: Fix reason param when benches are triggered from a comment. r=irevoire a=dureuill Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-13 17:05:30 +00:00
meili-bors[bot]	d2f77e88bd	Merge #4479 4479: Skip reindexing when modifying unknown faceted fields r=dureuill a=Kerollmops This PR improves Meilisearch's decision to reindex when a faceted field is added to the settings, but not a single document contains this field. It is effectively a waste of time to reindex documents when the engine needs to know a field. This is related to a conversation [we have with our biggest customer (internal link)](https://discord.com/channels/1006923006964154428/1101213808627830794/1217112918857089187). They have 170 million documents, so reindexing this amount would be problematic. --- The image is available by using the following Docker command. You can see the advancement of the image's build [on the GitHub CI page](https://github.com/meilisearch/meilisearch/actions/runs/8251688778). ``` docker pull getmeili/meilisearch:prototype-no-reindex-unknown-fields-0 ``` Here is the hand-made test that shows that when modifying unknown filterable attributes, here `lol`, it doesn't reindex. However, when modifying the known `genre` field, it does reindex. You can see all that by looking at the time spent processing the update. ```json { "uid": 3, "indexUid": "movies", "status": "succeeded", "type": "settingsUpdate", "canceledBy": null, "details": { "filterableAttributes": [ "genres" ] }, "error": null, "duration": "PT9.237703S", "enqueuedAt": "2024-03-12T15:34:26.836083Z", "startedAt": "2024-03-12T15:34:26.836374Z", "finishedAt": "2024-03-12T15:34:36.074077Z" }, { "uid": 2, "indexUid": "movies", "status": "succeeded", "type": "settingsUpdate", "canceledBy": null, "details": { "filterableAttributes": [ "lol" ] }, "error": null, "duration": "PT0.000751S", "enqueuedAt": "2024-03-12T15:33:53.563923Z", "startedAt": "2024-03-12T15:33:53.565259Z", "finishedAt": "2024-03-12T15:33:53.56601Z" }, { "uid": 0, "indexUid": "movies", "status": "succeeded", "type": "documentAdditionOrUpdate", "canceledBy": null, "details": { "receivedDocuments": 31944, "indexedDocuments": 31944 }, "error": null, "duration": "PT3.120723S", "enqueuedAt": "2024-02-17T10:35:55.042864Z", "startedAt": "2024-02-17T10:35:55.043505Z", "finishedAt": "2024-02-17T10:35:58.164228Z" } ``` Co-authored-by: Clément Renault <clement@meilisearch.com> v1.7.1	2024-03-13 16:23:32 +00:00
meili-bors[bot]	1d8c13f595	Merge #4487 4487: Update version for the next release (v1.7.1) in Cargo.toml r=Kerollmops a=meili-bot ⚠️ This PR is automatically generated. Check the new version is the expected one and Cargo.lock has been updated before merging. Co-authored-by: Kerollmops <Kerollmops@users.noreply.github.com>	2024-03-13 15:41:10 +00:00
Kerollmops	7f3c495f5c	Update version for the next release (v1.7.1) in Cargo.toml	2024-03-13 14:49:21 +00:00
meili-bors[bot]	abd954755d	Merge #4476 4476: Make the `/facet-search` route use the `sortFacetValuesBy` setting r=irevoire a=Kerollmops This PR fixes #4423 by ensuring that the `/facet-search` route uses the `sortFacetValuesBy` setting. Note for the documentation team (to be moved in the tracking issue): Using the new `sortFacetValuesBy` setting can slow down the facet-search requests as Meilisearch iterates over the whole list of facet values and computes the count of documents on every entry. That is hardly or even impossible to optimize correctly. ### TODO - [x] Create a custom HashMap wrapper for the facet `OrderBy` settings. This wrapper will return the `OrderBy` setting of the facet, if not defined will use the default `*` one, and if not there either (strange) will fall back on the lexicographic one. - [x] Create a `ValuesCollection` wrapper that implements the logic for the lexicographic and count order by. - [x] Use it when there is no search query. - [x] Use it when there is a search query with and without allowed typos. - [x] Do not change the original logic, only use a wrapper. - [x] Add tests Co-authored-by: Clément Renault <clement@meilisearch.com>	2024-03-13 14:36:14 +00:00
Clément Renault	f3fc2bd01f	Address some issues with preallocations	2024-03-13 15:22:14 +01:00
Louis Dureuil	6fa3872268	Workflows: Fix reason param when benches are triggered from a comment.	2024-03-13 13:46:43 +01:00
Clément Renault	6c9823d7bb	Add tests to sortFacetValuesBy count	2024-03-13 11:59:39 +01:00
Clément Renault	e0dac5a22f	Simplify the algorithm by using the new facet values collection wrapper	2024-03-13 11:31:34 +01:00
Clément Renault	b918b55c6b	Introduce a new facet value collection wrapper to simply the usage	2024-03-13 11:31:34 +01:00
meili-bors[bot]	07b1d0edaf	Merge #4475 4475: Allow running benchmarks without sending results to the dashboard r=irevoire a=dureuill Adds a `--no-dashboard` option to avoid sending results to the dashboard. Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-13 09:59:52 +00:00
Clément Renault	306b25ad3a	Move the searchForFacetValues struct into a dedicated module	2024-03-13 10:24:21 +01:00
Clément Renault	9f7a4fbfeb	Return the facets of a placeholder facet-search sorted by count	2024-03-13 10:09:01 +01:00
meili-bors[bot]	5ed7b6a0b2	Merge #4456 4456: Add Ollama as an embeddings provider r=dureuill a=jakobklemm # Pull Request ## Related issue [Related Discord Thread](https://discord.com/channels/1006923006964154428/1211977150316683305) ## What does this PR do? - Adds Ollama as a provider of Embeddings besides HuggingFace and OpenAI under the name `ollama` - Adds the environment variable `MEILI_OLLAMA_URL` to set the embeddings URL of an Ollama instance with a default value of `http://localhost:11434/api/embeddings` if no variable is set - Changes some of the structs and functions in `openai.rs` to be public so that they can be shared. - Added more error variants for Ollama specific errors - It uses the model `nomic-embed-text` as default, but any string value is allowed, however it won't automatically check if the model actually exists or is an embedding model Tested against Ollama version `v0.1.27` and the `nomic-embed-text` model. ## PR checklist Please check if your PR fulfills the following requirements: - [x] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)? - [x] Have you read the contributing guidelines? - [x] Have you made sure that the title is accurate and descriptive of the changes? Co-authored-by: Jakob Klemm <jakob@jeykey.net> Co-authored-by: Louis Dureuil <louis.dureuil@gmail.com>	2024-03-13 08:48:47 +00:00
Louis Dureuil	ae67d5eef0	Update milli/src/vector/error.rs Fix Meilisearch capitalization	2024-03-13 09:45:04 +01:00
Jakob Klemm	88bc9556a9	Add Ollama dimension inference and add clearer errors Instead of the user manually specifying the model dimensions it will now automatically get determined Just like with hf.rs the word "test" gets embedded to determine the dimensions of the output Add a dedicated error type for if the model doesn't exist (don't automatically pull it though) and set the fault of that error to be the user	2024-03-12 19:59:11 +01:00
Clément Renault	ca4876fd10	Do not reindex when modifying unknown faceted field prototype-no-reindex-unknown-fields-0	2024-03-12 16:18:58 +01:00
Clément Renault	d3a95ea2f6	Introduce a new OrderByMap struct to simplify the sort by usage	2024-03-12 13:56:56 +01:00
Louis Dureuil	88d27949cd	Add documentation for benchmarks	2024-03-12 10:56:16 +01:00
Clément Renault	69c118ef76	Extract the facet order before extracting the facets values	2024-03-12 10:35:39 +01:00
meili-bors[bot]	d44e20aa89	Merge #4474 4474: Update cargo version r=irevoire a=curquiza Fixes #4417 Co-authored-by: curquiza <clementine@meilisearch.com>	2024-03-12 09:27:22 +00:00
Louis Dureuil	7b670a4afa	Allow dry runs for benchmarks where reports are generated but not sent to the dashboard	2024-03-12 10:26:13 +01:00
curquiza	fde209b7b6	Update cargo version	2024-03-12 10:20:07 +01:00
meili-bors[bot]	904b82a61d	Merge #4473 4473: Bring back changes from v1.7.0 to main r=curquiza a=curquiza Co-authored-by: ManyTheFish <many@meilisearch.com> Co-authored-by: Louis Dureuil <louis@meilisearch.com> Co-authored-by: Many the fish <many@meilisearch.com> Co-authored-by: Tamo <tamo@meilisearch.com> Co-authored-by: meili-bors[bot] <89034592+meili-bors[bot]@users.noreply.github.com>	2024-03-11 15:02:47 +00:00
Tamo	8ec3e30d2b	Merge branch 'main' into tmp-release-v1.7.0	2024-03-11 15:39:51 +01:00
meili-bors[bot]	0a59cb9734	Merge #4463 4463: Add tests when the field limit is reached r=Kerollmops a=irevoire # Pull Request ## Related issue Related to https://github.com/meilisearch/meilisearch/discussions/4429#discussioncomment-8689101 This user found out that the error message we’re supposed to return when the maximum number of attributes is reached is _not_ returned in some cases ## What does this PR do? - This PR adds four tests around the maximum number of attributes: 1. Add a document with u16::MAX + 1 fields - Meilisearch panics 2. Add two documents which together adds up to u16::MAX + 1 fields - Meilisearch returns the expected error 3. Add a document with u16::MAX + 1 nested fields - No error message but the document isn’t indexed 4. Add two documents which together add up to u16::MAX + 1 nested fields - Meilisearch doesn’t return any error but doesn’t index the document ## PR checklist Please check if your PR fulfills the following requirements: - [x] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)? - [x] Have you read the contributing guidelines? - [x] Have you made sure that the title is accurate and descriptive of the changes? Thank you so much for contributing to Meilisearch! Co-authored-by: Tamo <tamo@meilisearch.com>	2024-03-07 10:36:54 +00:00
Tamo	f053c280e1	add tests when the field limit is reached	2024-03-06 18:42:41 +01:00
meili-bors[bot]	ee3076d5ba	Merge #4462 4462: Divide threshold by ten r=dureuill a=ManyTheFish Change the facet incremental vs bulk indexing threshold to better fit our user needs, it might be changed in the future if we have more insights Co-authored-by: ManyTheFish <many@meilisearch.com> v1.7.0 v1.7.0-rc.2	2024-03-06 13:05:38 +00:00
meili-bors[bot]	ab1224bfa7	Merge #4458 4458: Replace logging timer by spans r=Kerollmops a=dureuill - Remove logging timer dependency. - Remplace last uses in search by spans Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-05 16:43:23 +00:00
meili-bors[bot]	eefc1c421e	Merge #4459 4459: Put a bound on OpenAI timeout r=dureuill a=dureuill # Pull Request ## Related issue Fixes #4460 ## What does this PR do? - Makes sure that the timeout of the openai embedder is limited to max 1min, rather than the prior 15min+ Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-05 15:18:51 +00:00
meili-bors[bot]	4d42a7af7c	Merge #4445 4445: Add subcommand to run benchmarks r=irevoire a=dureuill # Pull Request ## Related issue Not user-facing, no issue ## What does this PR do? - Adds a new `cargo xtask bench` subcommand that can run one or multiple workload files and report the results to a server - A workload file is a JSON file with a specific schema - Refactor our use of the `vergen` crate: - update to the beta `vergen-git2` crate - VERGEN_GIT_SEMVER_LIGHTWEIGHT => VERGEN_GIT_DESCRIBE - factor logic in a single `build-info` crate that is used both by meilisearch and xtask (prevents vergen variables from overriding themselves) - checked that defining the variables by hand when no git repo is available (docker build case) still works. - Add CI to run `cargo xtask bench` Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-05 14:03:57 +00:00
Louis Dureuil	7408db2a46	Meilisearch: fix date formatting	2024-03-05 14:56:48 +01:00

1 2 3 4 5 ...

9153 Commits