| 
							
							
								![meili-bors[bot]](/avatar/c50245314bc32fd9ece1a2e588f6124c?size=56) meili-bors[bot] | c2c1ba39ee | Merge #4826 4826: Update Charabia v0.9.0 r=dureuill a=ManyTheFish
# Pull Request
## Related Changelog
https://github.com/meilisearch/charabia/releases/tag/v0.9.0
## Notable Change for Meilisearch
Adds all math symbols from https://www.compart.com/en/unicode/category/Sm to the default separator list.
Co-authored-by: ManyTheFish <many@meilisearch.com> | 2024-07-25 14:08:38 +00:00 |  | 
			
				
					| 
							
							
								 ManyTheFish | 35567b2137 | Update Charabia v0.9.0 | 2024-07-25 16:02:14 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | d4ea7cc2a9 | fix clippy đđ | 2024-07-25 12:10:32 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 2413592bbf | Display docid when there are documents without manual embeddings for a manual embedder | 2024-07-25 12:10:32 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 553440632e | Introduce Setting::some_or_not_set | 2024-07-25 12:01:52 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 7a347966da | Allow explicit dimensionsfor ollama | 2024-07-25 12:01:51 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 4654d51e05 | Add custom headers for REST embedder | 2024-07-25 12:01:51 +02:00 |  | 
			
				
					| 
							
							
								 ManyTheFish | a918561ac1 | Fix PR comments | 2024-07-25 10:52:56 +02:00 |  | 
			
				
					| 
							
							
								 ManyTheFish | 70d71581ee | fix clippy | 2024-07-25 10:52:56 +02:00 |  | 
			
				
					| 
							
							
								 ManyTheFish | 04fa44e7eb | Implement localized attributes settings | 2024-07-25 10:51:27 +02:00 |  | 
			
				
					| 
							
							
								 ManyTheFish | 90c0a6db7d | Implement localized search | 2024-07-25 10:51:27 +02:00 |  | 
			
				
					| 
							
							
								 ManyTheFish | cc02920f2b | Update charabia | 2024-07-25 10:51:27 +02:00 |  | 
			
				
					| 
							
							
								 Tamo | 988552e178 | add tests on the rest embedder | 2024-07-24 14:34:17 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 0d8199f3b7 | Change parameters in milli settings | 2024-07-24 14:34:17 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 4b74803dae | Change parameters in vector settings | 2024-07-24 14:34:17 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | d731fa661b | ollama and openai use new EmbedderOptions | 2024-07-24 14:34:17 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | a1beddd5d9 | rest embedder: use json_template | 2024-07-24 14:34:17 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 4109182ca4 | Add json_template module | 2024-07-24 14:34:12 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 1a297c048e | Error changes | 2024-07-24 14:34:12 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 303e601b87 | HuggingFace: Clearer error message when a model is not supported | 2024-07-23 15:13:22 +02:00 |  | 
			
				
					| 
							
							
								![meili-bors[bot]](/avatar/c50245314bc32fd9ece1a2e588f6124c?size=56) meili-bors[bot] | ea73615abf | Merge #4804 4804: Implements the experimental contains filter operator r=irevoire a=irevoire
# Pull Request
Related PRD: (private link) https://www.notion.so/meilisearch/Contains-Like-Filter-Operator-0d8ad53c6761466f913432eb1d843f1e
Public usage page: https://meilisearch.notion.site/Contains-filter-operator-usage-3e7421b0aacf45f48ab09abe259a1de6
## Related issue
Fixes https://github.com/meilisearch/meilisearch/issues/3613
## What does this PR do?
- Extract the contains operator from this PR: https://github.com/meilisearch/meilisearch/pull/3751
- Gate it behind a feature flag
- Add tests
Co-authored-by: Tamo <tamo@meilisearch.com> | 2024-07-17 15:47:11 +00:00 |  | 
			
				
					| 
							
							
								 Tamo | 02c61eabfa | fix the range reported when the experimental feature has not been set | 2024-07-17 16:54:33 +02:00 |  | 
			
				
					| 
							
							
								 Tamo | 2af9481804 | Implements the experimental contains filter operator« | 2024-07-17 11:13:37 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 24240934f9 | Improve errors when indexing documents with a user provided embedder | 2024-07-16 13:39:01 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | f4c94ac57f | manual embedders: limit max size of errors to 250 | 2024-07-16 13:39:01 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 4087a88dbe | rest|ollama|openai: increase tries to 10 + randomize retry duration | 2024-07-16 13:39:00 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 5adacf2f45 | OpenAI: embed only the first MAX_TOKENS tokens | 2024-07-16 13:39:00 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 65d0c32aa7 | Allow overriding OpenAI's url | 2024-07-16 13:39:00 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 82647bcded | When retrieveVectorsis true, retrieve_vectors.embeddereven if there are no vector for that embedder | 2024-07-16 13:39:00 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | e83da00446 | Milli changes to match to allow for more flexible lifetimes | 2024-07-11 16:29:35 +02:00 |  | 
			
				
					| 
							
							
								 Louis Dureuil | 7fb3e378ff | Do not fail sort comparisons when the field name or target point are different | 2024-07-11 16:28:14 +02:00 |  | 
			
				
					| 
							
							
								![meili-bors[bot]](/avatar/c50245314bc32fd9ece1a2e588f6124c?size=56) meili-bors[bot] | 29b44e5541 | Merge #4626 4626: Edit Documents with Rhai r=ManyTheFish a=Kerollmops
This PR introduces a first version of [the _Update Documents with Function_ (internal)](https://www.notion.so/meilisearch/Update-Documents-by-Function-45f87b13e61c4435b73943768a490808). It uses [the Rhai programming language](https://rhai.rs/) to let users express the modifications they want apply.
You can read more about the way to use this functions on [the Usage PRD Page](https://meilisearch.notion.site/Edit-Documents-with-Rhai-0cff8fea7655436592e7c8a6de932062?pvs=25). The [prototype is available](https://github.com/meilisearch/meilisearch/actions/runs/9038384483) through Docker by using the following command:
```
docker run -p 7700:7700 -v $(pwd)/meili_data:/meili_data getmeili/meilisearch:prototype-edit-documents-with-rhai-3
```
## TODO
 - [x] Support the `DocumentEdition` task in dumps.
 - [x] Remove the unwraps and panics.
 - [x] Improve error codes for the `function` parameter.
 - [x] [Update Rhai to v1.19.0](https://github.com/rhaiscript/rhai/releases/tag/v1.19.0) đ
 - [x] Make it an experimental feature (only restrict the HTTP calls).
 - [x] It must be possible not to send a context.
 - [x] Rebase on main.
 - [x] Check that the script cannot do any io.
 - [x] ~Introduce a `Documents.edit` action or~ require the `Documents.all` action.
 - [x] Change the `editionCode` to the clearer `function` field name in the tasks.
 - [x] Support a user provided context and maybe more (but keep function execution isolated for reproducibility).
 - [x] Support deleting documents when the `doc` is `()` (nil, null).
 - [x] Support canceling document edition.
 - [x] Multithread document edition by using rayon (and [rayon-par-bridge](https://docs.rs/rayon-par-bridge/latest/rayon_par_bridge/)).
 - [x] Limit the number of instruction by function execution.
 - [ ] ~Expose the limit of instructions in the settings.~ Not sure, in fact.
 - [x] Ignore unmodified documents in the tasks count.
 - [x] Make the `filter` field optional (not forced to be `null`).
Co-authored-by: Clément Renault <clement@meilisearch.com> | 2024-07-11 09:02:55 +00:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 6e80364c50 | Apply review comments | 2024-07-11 11:00:27 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 3bac22fd87 | We do not do intersections with the universe when it is related to cache | 2024-07-10 16:49:36 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | ce61cb7fe6 | Simplify and speedup an intersection pass | 2024-07-10 16:49:36 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 1693d1a311 | Simplify the check to decide to stop a loop | 2024-07-10 16:49:36 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | febea735ca | Remove the unused universe parameter from resolve_negative_phrases | 2024-07-10 16:49:36 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 93ba051094 | Remove the invalid get_phrases_docids universe parameter | 2024-07-10 16:49:35 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | cd7a20fa32 | Make it work by avoid storing invalid stuff in the cache | 2024-07-10 16:49:35 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 41f51adbec | Do less useless intersections | 2024-07-10 16:49:35 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 0ca1a4e805 | Always do the intersections with the universe | 2024-07-10 16:49:34 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 50a7393c55 | Modify the compute_query_term_subset_docids function to accept the universe | 2024-07-10 16:49:34 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 837274f853 | Restrict even more the Rhai engine | 2024-07-10 16:30:18 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | aace587dd1 | Create errors for the internal processing ones | 2024-07-10 16:29:18 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | f35d6710f3 | Update rhai to v1.19.0 | 2024-07-10 16:29:17 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 81ec0abad1 | Use the new rayon-par-bridge library | 2024-07-10 16:29:04 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | b67d385cf0 | Parallelize the edition functions | 2024-07-10 16:28:54 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | dfecb25814 | Disable the time package | 2024-07-10 16:28:37 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 2eae2015d7 | Support aborting documents edition by function | 2024-07-10 16:28:15 +02:00 |  | 
			
				
					| 
							
							
								 Clément Renault | 33fa17bf12 | Support deleting documents with functions | 2024-07-10 16:28:15 +02:00 |  |