ManyTheFish 
							
						 
					 
					
						
						
							
						
						e741bc1c62 
					 
					
						
						
							
							Add proximity_precision value into the analytics  
						
						
						
						
					 
					
						2023-12-14 16:48:06 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						6425996e36 
					 
					
						
						
							
							Change the naming of attributeScale and wordScale into byAttribute and byWord  
						
						
						
						
					 
					
						2023-12-14 16:31:00 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						87bba98bd8 
					 
					
						
						
							
							Various changes  
						
						
						
						
						- fixed seed for arroy
- check vector dimensions as soon as it is provided to search
- don't embed whitespace 
						
						
					 
					
						2023-12-14 16:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						217105b7da 
					 
					
						
						
							
							hybrid search uses semantic ratio, error handling  
						
						
						
						
					 
					
						2023-12-14 16:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						1b7c164a55 
					 
					
						
						
							
							Pass the semantic ratio to milli  
						
						
						
						
					 
					
						2023-12-14 16:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						f3f3944469 
					 
					
						
						
							
							Fix error checking  
						
						
						
						
					 
					
						2023-12-14 16:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						93dcbf598d 
					 
					
						
						
							
							Deserialize semantic ratio  
						
						
						
						
					 
					
						2023-12-14 16:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						ac68f33194 
					 
					
						
						
							
							Add simple test  
						
						
						
						
					 
					
						2023-12-14 16:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						9991152bbe 
					 
					
						
						
							
							Add TODOs  
						
						
						
						
					 
					
						2023-12-14 16:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						3c1a14f1cd 
					 
					
						
						
							
							Add settings routes  
						
						
						
						
					 
					
						2023-12-14 16:08:42 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						806e5b6899 
					 
					
						
						
							
							Tests pass  
						
						
						
						
					 
					
						2023-12-14 16:08:41 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						e0cc775dc4 
					 
					
						
						
							
							Various changes  
						
						
						
						
						- DistributionShift in Search object (to be set from model in embed?)
- Fix issue where embedder index wasn't computed at search time
- Accept as default embedder either the "default" one, or the only embedder when there is only one 
						
						
					 
					
						2023-12-14 16:08:41 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						12940d79a9 
					 
					
						
						
							
							WIP  
						
						
						
						
						- manual embedder
- multi embedders OK
- clippy + tests OK 
						
						
					 
					
						2023-12-14 16:08:41 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						922a640188 
					 
					
						
						
							
							WIP multi embedders  
						
						
						
						
						fixed template bugs 
						
						
					 
					
						2023-12-14 16:08:41 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						65e49b7092 
					 
					
						
						
							
							Remove stuff, add distribution shift (WIP)  
						
						
						
						
					 
					
						2023-12-14 16:08:38 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						13c2c6c16b 
					 
					
						
						
							
							Small commit to add hybrid search and autoembedding  
						
						
						
						
					 
					
						2023-12-14 16:07:48 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						35e1981488 
					 
					
						
						
							
							Remove proximityPrecision from the experimental feature  
						
						
						
						
					 
					
						2023-12-14 15:52:42 +01:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						56571f762a 
					 
					
						
						
							
							Merge remote-tracking branch 'origin/main' into tmp-release-v1.5.1  
						
						
						
						
					 
					
						2023-12-13 11:57:01 +01:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						99fec27788 
					 
					
						
						
							
							Make the --max-number-of-batched-tasks argument experimental  
						
						
						
						
					 
					
						2023-12-12 10:55:39 +01:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						7e259cb0d2 
					 
					
						
						
							
							Expose the --max-number-of-batched-tasks argument  
						
						
						
						
					 
					
						2023-12-11 16:08:39 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						c9860c7913 
					 
					
						
						
							
							Small test fixes  
						
						
						
						
					 
					
						2023-12-06 15:49:05 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						03ffabe889 
					 
					
						
						
							
							Add a new dump test  
						
						
						
						
					 
					
						2023-12-06 15:49:05 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						1f4fc9c229 
					 
					
						
						
							
							Make the feature experimental  
						
						
						
						
					 
					
						2023-12-06 15:49:05 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						8cc3c54117 
					 
					
						
						
							
							Add proximityPrecision setting in settings route  
						
						
						
						
					 
					
						2023-12-06 15:49:05 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						0c3fa8cbc4 
					 
					
						
						
							
							Add tests on proximityPrecision setting  
						
						
						
						
					 
					
						2023-12-06 14:59:23 +01:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						170e063b80 
					 
					
						
						
							
							Remove the actix-web dependency from milli  
						
						
						
						
					 
					
						2023-11-28 17:19:57 +01:00 
						 
				 
			
				
					
						
							
							
								meili-bors[bot] 
							
						 
					 
					
						
						
							
						
						6376c342c1 
					 
					
						
						
							
							Merge  #4223  
						
						
						
						
						4223: Update to heed 0.20 r=dureuill a=Kerollmops
This PR brings the v0.20-alpha.9 version of heed into Meilisearch 🎉 The main goal is to test it in a real environment and make any necessary changes. We also want to merge it as soon as possible during the pre-release phase to ensure we catch bugs before the release.
Most of the calls to heed are the same as before, except:
 - The `PolyDatabase` has been replaced with a `Database<Unspecified, Unspecified>`. We replaced the `get<T, U>()` calls with `remap<T, U>().get()` calls.
 - The `Database` `append(...)` method has been replaced with a `put_with_flags(PutFlags::APPEND, ...)`.
 - The `RwTxn<'e, 'p>` has been simplified into a `RwTxn<'e>`.
 - The `BytesEncode/Decode` traits return a `Result<_, BoxedError>` instead of an `Option<_>`.
 - We no longer need to wrap and unwrap the `BEU32` integer when storing/getting them from heed.
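
As a rough illustration of the `Result<_, BoxedError>` point above, here is a minimal sketch in plain Rust. The trait `DecodeSketch`, the error type `InvalidLength`, and the codec `BigEndianU32` are hypothetical stand-ins written for this example. They are not heed's actual definitions, whose signatures may differ.

```rust
use std::convert::TryFrom;
use std::error::Error;
use std::fmt;

// Boxed error alias, in the spirit of the `BoxedError` named in the list above.
type BoxedError = Box<dyn Error + Send + Sync + 'static>;

/// Hypothetical decoding trait (an assumption, not heed's real trait):
/// decoding returns a `Result` so the caller can see why it failed.
trait DecodeSketch<'a> {
    type Item;
    fn bytes_decode(bytes: &'a [u8]) -> Result<Self::Item, BoxedError>;
}

/// A small, concrete error type used instead of a string.
#[derive(Debug, PartialEq)]
struct InvalidLength {
    expected: usize,
    found: usize,
}

impl fmt::Display for InvalidLength {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(
            f,
            "invalid length: expected {} bytes, found {}",
            self.expected, self.found
        )
    }
}

impl Error for InvalidLength {}

/// Illustrative codec for a big-endian `u32` stored as exactly 4 bytes.
struct BigEndianU32;

impl<'a> DecodeSketch<'a> for BigEndianU32 {
    type Item = u32;

    fn bytes_decode(bytes: &'a [u8]) -> Result<u32, BoxedError> {
        // Fails with a typed error when the slice is not exactly 4 bytes long.
        let array = <[u8; 4]>::try_from(bytes).map_err(|_| InvalidLength {
            expected: 4,
            found: bytes.len(),
        })?;
        Ok(u32::from_be_bytes(array))
    }
}
```

With an `Option<_>` return type the second case below would only yield `None`; with a `Result` the caller gets a message it can log or wrap: `BigEndianU32::bytes_decode(&[0, 0, 1, 2])` gives `Ok(258)`, while a 2-byte slice gives an `InvalidLength` error.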
### TODO
 - [x] Create actual, simple error types instead of using strings in the codecs.
### Follow-up work
 - Move the codecs into another member crate (we depend on the uuid one in the meilitool crate).
 - Display the internal decoding error in the `SerializationError` internal error variant.
Co-authored-by: Clément Renault <clement@meilisearch.com>
						
						
					 
					
						2023-11-28 13:39:44 +00:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						5b563f872b 
					 
					
						
						
							
							Move the clippy attribute on the problematic part of the code  
						
						
						
						
					 
					
						2023-11-28 14:37:58 +01:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						1575456594 
					 
					
						
						
							
							Further reduce an async block  
						
						
						
						
					 
					
						2023-11-28 14:23:32 +01:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						d32eb11329 
					 
					
						
						
							
							Move to the v0.20.0-alpha.9 of heed  
						
						
						
						
					 
					
						2023-11-27 11:52:22 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						dc07790133 
					 
					
						
						
							
							Add test reproducing  #4232  
						
						
						
						
					 
					
						2023-11-27 11:39:11 +01:00 
						 
				 
			
				
					
						
							
							
								karribalu 
							
						 
					 
					
						
						
							
						
						85626cff8e 
					 
					
						
						
							
							Fixed payload limit setting being ignored for delete documents by batch route  
						
						
						
						
					 
					
						2023-11-25 18:41:16 +00:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						0dbf1a16ff 
					 
					
						
						
							
							Make clippy happy  
						
						
						
						
					 
					
						2023-11-23 14:11:38 +01:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						7cb7e37ba8 
					 
					
						
						
							
							Merge branch 'main' into tmp-release-v1.5.0  
						
						
						
						
					 
					
						2023-11-21 16:30:46 +01:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						e507ef5932 
					 
					
						
						
							
							Slow the logging down  
						
						
						
						
					 
					
						2023-11-01 13:49:32 +01:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						9fedd8101a 
					 
					
						
						
							
							Fix tests  
						
						
						
						
					 
					
						2023-10-30 15:11:07 +01:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						dfab6293c9 
					 
					
						
						
							
							Use an LMDB database to store the external documents ids  
						
						
						
						
					 
					
						2023-10-30 11:41:23 +01:00 
						 
				 
			
				
					
						
							
							
								meili-bors[bot] 
							
						 
					 
					
						
						
							
						
						2614e7d9ca 
					 
					
						
						
							
							Merge  #4174  
						
						
						
						
						4174: Fix warnings r=dureuill a=irevoire
Fix all the warnings found in the CI: https://github.com/meilisearch/meilisearch/actions/runs/6622576021/job/17988323623 
Co-authored-by: Tamo <tamo@meilisearch.com>
						
						
					 
					
						2023-10-30 10:12:54 +00:00 
						 
				 
			
				
					
						
							
							
								Tamo 
							
						 
					 
					
						
						
							
						
						e7244aa485 
					 
					
						
						
							
							fix warnings  
						
						
						
						
					 
					
						2023-10-30 11:00:46 +01:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						4c6fddb1cb 
					 
					
						
						
							
							update charabia  
						
						
						
						
					 
					
						2023-10-26 17:01:10 +02:00 
						 
				 
			
				
					
						
							
							
								Vivek Kumar 
							
						 
					 
					
						
						
							
						
						32c78ac8b1 
					 
					
						
						
							
							add/update tests when search with distinct attribute & pagination with no ranking  
						
						
						
						
					 
					
						2023-10-23 12:06:27 +02:00 
						 
				 
			
				
					
						
							
							
								meili-bors[bot] 
							
						 
					 
					
						
						
							
						
						eae9eab181 
					 
					
						
						
							
							Merge  #4126  
						
						
						
						
						4126: Make the experimental route /metrics activable via HTTP r=dureuill a=braddotcoffee
# Pull Request
## Related issue
Closes  #4086 
## What does this PR do?
- [x] Make `/metrics` available via HTTP as described in #4086  
- [x] The users can still launch Meilisearch using the `--experimental-enable-metrics` flag.
- [x] If the flag `--experimental-enable-metrics` is activated, a call to the `GET /experimental-features` route right after the launch will show `"metrics": true` even if the user has not called the `PATCH /experimental-features` route yet.
- [x] Even if the `--experimental-enable-metrics` flag is present at launch, calling the `PATCH /experimental-features` route with `"metrics": false` disables the experimental feature.
- [x] Update the spec
    - I was unable to find docs in this repository to update about the `/experimental-features` endpoint. I'll happily update if you point me in the right direction!
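
The behaviour described in the checklist above could be sketched as follows. This is a minimal illustration, not the code from this PR: `RuntimeFeatures`, `FeatureStore`, and the rule that the launch flag is OR-ed with the stored value are all assumptions made for the example.

```rust
use std::sync::RwLock;

/// Hypothetical set of runtime-togglable features (only `metrics` here).
#[derive(Debug, Clone, Copy, PartialEq, Eq, Default)]
struct RuntimeFeatures {
    metrics: bool,
}

/// Hypothetical in-memory holder for the effective feature state.
/// The CLI flag only influences the in-memory value, so it is never
/// written back to the stored settings by this sketch.
struct FeatureStore {
    runtime: RwLock<RuntimeFeatures>,
}

impl FeatureStore {
    /// `persisted` is the state loaded from storage, `cli_metrics` stands for
    /// the `--experimental-enable-metrics` flag. Combining them with `||` is
    /// an assumption of this sketch.
    fn new(persisted: RuntimeFeatures, cli_metrics: bool) -> Self {
        let effective = RuntimeFeatures {
            metrics: persisted.metrics || cli_metrics,
        };
        FeatureStore {
            runtime: RwLock::new(effective),
        }
    }

    /// What a `GET /experimental-features` handler would report.
    fn get(&self) -> RuntimeFeatures {
        *self.runtime.read().unwrap()
    }

    /// What a `PATCH /experimental-features` handler would apply.
    /// An explicit value wins over the launch flag.
    fn patch(&self, metrics: bool) {
        self.runtime.write().unwrap().metrics = metrics;
    }
}
```

Under these assumptions, a store created with the flag set reports `metrics: true` before any `PATCH`, and a later `patch(false)` turns the feature off even though the flag was passed at launch.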
## PR checklist
Please check if your PR fulfills the following requirements:
- [x] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)?
- [x] Have you read the contributing guidelines?
- [x] Have you made sure that the title is accurate and descriptive of the changes?
Co-authored-by: bwbonanno <bradfordbonanno@gmail.com>
Co-authored-by: Louis Dureuil <louis@meilisearch.com>
						
						
					 
					
						2023-10-23 08:51:37 +00:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						cf8dad1ca0 
					 
					
						
						
							
							index_scheduler.features() is no longer fallible  
						
						
						
						
					 
					
						2023-10-23 10:38:56 +02:00 
						 
				 
			
				
					
						
							
							
								bwbonanno 
							
						 
					 
					
						
						
							
						
						dd619913da 
					 
					
						
						
							
							Use RwLock to never persist cli state to db  
						
						
						
						
					 
					
						2023-10-19 12:45:57 -07:00 
						 
				 
			
				
					
						
							
							
								bwbonanno 
							
						 
					 
					
						
						
							
						
						d8c649b3cd 
					 
					
						
						
							
							Return recoverable error if we fail to retrieve metrics state  
						
						
						
						
					 
					
						2023-10-18 08:28:24 -07:00 
						 
				 
			
				
					
						
							
							
								meili-bors[bot] 
							
						 
					 
					
						
						
							
						
						5e0485d8dd 
					 
					
						
						
							
							Merge  #4131  
						
						
						
						
						4131: Reduce proximity range from 7 to 3 r=Kerollmops a=ManyTheFish
## Summary
This PR aims to reduce the impact of the proximity databases on the indexing time and on the database size by reducing the maximum distance between two words to be indexed in the proximity database.
## Stats
### Impact on database size and indexing time

### Impact on search relevancy
<details>
| dataset_name | host_name        | Relevancy rate (Precision) | completion_rate  25.00% | completion_rate 50.00% | completion_rate 75.00% | completion_rate 100.00% |
|--------------|------------------|------------------------------------|-----------------|-----------------|-----------------|-----------------|
| FBIS         | 1_4_0            | percentile-10 |           0.00% |           0.00% |           0.00% |           0.00% |
| FBIS         | 1_4_0            | percentile-25 |           0.00% |           0.00% |           0.00% |           0.00% |
| FBIS         | 1_4_0            | percentile-50 |           0.00% |           0.00% |           5.00% |           5.56% |
| FBIS         | 1_4_0            | percentile-75 |           0.00% |          12.50% |          35.00% |          45.00% |
| FBIS         | 1_4_0            | percentile-90 |          20.00% |          40.00% |                 |         100.00% |
| FBIS         | 1_4_0            | average       |           5.78% |          11.16% |          21.90% |          26.29% |
| FBIS         | reduce_proximity | percentile-10 |           0.00% |           0.00% |           0.00% |           0.00% |
| FBIS         | reduce_proximity | percentile-25 |           0.00% |           0.00% |           0.00% |           0.00% |
| FBIS         | reduce_proximity | percentile-50 |           0.00% |           0.00% |           5.00% |           5.56% |
| FBIS         | reduce_proximity | percentile-75 |           0.00% |          15.00% |          35.00% |          40.00% |
| FBIS         | reduce_proximity | percentile-90 |          20.00% |          40.00% |          85.00% |         100.00% |
| FBIS         | reduce_proximity | average       |           5.55% |          11.34% |          21.75% |          26.14% |
| FR94         | 1_4_0            | percentile-10 |           0.00% |           0.00% |           0.00% |           0.00% |
| FR94         | 1_4_0            | percentile-25 |           0.00% |           0.00% |           0.00% |           0.00% |
| FR94         | 1_4_0            | percentile-50 |           0.00% |           0.00% |           0.00% |           0.00% |
| FR94         | 1_4_0            | percentile-75 |           0.00% |           5.00% |          15.00% |          42.11% |
| FR94         | 1_4_0            | percentile-90 |          15.00% |          54.55% |         100.00% |         100.00% |
| FR94         | 1_4_0            | average       |           5.95% |          12.07% |          18.70% |          25.57% |
| FR94         | reduce_proximity | percentile-10 |           0.00% |           0.00% |           0.00% |           0.00% |
| FR94         | reduce_proximity | percentile-25 |           0.00% |           0.00% |           0.00% |           0.00% |
| FR94         | reduce_proximity | percentile-50 |           0.00% |           0.00% |           0.00% |           0.00% |
| FR94         | reduce_proximity | percentile-75 |           0.00% |           5.00% |          15.00% |          42.11% |
| FR94         | reduce_proximity | percentile-90 |          15.00% |          54.55% |         100.00% |         100.00% |
| FR94         | reduce_proximity | average       |           5.79% |          12.00% |          18.70% |          25.53% |
| FT           | 1_4_0            | percentile-10 |           0.00% |           0.00% |           0.00% |           0.00% |
| FT           | 1_4_0            | percentile-25 |           0.00% |           0.00% |           0.00% |           0.00% |
| FT           | 1_4_0            | percentile-50 |           0.00% |           0.00% |           5.00% |          10.00% |
| FT           | 1_4_0            | percentile-75 |           0.00% |          15.00% |          30.00% |          40.00% |
| FT           | 1_4_0            | percentile-90 |          20.00% |          50.00% |          65.00% |         100.00% |
| FT           | 1_4_0            | average       |           5.08% |          12.58% |          20.00% |          25.49% |
| FT           | reduce_proximity | percentile-10 |           0.00% |           0.00% |           0.00% |           0.00% |
| FT           | reduce_proximity | percentile-25 |           0.00% |           0.00% |           0.00% |           0.00% |
| FT           | reduce_proximity | percentile-50 |           0.00% |           0.00% |           5.00% |          10.00% |
| FT           | reduce_proximity | percentile-75 |           0.00% |          15.00% |          30.00% |          40.00% |
| FT           | reduce_proximity | percentile-90 |          10.00% |          45.00% |          60.00% |         100.00% |
| FT           | reduce_proximity | average       |           5.01% |          12.64% |          20.10% |          25.53% |
| LAT          | 1_4_0            | percentile-10 |           0.00% |           0.00% |           0.00% |           0.00% |
| LAT          | 1_4_0            | percentile-25 |           0.00% |           0.00% |           0.00% |           0.00% |
| LAT          | 1_4_0            | percentile-50 |           0.00% |           0.00% |           5.00% |           5.00% |
| LAT          | 1_4_0            | percentile-75 |           5.00% |          15.00% |          30.00% |          30.00% |
| LAT          | 1_4_0            | percentile-90 |          15.00% |          45.00% |          60.00% |          80.00% |
| LAT          | 1_4_0            | average       |           4.80% |          11.80% |          17.88% |          21.62% |
| LAT          | reduce_proximity | percentile-10 |           0.00% |           0.00% |           0.00% |           0.00% |
| LAT          | reduce_proximity | percentile-25 |           0.00% |           0.00% |           0.00% |           0.00% |
| LAT          | reduce_proximity | percentile-50 |           0.00% |           0.00% |           5.00% |           5.00% |
| LAT          | reduce_proximity | percentile-75 |           0.00% |          11.11% |          25.00% |          35.00% |
| LAT          | reduce_proximity | percentile-90 |          15.00% |          45.00% |          55.00% |          80.00% |
| LAT          | reduce_proximity | average       |           4.43% |          11.23% |          17.32% |          21.45% |
</details>
### Impact on Search time
| dataset_name | host_name        |      25.00% |      50.00% |      75.00% |     100.00% | Average     |
|--------------|------------------|------------:|------------:|------------:|------------:|-------------|
| FBIS         | 1_4_0            |        3.45 | 7.446666667 | 9.773489933 | 9.620300752 | 7.572614338 |
| FBIS         | reduce_proximity | 2.983333333 | 5.316666667 | 6.911073826 | 7.637218045 | 5.712072968 |
| FR94         | 1_4_0            | 2.236666667 |        4.45 | 5.523489933 | 4.560150376 | 4.192576744 |
| FR94         | reduce_proximity |        2.09 | 3.991666667 | 4.981543624 | 4.266917293 | 3.832531896 |
| FT           | 1_4_0            | 5.956666667 | 9.656666667 | 13.86912752 | 10.83270677 |  10.0787919 |
| FT           | reduce_proximity |        4.51 | 5.981666667 | 7.701342282 | 6.766917293 |  6.23998156 |
| LAT          | 1_4_0            | 5.856666667 | 9.233333333 | 12.98322148 | 10.78759398 | 9.715203865 |
| LAT          | reduce_proximity |        6.91 | 6.706666667 | 8.463087248 | 8.265037594 | 7.586197877 |
## Technical approach
- Ensure the MAX_DISTANCE constant is used everywhere needed
- Reduce the MAX_DISTANCE from 8 to 4
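
To make the effect of the constant concrete, here is a minimal sketch of windowed word-pair extraction. It is not the milli implementation: the function `word_pair_proximities` is hypothetical, and treating `MAX_DISTANCE` as an exclusive bound (so that 4 yields the proximity range of 3 named in the PR title) is an assumption.

```rust
use std::collections::BTreeMap;

/// Assumed exclusive upper bound on the distance recorded between two words.
const MAX_DISTANCE: u32 = 4;

/// Hypothetical helper: for every ordered pair of words closer than
/// `MAX_DISTANCE`, keep the smallest distance seen in the input.
fn word_pair_proximities<'a>(words: &[&'a str]) -> BTreeMap<(&'a str, &'a str), u32> {
    let mut pairs = BTreeMap::new();
    for (i, left) in words.iter().enumerate() {
        for (offset, right) in words[i + 1..].iter().enumerate() {
            // Adjacent words are at distance 1.
            let distance = offset as u32 + 1;
            if distance >= MAX_DISTANCE {
                // Every later word is even further away, so stop scanning.
                break;
            }
            let entry = pairs.entry((*left, *right)).or_insert(distance);
            if distance < *entry {
                *entry = distance;
            }
        }
    }
    pairs
}
```

In this sketch, lowering the constant shrinks the inner window, so fewer pairs are produced per word. For the five-word input `["a", "b", "c", "d", "e"]` it records 9 pairs, and the pair `("a", "e")` at distance 4 is dropped.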
## Related
TBD
Co-authored-by: ManyTheFish <many@meilisearch.com>
						
						
					 
					
						2023-10-18 14:56:08 +00:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						27eec21415 
					 
					
						
						
							
							Fix tests  
						
						
						
						
					 
					
						2023-10-18 16:03:22 +02:00 
						 
				 
			
				
					
						
							
							
								bwbonanno 
							
						 
					 
					
						
						
							
						
						2b3adef796 
					 
					
						
						
							
							Use index_scheduler from configured app_data in middleware  
						
						
						
						
					 
					
						2023-10-17 08:17:13 -07:00 
						 
				 
			
				
					
						
							
							
								bwbonanno 
							
						 
					 
					
						
						
							
						
						956cfc5487 
					 
					
						
						
							
							Add runtime check to metrics middleware  
						
						
						
						
					 
					
						2023-10-16 13:48:57 -07:00 
						 
				 
			
				
					
						
							
							
								bwbonanno 
							
						 
					 
					
						
						
							
						
						12fc878640 
					 
					
						
						
							
							Merge remote-tracking branch 'origin/main' into enable-metrics-http  
						
						
						
						
					 
					
						2023-10-16 13:48:01 -07:00