ManyTheFish 
							
						 
					 
					
						
						
							
						
						ff87b4db26 
					 
					
						
						
							
							Avoid running proximity when only the exact attributes change  
						
						
						
						
					 
					
						2024-06-05 12:48:44 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						ba9fadc8f1 
					 
					
						
						
							
							Put only_additional_fields to None if the difference gives an empty result.  
						
						
						
						
					 
					
						2024-06-05 10:51:16 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						d29d4f88da 
					 
					
						
						
							
							Skip iterating over documents when the faceted field list doesn't change  
						
						
						
						
					 
					
						2024-06-04 15:31:24 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						17c5ceeb9d 
					 
					
						
						
							
							iterate over the faceted fields instead of over the whole document  
						
						
						
						
					 
					
						2024-06-04 14:04:20 +02:00 
						 
				 
			
				
					
						
							
							
								meili-bors[bot] 
							
						 
					 
					
						
						
							
						
						fc584f1db3 
					 
					
						
						
							
							Merge  #4666  
						
						
						
						
						4666: Add a score threshold search parameter r=ManyTheFish a=dureuill
# Pull Request
## Related issue
Fixes https://github.com/meilisearch/meilisearch/issues/4609 
## What does this PR do?
- See [usage](https://meilisearch.notion.site/Filter-by-score-usage-224a183ce7b24ca99b6a9a8da755668a?pvs=25#95b76ded400342ba9ab3d67c734836f0) and [the known limitation](https://meilisearch.notion.site/Filter-by-score-usage-224a183ce7b24ca99b6a9a8da755668a?pvs=25#e4e32195bf0e4195b5daecdbb7a97a17)
Co-authored-by: Louis Dureuil <louis@meilisearch.com>
						
						
					 
					
						2024-06-03 08:42:44 +00:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						2b6db6541e 
					 
					
						
						
							
							Changes after review  
						
						
						
						
					 
					
						2024-06-03 10:30:00 +02:00 
						 
				 
			
				
					
						
							
							
								meili-bors[bot] 
							
						 
					 
					
						
						
							
						
						d6bd88ce4f 
					 
					
						
						
							
							Merge  #4667  
						
						
						
						
						4667: Frequency matching strategy r=Kerollmops a=ManyTheFish
# Pull Request
## Related issue
Fixes #3773
## What does this PR do?
- add test for matching strategy
- implement frequency matching strategy
See the [PRD for more details](https://www.notion.so/meilisearch/Frequency-Matching-Strategy-0f3ba08833a442a39590a53a1505ab00).
[Public API](https://www.notion.so/meilisearch/frequency-matching-strategy-89868fb7fc584026bc56e378eb854a7f).
Co-authored-by: ManyTheFish <many@meilisearch.com>
						
						
					 
					
						2024-05-30 14:53:31 +00:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						b9a0ff0dd6 
					 
					
						
						
							
							Cache a lot of operations to know if a field must be indexed  
						
						
						
						
					 
					
						2024-05-30 16:18:23 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						75496af985 
					 
					
						
						
							
							Add a span for the prepare_for_documents_reindexing  
						
						
						
						
					 
					
						2024-05-30 12:14:22 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						0e9eb9eedb 
					 
					
						
						
							
							Add a span for the settings diff creation  
						
						
						
						
					 
					
						2024-05-30 12:08:27 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						3f1a510069 
					 
					
						
						
							
							Add tests and fix matching strategy  
						
						
						
						
					 
					
						2024-05-30 12:02:42 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						3a78e988da 
					 
					
						
						
							
							Reduce the number of complex calls to settings diff functions  
						
						
						
						
					 
					
						2024-05-30 11:23:07 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						d9e5074189 
					 
					
						
						
							
							Introduce a new way to determine the operations to perform on the fields  
						
						
						
						
					 
					
						2024-05-30 11:23:07 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						bc210bdc00 
					 
					
						
						
							
							Introduce a dedicated function to write proximity entries in database  
						
						
						
						
					 
					
						2024-05-30 11:23:06 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						4bf83f701c 
					 
					
						
						
							
							Give the settings diff to the write_typed_chunk_into_index function  
						
						
						
						
					 
					
						2024-05-30 11:23:06 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						db3887929f 
					 
					
						
						
							
							Fix an issue with settings diff and * in the searchable attributes  
						
						
						
						
					 
					
						2024-05-30 11:22:50 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						9af103a88e 
					 
					
						
						
							
							Introducing a new into_del_add_obkv_conditional_operation function  
						
						
						
						
					 
					
						2024-05-30 11:22:49 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						99211eb375 
					 
					
						
						
							
							Introduce the SettingDiff only_additional_fields method  
						
						
						
						
					 
					
						2024-05-30 11:22:49 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						4f03b0cf5b 
					 
					
						
						
							
							Add ranking score threshold to similar  
						
						
						
						
					 
					
						2024-05-30 11:20:50 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						c26db7878c 
					 
					
						
						
							
							Expose rankingScoreThreshold in API  
						
						
						
						
					 
					
						2024-05-30 10:32:35 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						1ab88e10b9 
					 
					
						
						
							
							Merge branch 'main' into merge-release-v1.8.1-in-main  
						
						
						
						
					 
					
						2024-05-29 16:24:00 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						aac1d769a7 
					 
					
						
						
							
							Add ranking_score_threshold to milli  
						
						
						
						
					 
					
						2024-05-29 14:17:09 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						abdc4afcca 
					 
					
						
						
							
							Implement Frequency matching strategy  
						
						
						
						
					 
					
						2024-05-29 13:59:08 +02:00 
						 
				 
			
				
					
						
							
							
								Many the fish 
							
						 
					 
					
						
						
							
						
						e1fbfde6c4 
					 
					
						
						
							
							Merge branch 'main' into merge-release-v1.8.1-in-main  
						
						
						
						
					 
					
						2024-05-29 11:31:03 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						27b75ec648 
					 
					
						
						
							
							merge main into v1.8.1  
						
						
						
						
					 
					
						2024-05-29 11:26:07 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						ca6cc4654b 
					 
					
						
						
							
							Add similar route  
						
						
						
						
					 
					
						2024-05-28 15:28:19 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						d35278320e 
					 
					
						
						
							
							Add support functions for accessing arroy writers and readers  
						
						
						
						
					 
					
						2024-05-28 15:27:43 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						02b3d82c60 
					 
					
						
						
							
							filtered_universe accepts index and txn instead of SearchContext  
						
						
						
						
					 
					
						2024-05-28 15:22:12 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						fd2c95999d 
					 
					
						
						
							
							Change validate_document_id to public and remove extra layer of result  
						
						
						
						
					 
					
						2024-05-28 15:21:19 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						dc949ab46a 
					 
					
						
						
							
							Remove puffin usage  
						
						
						
						
					 
					
						2024-05-27 15:59:14 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						7f3e51349e 
					 
					
						
						
							
							Remove puffin for the dependencies  
						
						
						
						
					 
					
						2024-05-27 15:53:06 +02:00 
						 
				 
			
				
					
						
							
							
								meili-bors[bot] 
							
						 
					 
					
						
						
							
						
						19acc65ad2 
					 
					
						
						
							
							Merge  #4646  
						
						
						
						
						4646: Reduce `Transform`'s disk usage r=Kerollmops a=Kerollmops
This PR implements what is described in #4485. It reduces the number of disk writes and disk usage.
Co-authored-by: Clément Renault <clement@meilisearch.com>
						
						
					 
					
						2024-05-23 16:06:50 +00:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						fe17c0f52e 
					 
					
						
						
							
							Construct the minimal OBKVs according to the settings diff  
						
						
						
						
					 
					
						2024-05-23 11:23:57 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						bc5663e673 
					 
					
						
						
							
							FieldIdsMap no longer useful thanks to  #4631  
						
						
						
						
					 
					
						2024-05-22 16:06:15 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						8a941c0241 
					 
					
						
						
							
							Smaller review changes  
						
						
						
						
					 
					
						2024-05-22 14:44:42 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						3412e7fbcf 
					 
					
						
						
							
							"[]" is deserialized as 0 embedding rather than 1 embedding of dim 0  
						
						
						
						
					 
					
						2024-05-22 12:25:21 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						16037e2169 
					 
					
						
						
							
							Don't remove embedders that are not in the config from the document DB  
						
						
						
						
					 
					
						2024-05-22 12:24:51 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						8f7c8ca7f0 
					 
					
						
						
							
							Remove now unused error variant  
						
						
						
						
					 
					
						2024-05-22 12:23:43 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						500ddc76b5 
					 
					
						
						
							
							Make the flattened sorter optional  
						
						
						
						
					 
					
						2024-05-21 16:16:36 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						943f8dba0c 
					 
					
						
						
							
							Make clippy happy  
						
						
						
						
					 
					
						2024-05-21 14:58:41 +02:00 
						 
				 
			
				
					
						
							
							
								Clément Renault 
							
						 
					 
					
						
						
							
						
						1aa8ed9ef7 
					 
					
						
						
							
							Make the original sorter optional  
						
						
						
						
					 
					
						2024-05-21 14:53:26 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						f762307838 
					 
					
						
						
							
							Fix clippy  
						
						
						
						
					 
					
						2024-05-21 13:44:20 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						3e94a90722 
					 
					
						
						
							
							Fixes  
						
						
						
						
					 
					
						2024-05-21 13:39:46 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						b17cb56dee 
					 
					
						
						
							
							Test array of vectors  
						
						
						
						
					 
					
						2024-05-20 14:44:10 +02:00 
						 
				 
			
				
					
						
							
							
								ManyTheFish 
							
						 
					 
					
						
						
							
						
						fc7e817221 
					 
					
						
						
							
							Index geo points based on the settings differences  
						
						
						
						
					 
					
						2024-05-20 12:27:26 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						d05d49ffd8 
					 
					
						
						
							
							Fix tests  
						
						
						
						
					 
					
						2024-05-20 10:36:18 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						0462ebbe58 
					 
					
						
						
							
							Don't write an empty _vectors field  
						
						
						
						
					 
					
						2024-05-20 10:36:18 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						2f7a8a4efb 
					 
					
						
						
							
							Don't write vectors that weren't autogenerated in document DB  
						
						
						
						
					 
					
						2024-05-20 10:36:18 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						52d9cb6e5a 
					 
					
						
						
							
							Refactor vector indexing  
						
						
						
						
						- use the parsed_vectors module
- only parse `_vectors` once per document, instead of once per embedder per document 
						
						
					 
					
						2024-05-20 10:36:17 +02:00 
						 
				 
			
				
					
						
							
							
								Louis Dureuil 
							
						 
					 
					
						
						
							
						
						261de888b7 
					 
					
						
						
							
							Add function to get the embeddings of a document in an index  
						
						
						
						
					 
					
						2024-05-20 10:36:17 +02:00