mirror of
				https://github.com/meilisearch/meilisearch.git
				synced 2025-10-25 13:06:27 +00:00 
			
		
		
		
	Merge #3834
3834: Define searchable fields at runtime r=Kerollmops a=ManyTheFish
## Summary
This feature allows the end-user to search in one or multiple attributes using the search parameter `attributesToSearchOn`:
```json
{
  "q": "Captain Marvel",
  "attributesToSearchOn": ["title"]
}
```
This feature act like a filter, forcing Meilisearch to only return the documents containing the requested words in the attributes-to-search-on. Note that, with the matching strategy `last`, Meilisearch will only ensure that the first word is in the attributes-to-search-on, but, the retrieved documents will be ordered taking into account the word contained in the attributes-to-search-on. 
## Trying the prototype
A dedicated docker image has been released for this feature:
#### last prototype version:
```bash
docker pull getmeili/meilisearch:prototype-define-searchable-fields-at-search-time-1
```
#### others prototype versions:
```bash
docker pull getmeili/meilisearch:prototype-define-searchable-fields-at-search-time-0
```
## Technical Detail
The attributes-to-search-on list is given to the search context, then, the search context uses the `fid_word_docids`database using only the allowed field ids instead of the global `word_docids` database. This is the same for the prefix databases.
The database cache is updated with the merged values, meaning that the union of the field-id-database values is only made if the requested key is missing from the cache.
### Relevancy limits
Almost all ranking rules behave as expected when ordering the documents.
Only `proximity` could miss-order documents if all the searched words are in the restricted attribute but a better proximity is found in an ignored attribute in a document that should be ranked lower. I put below a failing test showing it:
```rust
#[actix_rt::test]
async fn proximity_ranking_rule_order() {
    let server = Server::new().await;
    let index = index_with_documents(
        &server,
        &json!([
        {
            "title": "Captain super mega cool. A Marvel story",
            // Perfect distance between words in an ignored attribute
            "desc": "Captain Marvel",
            "id": "1",
        },
        {
            "title": "Captain America from Marvel",
            "desc": "a Shazam ersatz",
            "id": "2",
        }]),
    )
    .await;
    // Document 2 should appear before document 1.
    index
        .search(json!({"q": "Captain Marvel", "attributesToSearchOn": ["title"], "attributesToRetrieve": ["id"]}), |response, code| {
            assert_eq!(code, 200, "{}", response);
            assert_eq!(
                response["hits"],
                json!([
                    {"id": "2"},
                    {"id": "1"},
                ])
            );
        })
        .await;
}
```
Fixing this would force us to create a `fid_word_pair_proximity_docids` and a `fid_word_prefix_pair_proximity_docids` databases which may multiply the keys of `word_pair_proximity_docids` and `word_prefix_pair_proximity_docids` by the number of attributes in the searchable_attributes list. If we think we should fix this test, I'll suggest doing it in another PR.
## Related
Fixes #3772
Co-authored-by: Tamo <tamo@meilisearch.com>
Co-authored-by: ManyTheFish <many@meilisearch.com>
			
			
This commit is contained in:
		| @@ -226,6 +226,7 @@ InvalidIndexLimit                     , InvalidRequest       , BAD_REQUEST ; | ||||
| InvalidIndexOffset                    , InvalidRequest       , BAD_REQUEST ; | ||||
| InvalidIndexPrimaryKey                , InvalidRequest       , BAD_REQUEST ; | ||||
| InvalidIndexUid                       , InvalidRequest       , BAD_REQUEST ; | ||||
| InvalidAttributesToSearchOn   , InvalidRequest       , BAD_REQUEST ; | ||||
| InvalidSearchAttributesToCrop         , InvalidRequest       , BAD_REQUEST ; | ||||
| InvalidSearchAttributesToHighlight    , InvalidRequest       , BAD_REQUEST ; | ||||
| InvalidSearchAttributesToRetrieve     , InvalidRequest       , BAD_REQUEST ; | ||||
| @@ -336,6 +337,9 @@ impl ErrorCode for milli::Error { | ||||
|                     UserError::SortRankingRuleMissing => Code::InvalidSearchSort, | ||||
|                     UserError::InvalidFacetsDistribution { .. } => Code::InvalidSearchFacets, | ||||
|                     UserError::InvalidSortableAttribute { .. } => Code::InvalidSearchSort, | ||||
|                     UserError::InvalidSearchableAttribute { .. } => { | ||||
|                         Code::InvalidAttributesToSearchOn | ||||
|                     } | ||||
|                     UserError::CriterionError(_) => Code::InvalidSettingsRankingRules, | ||||
|                     UserError::InvalidGeoField { .. } => Code::InvalidDocumentGeoField, | ||||
|                     UserError::InvalidVectorDimensions { .. } => Code::InvalidVectorDimensions, | ||||
|   | ||||
		Reference in New Issue
	
	Block a user