MeiliSearch

mirror of https://github.com/meilisearch/MeiliSearch synced 2025-07-01 10:58:30 +02:00

meili-bors[bot] 101f5a20d2

3757: Adjust the cost of edges in the `position` ranking rule by bucketing positions more aggressively r=loiclec a=loiclec

This PR significantly improves the performance of the `position` ranking rule when:
1. a query contains many words
2. the `position` ranking rule needs to be called many times
3. the score of the documents according to `position` is high

These conditions greatly increase:
1. the number of edge traversals that are needed to find a valid path from the `start` node to the `end` node
2. the number of edges that need to be deleted from the graph, and therefore the number of times that we need to recompute all the possible costs from START to END

As a result, a majority of the search time is spent in `visit_condition`, `visit_node`, and `update_all_costs_before_node`. This is frustrating because it often happens when the "universe" given to the rule consists of only a handful of document ids.

By limiting the number of possible edges between two nodes from `20` to `10`, we:
1. reduce the number of possible costs from START to END
2. reduce the number of edges that will be deleted
3. make it faster to update the costs after deleting an edge
4. reduce the number of buckets that need to be computed

In terms of relevancy, I don't think we lose or gain much. We still prefer terms that are in lower positions, with decreasing precision as we go further. The previous bucketing wasn't chosen in a principled way, and neither is this one. They both "feel" right to me.
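
To make the idea of bucketing concrete, here is a minimal sketch in Rust. It is not the code from milli: the function name `bucket_position`, the constant `MAX_BUCKETS`, and the bucket boundaries are assumptions chosen purely for illustration. It maps a word position to one of at most 10 buckets, keeping exact precision for the first few positions and sharing progressively wider buckets further on, which is one way to bound the number of distinct edge costs between two nodes.

```rust
/// Hypothetical cap on the number of position buckets, and therefore on
/// the number of distinct edge costs between two nodes (assumption).
const MAX_BUCKETS: u32 = 10;

/// Maps a word position to a bucket index in `0..MAX_BUCKETS`.
///
/// Illustrative boundaries only (not the ones used by milli):
/// positions 0..=3 each get their own bucket, then bucket widths double
/// (4..=7, 8..=15, 16..=31, ...), and everything past the last boundary
/// falls into the final bucket.
fn bucket_position(position: u32) -> u32 {
    let bucket = if position < 4 {
        position
    } else {
        // floor(log2(position)) + 2, so 4..=7 -> 4, 8..=15 -> 5, and so on.
        (31 - position.leading_zeros()) + 2
    };
    // Clamp so that the number of distinct buckets never exceeds the cap.
    bucket.min(MAX_BUCKETS - 1)
}
```

With these assumed boundaries, positions 4 through 7 all cost the same, so a term at position 5 is no longer distinguished from one at position 6, while a term at position 1 is still preferred over one at position 2.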

Co-authored-by: Loïc Lecrenier <loic.lecrenier@me.com>
Co-authored-by: meili-bors[bot] <89034592+meili-bors[bot]@users.noreply.github.com>

2023-05-17 11:43:59 +00:00

.github

Add SDKs test in a CI

2023-05-02 11:53:28 +02:00

assets

Add a README to the milli crate

2023-01-16 16:25:12 +01:00

benchmarks

Allow to disable specialized tokenizations (again)

2023-05-04 15:45:40 +02:00

dump

handle the array of array form of filter in the dumps

2023-05-03 17:41:50 +02:00

file-store

Upgrade the compatible versions of the dependencies

2023-04-24 17:50:52 +02:00

filter-parser

Merge #3571

2023-04-27 13:14:00 +00:00

flatten-serde-json

Fix the tests to make flattening work

2023-03-15 14:12:34 +01:00

grafana-dashboards

Add suffix describing the unit when needed; Replace MeiliSearch by Meilisearch; Precised some metrics name

2022-08-23 17:09:27 +02:00

index-scheduler

fix the error code in case of not filterable attributes on the get / delete documents by filter routes

2023-05-16 13:56:18 +02:00

json-depth-checker

Use the workspace inheritance feature of rust 1.64

2023-02-15 13:51:07 +01:00

meili-snap

Upgrade the compatible versions of the dependencies

2023-04-24 17:50:52 +02:00

meilisearch

Merge #3738

2023-05-16 19:37:41 +00:00

meilisearch-auth

Use the writemap flag to reduce the memory usage

2023-05-15 10:15:33 +02:00

meilisearch-types

Merge #3687

2023-05-04 14:48:01 +00:00

milli

Merge #3757

2023-05-17 11:43:59 +00:00

permissive-json-pointer

Use the workspace inheritance feature of rust 1.64

2023-02-15 13:51:07 +01:00

.dockerignore

Update .dockerignore

2023-04-25 16:48:25 +02:00

.gitignore

edit gitignore to ignore .idea and .vscode folders

2023-02-10 11:42:19 +04:00

.rustfmt.toml

Introduce a rustfmt file

2022-10-27 11:35:05 +02:00

bors.toml

Remove macos-latest and windows-latest usages

2022-12-20 11:10:09 +01:00

Cargo.lock

Use the new heed v0.12.6

2023-05-15 11:42:30 +02:00

Cargo.toml

Update version for the next release (v1.2.0) in Cargo.toml

2023-05-08 17:52:33 +00:00

CODE_OF_CONDUCT.md

Create CODE_OF_CONDUCT.md

2020-04-30 20:16:02 +02:00

config.toml

Add the option in the config file

2023-05-15 16:07:43 +02:00

CONTRIBUTING.md

Update links of the docs

2023-05-03 19:14:57 +02:00

Cross.toml

Cross build with action-rs

2021-10-10 02:21:30 +08:00

Dockerfile

revert mount

2023-04-18 15:15:33 +09:00

download-latest.sh

Update links of the docs

2023-05-03 19:14:57 +02:00

LICENSE

Update LICENSE

2022-02-15 15:54:45 +01:00

README.md

Merge #3720

2023-05-04 10:07:41 +00:00

SECURITY.md

docs(security): Fix Supported

2022-05-31 14:21:34 -05:00

README.md

Website | Roadmap | Blog | Documentation | FAQ | Discord

⚡ A lightning-fast search engine that fits effortlessly into your apps, websites, and workflow 🔍

Meilisearch helps you shape a delightful search experience in a snap, offering features that work out-of-the-box to speed up your workflow.

🔥 Try it! 🔥

✨ Features

- Search-as-you-type: find search results in less than 50 milliseconds
- Typo tolerance: get relevant matches even when queries contain typos and misspellings
- Filtering and faceted search: enhance your users' search experience with custom filters and build a faceted search interface in a few lines of code
- Sorting: sort results based on price, date, or pretty much anything else your users need
- Synonym support: configure synonyms to include more relevant content in your search results
- Geosearch: filter and sort documents based on geographic data
- Extensive language support: search datasets in any language, with optimized support for Chinese, Japanese, Hebrew, and languages using the Latin alphabet
- Security management: control which users can access what data with API keys that allow fine-grained permissions handling
- Multi-Tenancy: personalize search results for any number of application tenants
- Highly Customizable: customize Meilisearch to your specific needs or use our out-of-the-box and hassle-free presets
- RESTful API: integrate Meilisearch in your technical stack with our plugins and SDKs
- Easy to install, deploy, and maintain

📖 Documentation

You can consult Meilisearch's documentation at https://www.meilisearch.com/docs.

🚀 Getting started

For basic instructions on how to set up Meilisearch, add documents to an index, and search for documents, take a look at our Quick Start guide.

You may also want to check out Meilisearch 101 for an introduction to some of Meilisearch's most popular features.

☁️ Meilisearch Cloud

Let us manage your infrastructure so you can focus on integrating a great search experience. Try Meilisearch Cloud today.

🧰 SDKs & integration tools

Install one of our SDKs in your project for seamless integration between Meilisearch and your favorite language or framework!

Take a look at the complete Meilisearch integration list.

⚙️ Advanced usage

Experienced users will want to keep our API Reference close at hand.

We also offer a wide range of dedicated guides to all Meilisearch features, such as filtering, sorting, geosearch, API keys, and tenant tokens.

Finally, for more in-depth information, refer to our articles explaining fundamental Meilisearch concepts such as documents and indexes.

📊 Telemetry

Meilisearch collects anonymized data from users to help us improve our product. You can deactivate this whenever you want.

To request deletion of collected data, please write to us at privacy@meilisearch.com. Don't forget to include your Instance UID in the message, as this helps us quickly find and delete your data.

If you want to know more about the kind of data we collect and what we use it for, check the telemetry section of our documentation.

📫 Get in touch!

Meilisearch is a search engine created by Meili, a software development company based in France with team members all over the world. Want to know more about us? Check out our blog!

🗞 Subscribe to our newsletter if you don't want to miss any updates! We promise we won't clutter your mailbox: we only send one edition every two months.

💌 Want to make a suggestion or give feedback? Here are some of the channels where you can reach us:

- For feature requests, please visit our product repository
- Found a bug? Open an issue!
- Want to be part of our Discord community? Join us!

Thank you for your support!

👩‍💻 Contributing

Meilisearch is, and will always be, open-source! If you want to contribute to the project, please take a look at our contribution guidelines.

📦 Versioning

Meilisearch releases and their associated binaries are available on this GitHub page.

The binaries are versioned following SemVer conventions. To know more, read our versioning policy.

Unlike the binaries, crates in this repository are not currently available on crates.io and do not follow SemVer conventions.