MeiliSearch/crates/meilisearch/tests
meili-bors[bot] 19e6f675b3
Merge #4900
4900: Indexer edition 2024 r=Kerollmops a=dureuill

This PR is implementing the indexer edition 2024, largely inspired by [the ideas from this blog post](https://blog.kerollmops.com/meilisearch-is-too-slow).

Fixes https://github.com/meilisearch/meilisearch/issues/4985

## Features
- Stream-first approach to reading documents.
- Minimum disk write operations.
- RAM usage-first approach to avoid modifying common bitmaps on disk but in memory.
- Reduced LMDB fragmentation by writing entries only once...
- ...computing the final version of the entries in parallel...
- ...and storing them in write-optimized data structures before sending them to the BTree (LMDB).
- Indexing in multiple transactions to improve large dataset support (dumps).


Co-authored-by: ManyTheFish <many@meilisearch.com>
Co-authored-by: Clément Renault <clement@meilisearch.com>
Co-authored-by: Louis Dureuil <louis@meilisearch.com>
2024-11-21 16:19:10 +00:00
..
assets Move crates under a sub folder to clean up the code 2024-10-21 08:18:43 +02:00
auth Move crates under a sub folder to clean up the code 2024-10-21 08:18:43 +02:00
batches improve the count of the number of tasks in a batch 2024-11-20 17:48:26 +01:00
common add the front end tests on the batches routes 2024-11-20 10:51:06 +01:00
dashboard Move crates under a sub folder to clean up the code 2024-10-21 08:18:43 +02:00
documents update snaps after merge 2024-11-20 17:08:30 +01:00
dumps add the batch_id to the tasks 2024-11-20 10:42:54 +01:00
features Move crates under a sub folder to clean up the code 2024-10-21 08:18:43 +02:00
index Move crates under a sub folder to clean up the code 2024-10-21 08:18:43 +02:00
logs Merge branch 'main' into tmp-release-v1.11.0 2024-11-04 16:14:44 +01:00
search Merge branch 'main' into indexer-edition-2024 2024-11-20 16:59:58 +01:00
settings Move crates under a sub folder to clean up the code 2024-10-21 08:18:43 +02:00
similar Move crates under a sub folder to clean up the code 2024-10-21 08:18:43 +02:00
snapshot remove useless prints 2024-11-20 12:27:12 +01:00
stats Move crates under a sub folder to clean up the code 2024-10-21 08:18:43 +02:00
swap_indexes add the batch_id to the tasks 2024-11-20 10:42:54 +01:00
tasks update the tests after the rebase 2024-11-20 10:54:38 +01:00
vector First batch of PR comment 2024-11-21 16:38:11 +01:00
content_type.rs Move crates under a sub folder to clean up the code 2024-10-21 08:18:43 +02:00
integration.rs add the front end tests on the batches routes 2024-11-20 10:51:06 +01:00