MeiliSearch

mirror of https://github.com/meilisearch/MeiliSearch synced 2024-11-26 14:54:27 +01:00

Author	SHA1	Message	Date
Louis Dureuil	09d9b63e1c	- test case where all vectors were generated - update tests following changes in behavior from previous commit	2024-06-13 17:16:41 +02:00
Louis Dureuil	b9b938c902	Change `retrieveVectors` behavior: - when the feature is disabled, documents are never modified - when the feature is enabled and `retrieveVectors` is disabled, `_vectors` is removed from documents - when the feature is enabled and `retrieveVectors` is enabled, vectors from the vectors DB are merged with `_vectors` in documents Additionally `_vectors` is never displayed when the `displayedAttributes` list does not contain either `*` or `_vectors` - fixed an issue where `_vectors` was not injected when all vectors in the dataset where always generated	2024-06-13 17:13:36 +02:00
Tamo	6bf07d969e	add failing test	2024-06-13 15:49:42 +02:00
Louis Dureuil	3f212a8202	Update tests	2024-06-12 18:13:34 +02:00
Louis Dureuil	3bc8f81abc	user_provided => regenerate	2024-06-12 18:12:20 +02:00
Louis Dureuil	fca9fe39b3	Update test snapshots	2024-06-12 14:50:55 +02:00
meili-bors[bot]	e0eff08095	Merge #4685 4685: Fix ci tests r=dureuill a=ManyTheFish # Pull Request Make the all following CI succeed: https://github.com/meilisearch/meilisearch/actions/runs/9477183091 ## Related issue Fixes #4629 ## What does this PR do? - Change the test behavior for `swedish-recomposition` feature flag - Remove the `-v` parameter from grep Co-authored-by: ManyTheFish <many@meilisearch.com> Co-authored-by: Many the fish <many@meilisearch.com>	2024-06-12 07:58:33 +00:00
Clément Renault	ee39309aae	Improve errors and introduce a new InvalidSearchDistinct error code	2024-06-11 16:03:39 -04:00
Clément Renault	0d31be1494	Make the distinct work at search	2024-06-11 11:39:35 -04:00
Tamo	3493093c4f	add a batch of tests	2024-06-11 16:03:54 +02:00
Tamo	600e97d9dc	gate the retrieveVectors parameter behind the vectors feature flag	2024-06-10 18:26:12 +02:00
Louis Dureuil	7559dfc814	Merge tag 'v1.8.2' into release-v1.9.0	2024-06-10 15:07:34 +02:00
Louis Dureuil	50f8218a5d	Asynchronously drop permits	2024-06-10 10:19:57 +02:00
ManyTheFish	57d066595b	fix Tests almost all features	2024-06-06 17:24:50 +02:00
Tamo	734d1c53ad	fix a panic in yaup	2024-06-06 16:31:07 +02:00
Tamo	63dded3961	implements the new analytics for the get documents routes	2024-06-06 11:39:29 +02:00
Tamo	2cdcb703d9	fix the deletion of vectors and add a test	2024-06-06 11:39:29 +02:00
Tamo	6607875f49	add the retrieveVectors parameter to the get and fetch documents route	2024-06-06 11:39:29 +02:00
Tamo	31a793d226	fix the regeneration of the embeddings in the search	2024-06-06 11:39:29 +02:00
Tamo	d85ab23b82	rename all occurences of user_defined to user_provided for consistency	2024-06-06 11:39:29 +02:00
Tamo	49fa41ce65	apply first round of review comments	2024-06-06 11:39:29 +02:00
Tamo	400cf3eb92	add api error test on the new retrieveVectors parameter	2024-06-06 11:39:29 +02:00
Tamo	376b3a19a7	makes clippy and fmt happy	2024-06-06 11:39:29 +02:00
Tamo	d92c173fdc	update the new similar tests	2024-06-06 11:39:29 +02:00
Tamo	6b29676e7e	update snapshots	2024-06-06 11:39:29 +02:00
Tamo	caad40964a	implements the analytics	2024-06-06 11:39:29 +02:00
Tamo	cc5dca8321	fix two bug and add a dump test	2024-06-06 11:39:29 +02:00
Tamo	5d50850e12	always push the user defined vectors in arroy	2024-06-06 11:39:29 +02:00
Tamo	04f6523f3c	expose a new parameter to retrieve the embedders at search time	2024-06-06 11:36:11 +02:00
meili-bors[bot]	98e062a714	Merge #4675 4675: Update actix-web 4.5.1 -> 4.6.0 r=dureuill a=dureuill # Pull Request - actix-web 4.5.1 -> 4.6.0 - actix-http 3.6.0 -> 3.7.0 - actix-web-static-files (commit 2d3b6160) -> 4.0.1 - tracing-actix-web 0.7.9 -> 0.7.10 - brotli 3.4.0 -> 6.0.0 ## Related issue Fixes #4625 Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-06-05 07:40:35 +00:00
Louis Dureuil	8412665957	Update actix-web 4.5.1 -> 4.6.0	2024-06-04 09:54:30 +02:00
meili-bors[bot]	fc584f1db3	Merge #4666 4666: Add a score threshold search parameter r=ManyTheFish a=dureuill # Pull Request ## Related issue Fixes https://github.com/meilisearch/meilisearch/issues/4609 ## What does this PR do? - See [usage](https://meilisearch.notion.site/Filter-by-score-usage-224a183ce7b24ca99b6a9a8da755668a?pvs=25#95b76ded400342ba9ab3d67c734836f0) and [the known limitation](https://meilisearch.notion.site/Filter-by-score-usage-224a183ce7b24ca99b6a9a8da755668a?pvs=25#e4e32195bf0e4195b5daecdbb7a97a17) Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-06-03 08:42:44 +00:00
Louis Dureuil	2b6db6541e	Changes after review	2024-06-03 10:30:00 +02:00
meili-bors[bot]	d6bd88ce4f	Merge #4667 4667: Frequency matching strategy r=Kerollmops a=ManyTheFish # Pull Request ## Related issue Fixes #3773 ## What does this PR do? - add test for matching strategy - implement frequency matching strategy See the [PRD for more details](https://www.notion.so/meilisearch/Frequency-Matching-Strategy-0f3ba08833a442a39590a53a1505ab00). [Public API](https://www.notion.so/meilisearch/frequency-matching-strategy-89868fb7fc584026bc56e378eb854a7f). Co-authored-by: ManyTheFish <many@meilisearch.com>	2024-05-30 14:53:31 +00:00
Louis Dureuil	c2fb7afe59	fmt	2024-05-30 12:06:46 +02:00
ManyTheFish	3f1a510069	Add tests and fix matching strategy	2024-05-30 12:02:42 +02:00
Louis Dureuil	41976b82b1	Tests for ranking_score_threshold	2024-05-30 11:22:26 +02:00
Louis Dureuil	c36410fcbf	Analytics for ranking score threshold	2024-05-30 11:22:12 +02:00
Louis Dureuil	7ce2691374	Add ranking score threshold to similar API	2024-05-30 11:21:31 +02:00
Louis Dureuil	c26db7878c	Expose rankingScoreThreshold in API	2024-05-30 10:32:35 +02:00
ManyTheFish	1ab88e10b9	Merge branch 'main' into merge-release-v1.8.1-in-main	2024-05-29 16:24:00 +02:00
ManyTheFish	6a4b2516aa	WIP	2024-05-29 16:21:24 +02:00
Louis Dureuil	2cf3e1c80a	Temporarily ignore perform snapshot test under Windows	2024-05-29 12:42:47 +02:00
Many the fish	e1fbfde6c4	Merge branch 'main' into merge-release-v1.8.1-in-main	2024-05-29 11:31:03 +02:00
Louis Dureuil	ca006e38ec	Basic tests	2024-05-28 15:28:19 +02:00
Louis Dureuil	e26bd87780	Error tests for similar routes	2024-05-28 15:28:19 +02:00
Louis Dureuil	c01e498a63	Test server can call similar	2024-05-28 15:28:19 +02:00
Louis Dureuil	ca6cc4654b	Add similar route	2024-05-28 15:28:19 +02:00
Louis Dureuil	3bd9d2478c	Add error codes	2024-05-28 15:27:43 +02:00
Louis Dureuil	54b15059a0	Analytics changes	2024-05-28 15:27:43 +02:00
Louis Dureuil	e172e938e7	add search rules directly takes the filter rather than the searchquery	2024-05-28 15:22:25 +02:00
Louis Dureuil	02b3d82c60	filtered_universe accepts index and txn instead of SearchContext	2024-05-28 15:22:12 +02:00
Clément Renault	487431a035	Fix tests	2024-05-27 16:12:20 +02:00
Clément Renault	b6d450d484	Remove puffin experimental feature	2024-05-27 15:59:28 +02:00
Clément Renault	7f3e51349e	Remove puffin for the dependencies	2024-05-27 15:53:06 +02:00
meili-bors[bot]	19acc65ad2	Merge #4646 4646: Reduce `Transform`'s disk usage r=Kerollmops a=Kerollmops This PR implements what is described in #4485. It reduces the number of disk writes and disk usage. Co-authored-by: Clément Renault <clement@meilisearch.com>	2024-05-23 16:06:50 +00:00
Clément Renault	fe17c0f52e	Construct the minimal OBKVs according to the settings diff	2024-05-23 11:23:57 +02:00
meili-bors[bot]	14bc80e3df	Merge #4633 4633: Allow to mark vectors as "userProvided" r=Kerollmops a=dureuill # Pull Request ## Related issue Fixes #4606 ## What does this PR do? [See usage in PRD](https://meilisearch.notion.site/v1-9-AI-search-changes-e90d6803eca8417aa70a1ac5d0225697#deb96fb0595947bda7d4a371100326eb) - Extends the shape of the special `_vectors` field in documents. - previously, the `_vectors` field had to be an object, with each field the name of a configured embedder, and each value either `null`, an embedding (array of numbers), or an array of embeddings. - In this PR, the value of an embedder in the `_vectors` field can additionally be an object. The object has two fields: 1. `embeddings`: `null`, an embedding (array of numbers), or an array of embeddings. 2. `userProvided`: a boolean indicating if the vector was provided by the user. - The previous form `embedder_or_array_of_embedders` is semantically equivalent to: ```json { "embeddings": embedder_or_array_of_embedders, "userProvided": true } ``` - During the indexing step, the subfields and values of the `_vectors` field that have `userProvided` set to false are added in the vector DB, but not in the documents DB: that means that future modifications of the documents will trigger a regeneration of that particular vector using the document template. - This allows importing embeddings as a one-shot process, while still retaining the ability to regenerate embeddings on document change. - The dump process now uses this ability: it enriches the `_vectors` fields of documents with the embeddings that were autogenerated, marking them as not `userProvided`. This allows importing the vectors from a dump without regenerating them. ### Tests This PR adds the following tests - Long-needed hybrid search tests of a simple hf embedder - Dump test that imports vectors. Due to the difficulty of actually importing a dump in tests, we just read the dump and check it contains the expected content. - Tests in the index-scheduler: this tests that documents containing the same kind of instructions as in the dump indexes as expected Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-05-23 08:17:54 +00:00
ManyTheFish	3e94a90722	Fixes	2024-05-21 13:39:46 +02:00
Tamo	7e251b43d4	Revert "Stream documents"	2024-05-20 15:09:45 +02:00
Louis Dureuil	afcd7b9f0c	Test hybrid search with hf embedder	2024-05-20 14:44:10 +02:00
Tamo	0f78703b85	add a test reproducing the bug	2024-05-20 10:58:08 +02:00
Louis Dureuil	d05d49ffd8	Fix tests	2024-05-20 10:36:18 +02:00
Tamo	273c6e8c5c	uses the latest version of heed to get rid of unsafe code	2024-05-16 18:31:32 +02:00
Tamo	c85d1752dd	keep the same rtxn to compute the filters on the documents and to stream the documents later on	2024-05-16 18:31:32 +02:00
Tamo	8e6ffbfc6f	stream documents	2024-05-16 18:31:32 +02:00
Tamo	673b6e1dc0	fix a flaky test	2024-05-16 11:28:14 +02:00
Tamo	f2d0a59f1d	when no searchable attributes are defined, makes all the weight equals to zero	2024-05-16 01:06:33 +02:00
Tamo	7ec4e2a3fb	apply all style review comments	2024-05-15 15:02:26 +02:00
Tamo	9fffb8e83d	make clippy happy	2024-05-14 17:36:32 +02:00
Tamo	caa6a7149a	make the attribute ranking rule use the weights and fix the tests	2024-05-14 17:36:32 +02:00
Clément Renault	f33a1282f8	Bump Rustls to v0.21.12	2024-05-07 10:31:39 +02:00
meili-bors[bot]	4d5971f343	Merge #4621 4621: Bring back changes from v1.8.0 into main r=curquiza a=curquiza Co-authored-by: ManyTheFish <many@meilisearch.com> Co-authored-by: Tamo <tamo@meilisearch.com> Co-authored-by: meili-bors[bot] <89034592+meili-bors[bot]@users.noreply.github.com> Co-authored-by: Clément Renault <clement@meilisearch.com>	2024-05-06 13:46:39 +00:00
Tamo	3698aef66b	fix warning	2024-05-06 11:36:37 +02:00
Simon Detheridge	7f5ab3cef5	Use http path pattern instead of full path in metrics	2024-05-03 12:29:31 +01:00
Louis Dureuil	5a305bfdea	Remove unused struct	2024-05-02 16:14:37 +02:00
ManyTheFish	88174b8ae4	Update charabia v0.8.10	2024-04-30 14:30:23 +02:00
meili-bors[bot]	c793b6ef6d	Merge #4600 4600: Fix embedders api r=ManyTheFish a=ManyTheFish # Pull Request ## Related issue Fixes #4594 Fixes #4595 Co-authored-by: ManyTheFish <many@meilisearch.com>	2024-04-25 13:16:33 +00:00
ManyTheFish	cbbfff3594	Remove debuging prints	2024-04-25 10:37:18 +02:00
ManyTheFish	7468c1cf8d	Introduce WildcardSetting that are serialized as wildcards by default	2024-04-24 18:15:03 +02:00
Clément Renault	d4aeff92d0	Introduce the ThreadPoolNoAbort wrapper	2024-04-24 16:40:12 +02:00
ManyTheFish	e87cb373de	Avoid intermediate serializing when displaying settings	2024-04-24 12:33:07 +02:00
Clément Renault	0c7003c5df	Introduce an atomic to catch panics in thread pools	2024-04-22 18:09:33 +02:00
meili-bors[bot]	aa0bbbb246	Merge #4578 4578: Remove useless analytics r=ManyTheFish a=irevoire # Pull Request ## Related issue Fixes #4577 ## What does this PR do? Remove the following analytics: - `Health Seen` - `Stats Seen` - `Task Seen` - `Version Seen` Co-authored-by: Tamo <tamo@meilisearch.com>	2024-04-18 13:30:42 +00:00
ManyTheFish	c71b5d09ff	Updatre charabia v0.8.9	2024-04-18 11:38:26 +02:00
meili-bors[bot]	2dfee2fad5	Merge #4580 4580: Update the search logs r=Kerollmops a=irevoire # Pull Request ## Related issue Fixes https://github.com/meilisearch/meilisearch/issues/4579 ## What does this PR do? - Update the debug implementation of the search query and search results so it’s way smaller and doesn’t display useless information Co-authored-by: Tamo <tamo@meilisearch.com>	2024-04-17 14:25:43 +00:00
Tamo	4a68e9f6ae	reorganize the debug implementation of the search results and only dispaly the meaningful informations	2024-04-17 13:42:10 +02:00
Tamo	206887c7a2	update the SearchQuery Debug implementation so it’s smaller and gives the most important informations first	2024-04-17 12:57:19 +02:00
Tamo	2dd9dd6d0a	remove the Health Seen analytic	2024-04-17 11:43:40 +02:00
Tamo	e1f27de51a	remove the Stats Seen analytic	2024-04-16 18:49:41 +02:00
Tamo	abae31aee0	remove the Task Seen analytic	2024-04-16 18:48:10 +02:00
Tamo	70ce0095ea	remove the Version Seen analytic	2024-04-16 18:48:03 +02:00
ManyTheFish	a1ea224da9	Fix tests	2024-04-16 17:29:34 +02:00
ManyTheFish	a489b406b4	fix test	2024-04-16 14:39:06 +02:00
Louis Dureuil	a9013ed683	Fix comment mistake Co-authored-by: Tamo <tamo@meilisearch.com>	2024-04-04 17:21:47 +02:00
Louis Dureuil	ca499a0302	Fix test after rebase	2024-04-04 16:04:07 +02:00
Louis Dureuil	355e5282b2	Remove `_semanticScore`	2024-04-04 16:04:07 +02:00
Louis Dureuil	7c27417a5d	Add tests	2024-04-04 16:04:07 +02:00
Louis Dureuil	1ff2a2d6fb	Add semanticHitCount	2024-04-04 16:04:06 +02:00
Louis Dureuil	4564a38ae7	Bail earlier when the experimental feature is not enabled	2024-04-04 15:58:19 +02:00
Louis Dureuil	6ebb6b55a6	Lazily embed, don't fail hybrid search on embedding failure	2024-04-04 15:58:17 +02:00
Louis Dureuil	190933f6e1	Breaking: Remove vector from SearchResult	2024-04-04 15:57:29 +02:00
Louis Dureuil	928e6e4c05	Breaking change: remove vector for score details	2024-04-04 15:57:29 +02:00
meili-bors[bot]	5509bafff8	Merge #4535 4535: Support Negative Keywords r=ManyTheFish a=Kerollmops This PR fixes #4422 by supporting `-` before any word in the query. The minus symbol `-`, from the ASCII table, is not the only character that can be considered the negative operator. You can see the two other matching characters under the `Based on "-" (U+002D)` section on [this unicode reference website](https://www.compart.com/en/unicode/U+002D). It's important to notice the strange behavior when a query includes and excludes the same word; only the derivative ( synonyms and split) will be kept: - If you input `progamer -progamer`, the engine will still search for `pro gamer`. - If you have the synonym `like = love` and you input `like -like`, it will still search for `love`. ## TODO - [x] Add analytics - [x] Add support to the `-` operator - [x] Make sure to support spaces around `-` well - [x] Support phrase negation - [x] Add tests Co-authored-by: Clément Renault <clement@meilisearch.com>	2024-04-04 13:10:27 +00:00
Clément Renault	90e812fc0b	Add some tests	2024-04-04 15:08:37 +02:00
meili-bors[bot]	56bf8503db	Merge #4537 4537: Expose distribution shift in settings r=ManyTheFish a=dureuill See [usage page](https://meilisearch.notion.site/v1-8-AI-search-API-usage-135552d6e85a4a52bc7109be82aeca42#d652adc0890445658aaf36352dbc8802) # Changes - Distribution shift added to all embedders. - Exposed in settings - Changed the reindexing logic to not trigger a reindex operation when only the distribution shift or API key change Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-04-03 09:08:58 +00:00
meili-bors[bot]	78668584cd	Merge #4533 4533: Hide api key in settings and task queue r=dureuill a=dureuill # Pull Request See [Usage page](https://meilisearch.notion.site/v1-8-AI-search-API-usage-135552d6e85a4a52bc7109be82aeca42#117f5ff7b19f4d95bb3ae0005f6c6633) ## Motivation See [slack discussion (internal link)](https://meilisearch.slack.com/archives/C06GQP7FQ6P/p1709804022298749) ## Changes - The value of the `apiKey` parameter is now hidden in the settings and the details of the task queue. Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-28 16:02:53 +00:00
meili-bors[bot]	fa9748cc99	Merge #4536 4536: Limit concurrent search requests r=ManyTheFish a=irevoire # Pull Request ## Related issue Fixes https://github.com/meilisearch/meilisearch/issues/4489 ## What does this PR do? - Adds a « search queue » that limits the number of search requests we can process at the same time and stores search requests to be processed - Process only one search request per core/thread (we use available_parallelism) - When the search queue is full, new search requests replace old ones randomly. The reason is that: - If we serve the oldest one first, like Typesense, we give the worst performances to everyone - If we serve the latest one, it gets too easy to DoS us (you just need to fill the queue with as many search requests as we can process simultaneously to ensure no other request will ever be processed) - By picking the search request randomly, we give a chance to recent search requests to be processed while ensuring that we can't be owned unless they fill our queue entirely and we start returning errors 5xx - Adds an experimental parameter to control the size of the queue - Adds a bunch of tests to ensure the search queue works correctly - Ensure the loop consuming the search queue is running in the health route and crashes if it’s not the case Co-authored-by: Tamo <tamo@meilisearch.com>	2024-03-28 15:01:52 +00:00
Tamo	06a11b5b21	Improve error message	2024-03-27 17:34:49 +01:00
Tamo	b7c582e4f3	connect the search queue with the health route	2024-03-27 15:49:43 +01:00
Tamo	03c886ac1b	adds a bit of documentation	2024-03-27 15:38:36 +01:00
Louis Dureuil	cde7ce4f44	Add test	2024-03-27 14:02:09 +01:00
Tamo	087a96d22e	fix flaky test	2024-03-27 11:05:37 +01:00
meili-bors[bot]	34dfea72cc	Merge #4509 4509: Rest embedder r=ManyTheFish a=dureuill Fixes #4531 See [Usage page](https://meilisearch.notion.site/v1-8-AI-search-API-usage-135552d6e85a4a52bc7109be82aeca42?pvs=25#e6f58c3b742c4effb4ddc625ce12ee16) ### Implementation changes - Remove tokio, futures, reqwests - Add a new `milli::vector::rest::Embedder` embedder - Update OpenAI and Ollama embedders to use the REST embedder internally - Make Embedder::embed a sync method - Add the new embedder source as described in the usage Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-27 09:27:46 +00:00
Tamo	3a1f458139	fix a flaky test	2024-03-26 21:06:55 +01:00
Tamo	55df9daaa0	adds a comment about the safety of an operation	2024-03-26 19:34:55 +01:00
Tamo	2e36f069c2	fmt imports	2024-03-26 19:23:55 +01:00
Tamo	8f5d9f501a	update the discussion link	2024-03-26 19:18:32 +01:00
Tamo	8127c9a115	handle the case of a queue of zero elements	2024-03-26 19:04:39 +01:00
Tamo	e7704f1fc1	add a test to ensure we effectively returns a retry-after when the search queue is full	2024-03-26 18:08:59 +01:00
Clément Renault	34262c7a0d	Add analytics for the negative operator	2024-03-26 18:01:27 +01:00
Tamo	e2a1bbae37	simplify and improve the http error	2024-03-26 17:53:37 +01:00
Tamo	e433fd53e6	rename the method to get a permit and use it in all search requests	2024-03-26 17:28:03 +01:00
Tamo	3f23fbb46d	create the experimental CLI argument	2024-03-26 16:43:40 +01:00
Tamo	c41e1274dc	push and test the search queue datastructure	2024-03-26 15:56:43 +01:00
Louis Dureuil	9a95ed619d	Add tests	2024-03-26 10:36:56 +01:00
Louis Dureuil	f82d056072	Hide secrets in settings and task queue	2024-03-26 10:36:24 +01:00
meili-bors[bot]	5ea017b922	Merge #4530 4530: fix: set the histogram bucket boundaries to follow the otel spec r=curquiza a=rohankmr414 # Pull Request ## What does this PR do? - Fixes the http request duration histogram bucket boundaries to follow the opentelemetry spec, currently the bucket boundaries are too granular and only track latencies below 1s. ## PR checklist Please check if your PR fulfills the following requirements: - [ ] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)? - [x] Have you read the contributing guidelines? - [x] Have you made sure that the title is accurate and descriptive of the changes? Thank you so much for contributing to Meilisearch! Co-authored-by: Rohan Kumar <rohankmr414@gmail.com>	2024-03-25 12:23:31 +00:00
Louis Dureuil	dfa5e41ea6	Check validity of the URL setting	2024-03-25 11:23:16 +01:00
Louis Dureuil	f649f58013	embed no longer async	2024-03-25 11:23:03 +01:00
Rohan Kumar	13a84ae557	fix: set the histogram bucket boundaries to follow the otel spec	2024-03-25 11:20:30 +05:30
Rohan Kumar	5833070358	feat: add status code label to prometheus http request counter	2024-03-25 10:49:40 +05:30
Tamo	d8fe4fe49d	return the order in the score details	2024-03-19 15:45:04 +01:00
Tamo	0ae39644f7	fix the facet search	2024-03-19 15:07:06 +01:00
Tamo	4369e9e97c	add an error code test on the setting	2024-03-19 11:14:28 +01:00
Tamo	7bd881b9bc	adds the degraded searches to the prometheus dashboard	2024-03-19 10:35:47 +01:00
Tamo	6a0c399c2f	rename the search_cutoff parameter to search_cutoff_ms	2024-03-19 10:35:47 +01:00
Tamo	038c26c118	stop returning the degraded boolean when a search was cutoff	2024-03-19 10:35:47 +01:00
Tamo	ad9192fbbf	reduce the size of an integration test	2024-03-19 10:35:47 +01:00
Tamo	b8cda6c300	fix the search cutoff and add a test	2024-03-19 10:35:47 +01:00
Tamo	b72495eb58	fix the settings tests	2024-03-19 10:28:23 +01:00
Tamo	d1db495119	add a settings for the search cutoff	2024-03-19 10:28:23 +01:00
Tamo	4a467739cd	implements a first version of the cutoff without settings	2024-03-19 10:28:21 +01:00
shuangcui	5c95b5c933	chore: remove repetitive words Signed-off-by: shuangcui <fliter@qq.com>	2024-03-14 21:28:55 +08:00
meili-bors[bot]	abd954755d	Merge #4476 4476: Make the `/facet-search` route use the `sortFacetValuesBy` setting r=irevoire a=Kerollmops This PR fixes #4423 by ensuring that the `/facet-search` route uses the `sortFacetValuesBy` setting. Note for the documentation team (to be moved in the tracking issue): Using the new `sortFacetValuesBy` setting can slow down the facet-search requests as Meilisearch iterates over the whole list of facet values and computes the count of documents on every entry. That is hardly or even impossible to optimize correctly. ### TODO - [x] Create a custom HashMap wrapper for the facet `OrderBy` settings. This wrapper will return the `OrderBy` setting of the facet, if not defined will use the default `*` one, and if not there either (strange) will fall back on the lexicographic one. - [x] Create a `ValuesCollection` wrapper that implements the logic for the lexicographic and count order by. - [x] Use it when there is no search query. - [x] Use it when there is a search query with and without allowed typos. - [x] Do not change the original logic, only use a wrapper. - [x] Add tests Co-authored-by: Clément Renault <clement@meilisearch.com>	2024-03-13 14:36:14 +00:00
Clément Renault	6c9823d7bb	Add tests to sortFacetValuesBy count	2024-03-13 11:59:39 +01:00
Clément Renault	9f7a4fbfeb	Return the facets of a placeholder facet-search sorted by count	2024-03-13 10:09:01 +01:00
meili-bors[bot]	5ed7b6a0b2	Merge #4456 4456: Add Ollama as an embeddings provider r=dureuill a=jakobklemm # Pull Request ## Related issue [Related Discord Thread](https://discord.com/channels/1006923006964154428/1211977150316683305) ## What does this PR do? - Adds Ollama as a provider of Embeddings besides HuggingFace and OpenAI under the name `ollama` - Adds the environment variable `MEILI_OLLAMA_URL` to set the embeddings URL of an Ollama instance with a default value of `http://localhost:11434/api/embeddings` if no variable is set - Changes some of the structs and functions in `openai.rs` to be public so that they can be shared. - Added more error variants for Ollama specific errors - It uses the model `nomic-embed-text` as default, but any string value is allowed, however it won't automatically check if the model actually exists or is an embedding model Tested against Ollama version `v0.1.27` and the `nomic-embed-text` model. ## PR checklist Please check if your PR fulfills the following requirements: - [x] Does this PR fix an existing issue, or have you listed the changes applied in the PR description (and why they are needed)? - [x] Have you read the contributing guidelines? - [x] Have you made sure that the title is accurate and descriptive of the changes? Co-authored-by: Jakob Klemm <jakob@jeykey.net> Co-authored-by: Louis Dureuil <louis.dureuil@gmail.com>	2024-03-13 08:48:47 +00:00
Clément Renault	d3a95ea2f6	Introduce a new OrderByMap struct to simplify the sort by usage	2024-03-12 13:56:56 +01:00
Clément Renault	69c118ef76	Extract the facet order before extracting the facets values	2024-03-12 10:35:39 +01:00
Tamo	8ec3e30d2b	Merge branch 'main' into tmp-release-v1.7.0	2024-03-11 15:39:51 +01:00
Tamo	f053c280e1	add tests when the field limit is reached	2024-03-06 18:42:41 +01:00
meili-bors[bot]	4d42a7af7c	Merge #4445 4445: Add subcommand to run benchmarks r=irevoire a=dureuill # Pull Request ## Related issue Not user-facing, no issue ## What does this PR do? - Adds a new `cargo xtask bench` subcommand that can run one or multiple workload files and report the results to a server - A workload file is a JSON file with a specific schema - Refactor our use of the `vergen` crate: - update to the beta `vergen-git2` crate - VERGEN_GIT_SEMVER_LIGHTWEIGHT => VERGEN_GIT_DESCRIBE - factor logic in a single `build-info` crate that is used both by meilisearch and xtask (prevents vergen variables from overriding themselves) - checked that defining the variables by hand when no git repo is available (docker build case) still works. - Add CI to run `cargo xtask bench` Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-03-05 14:03:57 +00:00
Louis Dureuil	7408db2a46	Meilisearch: fix date formatting	2024-03-05 14:56:48 +01:00
Louis Dureuil	15c38dca78	Output RFC 3339 dates where we can Co-authored-by: Tamo <tamo@meilisearch.com>	2024-03-05 14:44:48 +01:00
Tamo	b130917933	add the content type in the webhook + improve the test	2024-03-05 11:22:29 +01:00
Louis Dureuil	c608b3f9b5	Factor vergen stuff to a build-info crate	2024-03-05 10:11:43 +01:00
Jakob Klemm	d3004d8040	Implemented Ollama as an embeddings provider Initial prototype of Ollama embeddings actually working, error handlign / retries still missing. Allow model to be any String and require dimensions parameter Fixed rustfmt formatting issues There were some formatting issues in the initial PR and this should not make the changes comply with the Rust style guidelines Because I accidentally didn't follow the style guide for commits in my commit messages I squashed them into one to comply	2024-03-04 15:09:43 +01:00
Louis Dureuil	452a343a2b	Fix imports	2024-02-28 18:09:40 +01:00
meili-bors[bot]	b87485e80d	Merge #4433 4433: Enhance facet incremental r=Kerollmops a=ManyTheFish # Pull Request ## Related issue Fixes #4367 Fixes #4409 ## What does this PR do? - Add a test reproducing #4409 - Fix #4409 by removing a document from a level only if it is no more present in all the linked sub-level nodes - Optimize facet Incremental indexing by creating or deleting a complete level once per field id instead of for each facet value - Optimize facet Incremental indexing by doing the additions and the deletions in the same process instead of doing them separately Co-authored-by: ManyTheFish <many@meilisearch.com>	2024-02-28 15:28:46 +00:00
Louis Dureuil	716ffc07ee	Build the embedders when importing a dump	2024-02-26 22:15:57 +01:00
meili-bors[bot]	9e664d87eb	Merge #4443 4443: Add GPU analytics r=dureuill a=dureuill # Pull Request ## Related issue Adds analytics indicating whether Meilisearch was compiled with the `milli/cuda` feature. Cc `@macraig` Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-02-26 17:13:45 +00:00
Tamo	0562818c2a	fix and remove the file-store hack of /dev/null	2024-02-26 13:59:41 +01:00
Tamo	a478392b7a	create a test with the dry-run parameter enabled	2024-02-26 13:59:41 +01:00
Tamo	bbf3fb88ca	rename the cli parameter	2024-02-26 13:59:40 +01:00
Tamo	60510e037b	update the discussion link	2024-02-26 13:58:04 +01:00
Tamo	36c27a18a1	implement the dry run ha parameter	2024-02-26 13:58:04 +01:00
Tamo	1eb1c043b5	disable the auto deletion of tasks when the ha mode is enabled	2024-02-26 13:58:04 +01:00
Tamo	507739bd98	add an experimental cli parameter to allow specifying your task id	2024-02-26 13:58:03 +01:00
Tamo	eb25b07390	let you specify your task id	2024-02-26 13:56:31 +01:00
meili-bors[bot]	938149f814	Merge #4042 4042: Implements the new replication parameters r=ManyTheFish a=irevoire ### This PR implements the necessary parameters for the High Availability - [ ] Update the spec Introduce a new CLI flag called `--experimental-replication-parameters` that changes a few behaviors in the engine: - [The auto-deletion of tasks is disabled](https://specs.meilisearch.com/specifications/text/0060-tasks-api.html#_2-technical-details) - Upon registering a task, you can choose its task ID by sending a new header: `TaskId: 456645`. It must be a valid number, which must be superior to the last task id ever seen. - Add the ability to « dry-register » a task. That means meilisearch will answer to you with a valid task ID like everything went well, but won’t actually write anything in the database. To do that, you need to use the `DryRun: true` header. ---- Old prototype `prototype-custom-task-id-0`: - Adds the capability to specify your own task ID via the `TaskId` http header - Make the task IDs a u64 instead of a u32 Co-authored-by: Tamo <tamo@meilisearch.com>	2024-02-26 11:37:34 +00:00
Louis Dureuil	55796406c5	Add GPU analytics	2024-02-26 10:41:47 +01:00
Tamo	eb90f0b4fb	fix and remove the file-store hack of /dev/null	2024-02-26 10:19:07 +01:00
Tamo	c2e2003a80	create a test with the dry-run parameter enabled	2024-02-22 15:51:47 +01:00
Tamo	693ba8dd15	rename the cli parameter	2024-02-21 14:33:40 +01:00
Tamo	e1a3eed1eb	update the discussion link	2024-02-21 12:30:28 +01:00
Tamo	05ae291989	implement the dry run ha parameter	2024-02-21 11:21:26 +01:00
Tamo	6ba9994916	disable the auto deletion of tasks when the ha mode is enabled	2024-02-20 12:23:39 +01:00
Tamo	01ae46dd80	add an experimental cli parameter to allow specifying your task id	2024-02-20 11:24:44 +01:00
Tamo	9ee4f55e6c	let you specify your task id	2024-02-19 14:29:33 +01:00
ManyTheFish	865b415b3f	Add test rerpoducing bug	2024-02-15 16:00:48 +01:00
Tamo	4148d391b8	move logs to stderr	2024-02-15 15:24:16 +01:00
meili-bors[bot]	88c6165e20	Merge #4410 4410: Implement the experimental log mode cli flag and log level updates at runtime r=dureuill a=irevoire # Pull Request This PR fixes two issues at once because they’re highly correlated in the codebase. ## Related issue Fixes https://github.com/meilisearch/meilisearch/issues/4415 Fixes https://github.com/meilisearch/meilisearch/issues/4413 ## What does this PR do? - It makes the fmt logger configurable to output json or human-readable logs (like we already do today) - It moves the fmt logger under a `reload` layer so we can update its targets at runtime - Add the possibility to stream logs in the json mode - Adds an analytics for the new CLI flag Co-authored-by: Tamo <tamo@meilisearch.com>	2024-02-15 10:01:06 +00:00
Tamo	d097431113	Update meilisearch/src/option.rs Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-02-15 10:58:43 +01:00
Tamo	1f8af81ba9	update the log mode discussion link	2024-02-15 10:32:48 +01:00
Tamo	5d3bad4120	Update meilisearch/src/option.rs Co-authored-by: Louis Dureuil <louis@meilisearch.com>	2024-02-15 10:31:23 +01:00
Tamo	a081da0d90	add support for the json format in the stream route	2024-02-14 15:34:39 +01:00
Tamo	3b6544db6d	Implement the experimental log mode cli flag	2024-02-13 18:09:15 +01:00
ManyTheFish	55e942cd45	buggy	2024-02-13 15:26:30 +01:00
Eric Long	c02d585f5b	Upgrade rustls to 0.21.10 and ring to 0.17	2024-02-12 14:32:29 +08:00
Tamo	285aa15d2f	make the mode camelCase instead of lowercase	2024-02-08 15:04:06 +01:00
Tamo	2c88131bb1	rename the fmt mode to human	2024-02-08 15:04:06 +01:00
Tamo	35aa9d5904	fix an error message	2024-02-08 15:04:06 +01:00
Tamo	cfb3e6b51f	update the actix-web trace	2024-02-08 15:04:06 +01:00
Tamo	1502382316	use debug instead of debug_span	2024-02-08 15:04:06 +01:00
Louis Dureuil	ef994d84d0	Change error messages and fix tests	2024-02-08 15:04:06 +01:00
Louis Dureuil	1b74010e9e	Remove "with_line_numbers"	2024-02-08 15:04:06 +01:00
Tamo	08af0e690c	Structures a bunch of logs	2024-02-08 15:04:06 +01:00
Louis Dureuil	d71b77f18b	Add panic hook to log panics	2024-02-08 15:04:06 +01:00
Louis Dureuil	91eb67e981	logs route: make memory profiling toggling usable	2024-02-08 15:04:05 +01:00
Tamo	f70a615ed9	update the github discussion links	2024-02-08 15:04:05 +01:00
Tamo	7ff722b72e	get rids of the log dependencies everywhere	2024-02-08 15:04:05 +01:00
Tamo	bcf7909bba	add a profile_memory parameter disabled by default	2024-02-08 15:04:05 +01:00
Tamo	ceb211c515	move the /logs route to the /logs/stream route	2024-02-08 15:04:05 +01:00
Tamo	4de2db6786	add back the actix-web logs	2024-02-08 15:04:05 +01:00
Louis Dureuil	661baa716b	logs route profile mode: don't barf bytes if the buffer is not empty	2024-02-08 15:04:05 +01:00
Clément Renault	b393823f36	Replace stats_alloc with procfs	2024-02-08 15:04:05 +01:00
Tamo	f158e96fe7	fix the auth	2024-02-08 15:04:05 +01:00
Tamo	e23ec4886d	fix the tests and add tests on the experimental features	2024-02-08 15:04:03 +01:00
Tamo	7793ba67a4	hide the route logs behind a feature flag	2024-02-08 15:03:33 +01:00
Tamo	80774148fd	handle and tests errors	2024-02-08 15:03:33 +01:00
Tamo	bf5cea8b10	add a test	2024-02-08 15:03:33 +01:00
Louis Dureuil	38e1c40f38	meilisearch: logs route disconnects in profile mode	2024-02-08 15:03:33 +01:00
Louis Dureuil	afc0585c1c	meilisearch: don't spawn a report everytime Meilisearch starts	2024-02-08 15:03:33 +01:00
Tamo	77254765e8	get rids of env loggegr and fix the tests	2024-02-08 15:03:33 +01:00
Tamo	ce6e6ec2c5	stops profiling in a file by default	2024-02-08 15:03:32 +01:00
Louis Dureuil	91a8f74763	Add cancel log route	2024-02-08 15:03:32 +01:00
Tamo	abaa72e2bf	start handling reloads with profiling	2024-02-08 15:03:32 +01:00
Tamo	3c3a258a22	start exposing the profiling layer	2024-02-08 15:03:32 +01:00
Louis Dureuil	73e66d5a97	Add dummy log when calling tasks	2024-02-08 15:03:32 +01:00
Louis Dureuil	b8da117b9c	Simplify stream implementation	2024-02-08 15:03:32 +01:00
Louis Dureuil	5e52107474	better than before???	2024-02-08 15:03:32 +01:00
Tamo	bcf1c4dae5	make it compile and runtime error	2024-02-08 15:03:32 +01:00
Tamo	50f84d43f5	init commit	2024-02-08 15:03:32 +01:00
Tamo	f76cc0806e	WIP: first draft at introducing a new log route	2024-02-08 15:03:32 +01:00
Louis Dureuil	6e23040464	Use with tokio channel in Meilisearch	2024-02-08 15:03:32 +01:00
Clément Renault	ca8990394e	Remove the stats_alloc from the default features	2024-02-08 15:03:31 +01:00
Clément Renault	83fb2949c3	Give the allocator to the tracer when necessary	2024-02-08 15:03:31 +01:00
Clément Renault	771861599b	Logging the memory usage over time	2024-02-08 15:03:31 +01:00
Louis Dureuil	7e47cea0c4	Add tracing to Meilisearch	2024-02-08 15:03:31 +01:00
Louis Dureuil	05edd85d75	Stabilize scoreDetails	2024-02-06 11:15:19 +01:00
Morgane Dubus	880e790bff	Update Cargo.toml	2024-02-01 10:33:27 +01:00
Tamo	318843aacd	add a bunch of tests and fix the error message when adding the geosearch as filterable/sortable while there is malformed documents in the DB	2024-02-01 10:33:27 +01:00
Louis Dureuil	6d111139b5	Add test	2024-02-01 10:33:27 +01:00
Tamo	c1bf33a112	Revert "Remove panic on the geosearch"	2024-01-25 18:51:19 +01:00
Tamo	7d190d8078	add a bunch of tests and fix the error message when adding the geosearch as filterable/sortable while there is malformed documents in the DB	2024-01-17 15:51:52 +01:00
Clément Renault	50e1d34c66	Rollback http to 0.2.11	2024-01-16 16:57:33 +01:00
Clément Renault	406531c991	Fix sysinfo	2024-01-16 16:49:51 +01:00
Clément Renault	0c8d1644a6	Rollback rustls to 0.20.9	2024-01-16 15:55:16 +01:00
Clément Renault	5e0268d40e	Fix the sysinfo errors	2024-01-16 15:43:03 +01:00
Clément Renault	7f125bfb12	Update incompatible dependencies	2024-01-16 15:15:54 +01:00
Clément Renault	5869ca7716	Upgrade all compatible dependencies	2024-01-16 15:05:03 +01:00
Louis Dureuil	38abfec611	Fix tests	2024-01-11 21:35:30 +01:00
meili-bors[bot]	e93d36d5b9	Merge #4313 4313: Fix document formatting performances r=Kerollmops a=ManyTheFish reduce the formatted option list to the attributes that should be formatted, instead of all the attributes to display. The time to compute the `format` list scales with the number of fields to format; cumulated with `map_leaf_values` that iterates over all the nested fields, it gives a quadratic complexity: `d*f` where `d` is the total number of fields to display and `f` is the total number of fields to format. Co-authored-by: ManyTheFish <many@meilisearch.com>	2024-01-11 14:19:44 +00:00
ManyTheFish	95f8e21533	fix typos	2024-01-11 15:07:08 +01:00
ManyTheFish	b79b03d4e2	Fix proximity precision telemetry	2024-01-11 13:24:26 +01:00
ManyTheFish	86270e6878	Transform fields contained into _format into strings	2024-01-11 12:44:56 +01:00
ManyTheFish	81b6128b29	Update tests	2024-01-11 12:28:32 +01:00
ManyTheFish	5f5a486895	Reduce formatting time	2024-01-11 11:36:41 +01:00
Clément Renault	3f3462ab62	Limit the number of values returned by the facet search	2024-01-10 16:54:08 +01:00

... 3 4 5 6 7 ...

1012 Commits