Commit Graph

229 Commits

Author SHA1 Message Date
Clément Renault
8d71112dcb
Rewrite the phrase query postings lists
This simplified the multiword_rewrite_matches function a little bit.
2019-12-13 14:38:23 +01:00
Clément Renault
dd03a6256a
Debug pre filtered number of documents 2019-12-13 14:38:23 +01:00
Clément Renault
9c03bb3428
First probably working phrase query doc filtering 2019-12-13 14:38:23 +01:00
Clément Renault
22b19c0d93
Fix the processed distance algorithm 2019-12-13 14:38:22 +01:00
Clément Renault
0f698d6bd9
Work in progress: Bad Typo detection
I have an issue where "speakers" is split into "speaker" and "s",
when I compute the distances for the Typo criterion,
it takes "s" into account and put a distance of zero in the bucket 0
(the "speakers" bucket), therefore it reports any document matching "s"
without typos as best results.

I need to make sure to ignore "s" when its associated part "speaker"
doesn't even exist in the document and is not in the place
it should be ("speaker" followed by "s").

This is hard to think that it will had much computation time to
the Typo criterion like in the previous algorithm where I computed
the real query/words indexes based and removed the invalid ones
before sending the documents to the bucket sort.
2019-12-13 14:38:22 +01:00
Clément Renault
4e91b31b1f
Make the Typo and Words work with synonyms 2019-12-13 14:38:22 +01:00
Clément Renault
f87c67fcad
Improve the QueryEnhancer by doing a single lookup 2019-12-13 14:38:22 +01:00
Clément Renault
902625601a
Work in progress: It seems like we support synonyms, split and concat words 2019-12-13 14:38:22 +01:00
Clément Renault
d17d4dc5ec
Add more debug infos 2019-12-13 14:38:21 +01:00
Clément Renault
ef6a4db182
Before improving fields AttrCount
Removing the fields_count fetching reduced by 2 times the serach time, we should look at lazily pulling them form the criterions in needs

ugly-test: Make the fields_count fetching lazy

Just before running the exactness criterion
2019-12-13 14:38:21 +01:00
Clément Renault
11f3d7782d
Introduce the AttrCount type 2019-12-13 14:38:21 +01:00
Clément Renault
951f0bcb10
sqaush-me: Improve benchmarks naming 2019-12-13 14:17:40 +01:00
Clément Renault
d8ba405baf
Add some criterion benchmarks to help mesure improvements 2019-12-13 14:17:40 +01:00
Quentin de Quelen
3a4130f344 Allow to index files with null or boolean 2019-12-12 19:25:05 +01:00
Quentin de Quelen
88b3c05155 Stop words; Do not reindex all documents if there is no documents 2019-12-12 15:31:39 +01:00
Quentin de Quelen
a4f26e8e48 Rewrite the synonym endpoint 2019-12-12 12:47:02 +01:00
Clément Renault
dc1849d291
Bump heed to 0.6.1 2019-12-07 11:49:45 +01:00
Clément Renault
29fd54dcfa
Allow users to send csv files from stdin in examples 2019-12-05 12:23:56 +01:00
Thomas Payet
51636402c2 Add debian package in CI 2019-12-04 18:02:30 +01:00
Clément Renault
4f87465f18
Bump meilisearch crates to v0.8.4 2019-12-03 17:22:45 +01:00
qdequele
773a51e7d0
Rename 'update_type' to 'type' on EnqueuedUpdateResult 2019-11-29 15:09:48 +01:00
qdequele
7923752513
Serialize updates results to camelCase 2019-11-29 15:05:54 +01:00
Clément Renault
30cb60f679
Bump meilisearch crates to v0.8.3 2019-11-29 14:06:17 +01:00
qdequele
3a90233a3d
Add status failed on UpdateStatus 2019-11-28 18:41:11 +01:00
Clément Renault
9a2b4d08e1
Bump meilisearch crates to v0.8.2 2019-11-28 17:15:13 +01:00
Clément Renault
1def56ea11
Change the update loop to be more explicit on index clear 2019-11-27 13:43:28 +01:00
Clément Renault
f6fb31c531
Bump meilisearch crates to v0.8.1 2019-11-27 11:47:27 +01:00
Clément Renault
d08b76a323
Separate the update and main databases
We used the heed typed transaction to make it safe (https://github.com/Kerollmops/heed/pull/27).
2019-11-27 11:29:06 +01:00
Clément Renault
7cc096e0a2
Rename MeiliDB into MeiliSearch 2019-11-26 11:12:30 +01:00