Louis Dureuil
|
3d8024fb2b
|
write the weighted fields ids map
|
2024-09-30 11:35:03 +02:00 |
|
Louis Dureuil
|
4b0da0ff24
|
Fix inversion of field_id and position
|
2024-09-30 11:34:50 +02:00 |
|
ManyTheFish
|
960060ebdf
|
Fix fst builder when there is no previous FST
|
2024-09-25 16:53:00 +02:00 |
|
Clément Renault
|
3d244451df
|
Reduce the lru key size from 8 to 12 bytes
|
2024-09-25 16:14:13 +02:00 |
|
Clément Renault
|
5f53935c8a
|
Fix a bug in the Lru
|
2024-09-25 16:09:34 +02:00 |
|
Clément Renault
|
29a7623c3f
|
Fix some logs
|
2024-09-25 15:57:50 +02:00 |
|
Clément Renault
|
e97041f7d0
|
Replace the Lru free list by a simple increment
|
2024-09-25 15:55:52 +02:00 |
|
Clément Renault
|
52d7f3ed1c
|
Reduce the lru key size from 20 to 8 bytes
|
2024-09-25 15:37:13 +02:00 |
|
Clément Renault
|
86d5e6d9ff
|
Use the new Lru
|
2024-09-25 14:54:56 +02:00 |
|
Clément Renault
|
759b9b1546
|
Introduce a new custom Lru
|
2024-09-25 14:49:12 +02:00 |
|
ManyTheFish
|
3f7a500f3b
|
Build prefix fst
|
2024-09-25 14:36:06 +02:00 |
|
ManyTheFish
|
974272f2e9
|
Merge branch 'main' into indexer-edition-2024
|
2024-09-25 07:41:16 +02:00 |
|
Clément Renault
|
7ad037841f
|
Move the tracing info to eprintln
|
2024-09-24 18:21:58 +02:00 |
|
Clément Renault
|
e0c7067355
|
Expose an IndexedParallelIterator to the index function
|
2024-09-24 17:24:59 +02:00 |
|
ManyTheFish
|
6e87332410
|
Change the way the FST is built
|
2024-09-24 16:28:31 +02:00 |
|
Clément Renault
|
2d1caf27df
|
Use eprintln to log
|
2024-09-24 15:59:50 +02:00 |
|
Clément Renault
|
7f148c127c
|
Measure the SmallVec efficacy
|
2024-09-24 15:32:15 +02:00 |
|
Clément Renault
|
4ce5d3d66d
|
Do not check before pushing in bitmaps
|
2024-09-24 09:43:16 +02:00 |
|
Clément Renault
|
42b093687d
|
Introduce the new PushOptimizedBitmap
|
2024-09-23 16:38:21 +02:00 |
|
Clément Renault
|
f00664247d
|
Add more stats about the channel message sent
|
2024-09-23 15:13:52 +02:00 |
|
Clément Renault
|
013acb3d93
|
Measure merger writer channel contention
|
2024-09-23 11:07:59 +02:00 |
|
Tamo
|
1113c42de0
|
fix broken comments
|
2024-09-19 16:18:36 +02:00 |
|
Tamo
|
b6b73fe41c
|
Update milli/src/update/settings.rs
Co-authored-by: Louis Dureuil <louis@meilisearch.com>
|
2024-09-19 15:41:14 +02:00 |
|
Tamo
|
163f8023a1
|
remove debug println
|
2024-09-19 12:13:25 +02:00 |
|
Tamo
|
633537ccd7
|
fix updating documents without updating the settings
|
2024-09-19 12:00:58 +02:00 |
|
Tamo
|
3f6301dbc9
|
fix the missing embedder name in the error message when trying to disable the binary quantization
|
2024-09-19 12:00:58 +02:00 |
|
Tamo
|
2b6952eda1
|
rename the ArroyReader to an ArroyWrapper since it can read and write
|
2024-09-19 12:00:58 +02:00 |
|
Tamo
|
79f29eed3c
|
fix the tests and the arroy_readers method
|
2024-09-19 12:00:58 +02:00 |
|
Tamo
|
cc45e264ca
|
implement the binary quantization in meilisearch
|
2024-09-19 12:00:56 +02:00 |
|
Clément Renault
|
f4ab1f168e
|
Prefer using Rc<str> over String when cloning a lot
|
2024-09-16 15:41:29 +02:00 |
|
ManyTheFish
|
1a0e962299
|
Replace hashmap by vectors in wpp
|
2024-09-16 15:01:20 +02:00 |
|
ManyTheFish
|
f13e076b8a
|
Use hashmap instead of Btree in wpp extractor
|
2024-09-16 14:40:40 +02:00 |
|
ManyTheFish
|
7ba49b849e
|
Extract and write facet databases
|
2024-09-16 09:35:16 +02:00 |
|
Clément Renault
|
f7652186e1
|
WIP geo fields
|
2024-09-12 18:01:02 +02:00 |
|
Clément Renault
|
b2f4e67c9a
|
Do not store useless updates
|
2024-09-12 15:38:31 +02:00 |
|
Clément Renault
|
ff5d3b59f5
|
Move the document id extraction to the primary key code
|
2024-09-12 12:01:42 +02:00 |
|
ManyTheFish
|
aa69308e45
|
Use a BufWriter to build word FSTs
|
2024-09-12 11:48:00 +02:00 |
|
ManyTheFish
|
eb9a20ff0b
|
Fix fid_word_docids extraction
|
2024-09-12 11:08:18 +02:00 |
|
Clément Renault
|
3e9198ebaa
|
Support guessing primary key again
|
2024-09-11 17:25:40 +02:00 |
|
Clément Renault
|
2a0ad0982f
|
Fix the document counter
|
2024-09-11 15:59:36 +02:00 |
|
ManyTheFish
|
2b317c681b
|
Build mergers in parallel
|
2024-09-11 11:49:26 +02:00 |
|
ManyTheFish
|
39b5990f64
|
Mutualize tokenization
|
2024-09-11 10:22:38 +02:00 |
|
Clément Renault
|
8287c2644f
|
Support CSV again
|
2024-09-10 21:10:28 +01:00 |
|
Clément Renault
|
c1c44a0b81
|
Impl serialize on TopLevelMap
|
2024-09-10 19:32:03 +01:00 |
|
Clément Renault
|
04596f3616
|
Move the TopLevelMap into a dedicated module
|
2024-09-10 18:01:17 +01:00 |
|
Clément Renault
|
24cb5839ad
|
Move the document changes sorting logic to a new trait
|
2024-09-10 17:37:52 +01:00 |
|
ManyTheFish
|
f69688e8f7
|
Fix several warnings in extractors and remove unreachable macros
|
2024-09-09 14:52:50 +02:00 |
|
Clément Renault
|
8fd0afaaaa
|
Make sure we iterate over the payload documents in order
|
2024-09-06 08:09:08 +02:00 |
|
Clément Renault
|
72c6a21a30
|
Use raw JSON to read the payloads
|
2024-09-05 20:08:23 +02:00 |
|
Clément Renault
|
8412be4a7d
|
Cleanup CowStr and TopLevelMap struct
|
2024-09-05 18:32:55 +02:00 |
|