Clément Renault
|
0c57cf7565
|
Replace obkv with the temporary new version of it
|
2024-08-30 11:53:58 +02:00 |
|
hanbings
|
0a40a98bb6
|
Make milli use edition 2021 (#4770)
* Make milli use edition 2021
* Add lifetime annotations to milli.
* Run cargo fmt
|
2024-07-09 17:25:39 +02:00 |
|
Louis Dureuil
|
0a8f50695e
|
Fixes for Rust v1.79
|
2024-06-13 17:47:44 +02:00 |
|
Louis Dureuil
|
e35ef31738
|
Small changes following review
|
2024-06-13 14:20:48 +02:00 |
|
Louis Dureuil
|
3bc8f81abc
|
user_provided => regenerate
|
2024-06-12 18:12:20 +02:00 |
|
Louis Dureuil
|
a89eea233b
|
Fix vectors injection
|
2024-06-12 17:10:19 +02:00 |
|
Louis Dureuil
|
d1dd7e5d09
|
In transform for removed embedders, write back their user provided vectors in documents, and clear the writers
|
2024-06-12 14:50:55 +02:00 |
|
Clément Renault
|
b81953a65d
|
Add a span for the prepare_for_documents_reindexing
|
2024-06-05 17:30:07 +02:00 |
|
Clément Renault
|
1b639ce44b
|
Reduce the number of complex calls to settings diff functions
|
2024-06-05 17:30:07 +02:00 |
|
Clément Renault
|
87cf8a3c94
|
Introduce a new way to determine the operations to perform on the fields
|
2024-06-05 17:30:07 +02:00 |
|
Clément Renault
|
0c6e4b2f00
|
Introducing a new into_del_add_obkv_conditional_operation function
|
2024-06-05 17:30:07 +02:00 |
|
Clément Renault
|
dc949ab46a
|
Remove puffin usage
|
2024-05-27 15:59:14 +02:00 |
|
Clément Renault
|
fe17c0f52e
|
Construct the minimal OBKVs according to the settings diff
|
2024-05-23 11:23:57 +02:00 |
|
Clément Renault
|
bc5663e673
|
FieldIdsMap no longer useful thanks to #4631
|
2024-05-22 16:06:15 +02:00 |
|
Clément Renault
|
500ddc76b5
|
Make the flattened sorter optional
|
2024-05-21 16:16:36 +02:00 |
|
Clément Renault
|
1aa8ed9ef7
|
Make the original sorter optional
|
2024-05-21 14:53:26 +02:00 |
|
ManyTheFish
|
3acfab2eb7
|
Fix PR comments
|
2024-04-17 10:55:51 +02:00 |
|
ManyTheFish
|
87a93ba47d
|
fix clippy
|
2024-04-16 14:39:30 +02:00 |
|
ManyTheFish
|
02c3d6b265
|
finish work
|
2024-04-16 14:39:06 +02:00 |
|
ManyTheFish
|
b5e4a55af6
|
refactor faceted and searchable pipeline
|
2024-04-16 14:39:06 +02:00 |
|
ManyTheFish
|
893200ab87
|
Avoid clearing documents in transform
|
2024-04-16 14:39:06 +02:00 |
|
Louis Dureuil
|
5d7061682e
|
Add tracing to milli
|
2024-02-08 15:03:31 +01:00 |
|
Clément Renault
|
d32eb11329
|
Move to the v0.20.0-alpha.9 of heed
|
2023-11-27 11:52:22 +01:00 |
|
Clément Renault
|
0d4482625a
|
Make the changes to use heed v0.20-alpha.6
|
2023-11-23 11:43:58 +01:00 |
|
ManyTheFish
|
d3575fb028
|
Make into_del_add_obkv parameters more human readable
|
2023-11-20 16:10:39 +01:00 |
|
Louis Dureuil
|
772964125d
|
Factor removal of document from DB
|
2023-11-13 13:51:22 +01:00 |
|
Louis Dureuil
|
264b10ec20
|
Fixup documentation
|
2023-11-09 16:23:20 +01:00 |
|
Louis Dureuil
|
3053e01c05
|
Batch::remove_documents_from_db_no_batch
|
2023-11-09 14:23:02 +01:00 |
|
Louis Dureuil
|
1ad1fcc8c8
|
Remove all warnings
|
2023-11-06 10:31:14 +01:00 |
|
ManyTheFish
|
bf0651f23c
|
Implement iter method on ExternalDocumentsIds
|
2023-11-02 15:38:00 +01:00 |
|
ManyTheFish
|
5b20e625f3
|
fix merge
|
2023-11-02 15:31:37 +01:00 |
|
ManyTheFish
|
bc51d6157a
|
Fix transform reindexing path
|
2023-11-02 15:26:20 +01:00 |
|
ManyTheFish
|
12323d610e
|
Change the original document sorter key from the internal docid to a concatenation of the internal and the external docid
|
2023-11-02 15:26:20 +01:00 |
|
Clément Renault
|
4d864f0702
|
Always sort internal Sorter entries in parallel
|
2023-11-02 14:47:43 +01:00 |
|
Clément Renault
|
c71b1d33ae
|
Sort entries using rayon in the transform sorters
|
2023-11-01 11:07:16 +01:00 |
|
Clément Renault
|
0fc446c62f
|
Add more timing logs to the Transform
|
2023-11-01 11:07:16 +01:00 |
|
Louis Dureuil
|
de10f20732
|
Fix field distribution again
|
2023-10-30 17:47:22 +01:00 |
|
Louis Dureuil
|
54d07a8da3
|
Update field distribution taking into account both deletions and additions
|
2023-10-30 14:47:51 +01:00 |
|
Clément Renault
|
dfab6293c9
|
Use an LMDB database to store the external documents ids
|
2023-10-30 11:41:23 +01:00 |
|
Louis Dureuil
|
113527f466
|
Remove soft-deleted related methods from Index
|
2023-10-30 11:41:22 +01:00 |
|
Louis Dureuil
|
c6b3c18c85
|
WIP: Comment out document deletion in other pipelines than update
TODO: fix calls to DELETE route
|
2023-10-30 11:40:20 +01:00 |
|
ManyTheFish
|
313b16bec2
|
Support diff indexing on extract_docid_word_positions
|
2023-10-30 11:24:19 +01:00 |
|
ManyTheFish
|
1dd97578a8
|
Make the transform struct return diff-based documents obkvs
|
2023-10-30 11:22:07 +01:00 |
|
Tamo
|
c0f2724c2d
|
get rid of the newly introduced error code in favor of an io::Error
|
2023-10-10 15:12:23 +02:00 |
|
Tamo
|
d772073dfa
|
use a bufreader every time there is a grenad<file>
|
2023-10-10 15:00:30 +02:00 |
|
Kerollmops
|
eef95de30e
|
First iteration on exposing puffin profiling
|
2023-07-18 17:38:13 +02:00 |
|
Tamo
|
602ad98cb8
|
improve the way we handle the fsts
|
2023-05-22 11:15:14 +02:00 |
|
Tamo
|
4391cba6ca
|
fix the addition + deletion bug
|
2023-05-17 18:28:57 +02:00 |
|
Tamo
|
895ab2906c
|
apply review suggestions
|
2023-02-16 18:42:47 +01:00 |
|
Tamo
|
74dcfe9676
|
Fix a bug when you update a document that was already present in the db, deleted and then inserted again in the same transform
|
2023-02-14 19:09:40 +01:00 |
|