Louis Dureuil
|
b11df7ec34
|
Meilisearch: fix some wrong spans
|
2024-03-05 10:11:43 +01:00 |
|
ManyTheFish
|
03bb6372af
|
Replace is_batchable_with with mergeable_with
|
2024-02-14 11:50:22 +01:00 |
|
ManyTheFish
|
48026aa75c
|
fix PR comments
|
2024-02-13 15:19:01 +01:00 |
|
ManyTheFish
|
be1b054b05
|
Compute chunk size based on the input data size and the number of indexing threads
|
2024-02-08 17:28:37 +01:00 |
|
Louis Dureuil
|
db722d201a
|
Write entries into database downgraded to trace level
|
2024-02-08 15:04:05 +01:00 |
|
Tamo
|
e773dfa9ba
|
get rid of logs in milli and add logs for the bucket sort
|
2024-02-08 15:04:05 +01:00 |
|
Louis Dureuil
|
5d7061682e
|
Add tracing to milli
|
2024-02-08 15:03:31 +01:00 |
|
Clément Renault
|
01e2c3d6bb
|
Bump arroy to v0.2.0
|
2024-01-16 16:45:55 +01:00 |
|
Clément Renault
|
3ee7682fa7
|
Fix some integer comparisons
|
2024-01-16 15:22:23 +01:00 |
|
Louis Dureuil
|
12940d79a9
|
WIP
- manual embedder
- multi embedders OK
- clippy + tests OK
|
2023-12-14 16:08:41 +01:00 |
|
Louis Dureuil
|
922a640188
|
WIP multi embedders
fixed template bugs
|
2023-12-14 16:08:41 +01:00 |
|
Louis Dureuil
|
65e49b7092
|
Remove stuff, add distribution shift (WIP)
|
2023-12-14 16:08:38 +01:00 |
|
Louis Dureuil
|
fb539f61fe
|
WIP
|
2023-12-14 16:07:49 +01:00 |
|
Louis Dureuil
|
cb4ebe163e
|
WIP
|
2023-12-14 16:07:49 +01:00 |
|
Louis Dureuil
|
dde3a04679
|
WIP arroy integration
|
2023-12-14 16:07:49 +01:00 |
|
Louis Dureuil
|
13c2c6c16b
|
Small commit to add hybrid search and autoembedding
|
2023-12-14 16:07:48 +01:00 |
|
Clément Renault
|
d32eb11329
|
Move to the v0.20.0-alpha.9 of heed
|
2023-11-27 11:52:22 +01:00 |
|
Clément Renault
|
0d4482625a
|
Make the changes to use heed v0.20-alpha.6
|
2023-11-23 11:43:58 +01:00 |
|
ManyTheFish
|
ebef6bc24d
|
Simplify documents database writing
|
2023-11-20 10:14:57 +01:00 |
|
ManyTheFish
|
263e825619
|
Fix typos in comments
|
2023-11-20 10:06:29 +01:00 |
|
ManyTheFish
|
70ce40828c
|
Compute word docids prefix cache
|
2023-11-08 17:01:00 +01:00 |
|
Louis Dureuil
|
cbaa54cafd
|
Fix clippy issues
|
2023-11-06 11:19:31 +01:00 |
|
Clément Renault
|
ff522c919d
|
Fix the vector extractions for the diff indexing
|
2023-11-02 15:58:08 +01:00 |
|
ManyTheFish
|
1b4ff991c0
|
update typed chunks
|
2023-11-02 15:26:20 +01:00 |
|
Clément Renault
|
dfab6293c9
|
Use an LMDB database to store the external documents ids
|
2023-10-30 11:41:23 +01:00 |
|
Louis Dureuil
|
fdf3f7f627
|
Fix facet distribution test
|
2023-10-30 11:41:23 +01:00 |
|
Louis Dureuil
|
6260cff65f
|
Actually delete documents from DB when the merge function says so
|
2023-10-30 11:41:22 +01:00 |
|
Louis Dureuil
|
85f42fbc03
|
Handle external to internal id mapping from TypedChunk::Documents
|
2023-10-30 11:40:20 +01:00 |
|
Louis Dureuil
|
946c762d28
|
WIP: reset documents in TypedChunk::Documents
|
2023-10-30 11:40:20 +01:00 |
|
Louis Dureuil
|
cda6ca1ee6
|
Remove TypedChunk::NewDocumentIds
|
2023-10-30 11:40:18 +01:00 |
|
Louis Dureuil
|
696fcf4d18
|
Fix document insertion into LMDB
|
2023-10-30 11:39:31 +01:00 |
|
Kerollmops
|
77dcbff6b2
|
Remove and Insert the DelAdd geo points
|
2023-10-30 11:39:31 +01:00 |
|
Louis Dureuil
|
59f88c14b3
|
Simplify facet update after removing Index::faceted_documents_ids
|
2023-10-30 11:39:29 +01:00 |
|
Clément Renault
|
560e8f5613
|
Introduce the CboRoaringBitmapCodec merge_deladd_into and use it
|
2023-10-30 11:34:55 +01:00 |
|
Clément Renault
|
2d3f15f82c
|
Introduce a function to only serialize the Add side of a DelAdd obkv
|
2023-10-30 11:34:55 +01:00 |
|
Clément Renault
|
40186bf403
|
Rename FieldIdWordCountDocids correctly
|
2023-10-30 11:34:50 +01:00 |
|
ManyTheFish
|
2597bbd107
|
Make the script language docids map take a tuple of roaring bitmaps expressing the deletions and the additions
|
2023-10-30 11:34:00 +01:00 |
|
ManyTheFish
|
313b16bec2
|
Support diff indexing on extract_docid_word_positions
|
2023-10-30 11:24:19 +01:00 |
|
ManyTheFish
|
1c5705c164
|
clean PR warnings
|
2023-10-30 11:22:05 +01:00 |
|
ManyTheFish
|
df9e5c8651
|
Generalize usage of CboRoaringBitmap codec to ease the use
|
2023-10-30 11:15:02 +01:00 |
|
ManyTheFish
|
748b333161
|
Add useful debug assert before key insertion in database
|
2023-10-30 11:13:10 +01:00 |
|
ManyTheFish
|
17b647dfe5
|
Wip
|
2023-10-30 11:13:08 +01:00 |
|
Tamo
|
d772073dfa
|
use a bufreader every time there is a grenad<file>
|
2023-10-10 15:00:30 +02:00 |
|
ManyTheFish
|
b45c36cd71
|
Merge branch 'main' into tmp-release-v1.3.0
|
2023-08-01 15:05:17 +02:00 |
|
Kerollmops
|
29ab54b259
|
Replace the hnsw crate by the instant-distance one
|
2023-07-25 12:37:35 +02:00 |
|
Kerollmops
|
eef95de30e
|
First iteration on exposing puffin profiling
|
2023-07-18 17:38:13 +02:00 |
|
Clément Renault
|
30741d17fa
|
Change the TODO message
|
2023-06-27 12:32:43 +02:00 |
|
Kerollmops
|
3e3c743392
|
Make Rustfmt happy
|
2023-06-27 12:32:41 +02:00 |
|
Kerollmops
|
ab9f2269aa
|
Normalize the vectors during indexation and search
|
2023-06-27 12:32:41 +02:00 |
|
Kerollmops
|
321ec5f3fa
|
Accept multiple vectors per document using the _vectors field
|
2023-06-27 12:32:40 +02:00 |
|