lightonai / ducksearch
Efficient BM25 with DuckDB 🦆
★61 · Dec 20, 2024 · Updated last year
Alternatives and similar repositories for ducksearch
Users who are interested in ducksearch compare it to the libraries listed below.
- Django plugin for online machine learning with river (under development) · ★15 · Dec 25, 2023 · Updated 2 years ago
- Novelty detection for data streams in Python · ★13 · Aug 20, 2024 · Updated last year
- WIP · ★36 · Jul 29, 2024 · Updated last year
- Neural Search · ★367 · Mar 11, 2025 · Updated 11 months ago
- Autoregressive Bayesian linear model · ★21 · Sep 10, 2020 · Updated 5 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies · ★27 · Nov 18, 2025 · Updated 2 months ago
- 🚲 Git scraping for bike sharing APIs · ★31 · Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… · ★156 · Jul 14, 2025 · Updated 7 months ago
- 4th place solution to datafactory challenge by Intermarché. · ★12 · Jun 28, 2021 · Updated 4 years ago
- Measuring the accuracy of BBC weather forecasts in Honolulu, USA · ★12 · Jul 10, 2021 · Updated 4 years ago
- Label shift estimation for transfer difficulty with Familiarity. · ★10 · Feb 4, 2025 · Updated last year
- Combining encoder-based language models · ★11 · Nov 11, 2021 · Updated 4 years ago
- PyLate efficient inference engine · ★71 · Jan 7, 2026 · Updated last month
- Tree-based indexes for neural-search · ★31 · Mar 4, 2024 · Updated last year
- Plug-and-play document AI with zero-shot models. · ★123 · Feb 7, 2026 · Updated last week
- Extra functionalities for river · ★14 · May 15, 2024 · Updated last year
- Demo application containing fullstack solution (frontend + backend) APIs in pure Golang. · ★18 · Dec 5, 2024 · Updated last year
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020 · ★14 · Oct 6, 2020 · Updated 5 years ago
- Late Interaction Models Training & Retrieval · ★701 · Updated this week
- Explain why metrics change by unpacking them · ★40 · Jan 16, 2026 · Updated 3 weeks ago
- ★42 · Apr 22, 2025 · Updated 9 months ago
- bm25 is a scoring function that helps with information retrieval · ★14 · Sep 17, 2020 · Updated 5 years ago
- Lets Python do AB testing analysis. · ★78 · Apr 15, 2025 · Updated 9 months ago
- ★91 · Jul 4, 2025 · Updated 7 months ago
- Inference code in PyTorch for GPT-like models, such as PAGnol, a family of models with up to 1.5B parameters, trained on datasets in Fren… · ★20 · Oct 18, 2022 · Updated 3 years ago
- SMIT: A Simple Modality Integration Tool · ★15 · Mar 31, 2024 · Updated last year
- A serverless DuckDB deployment at GCP · ★41 · Aug 30, 2022 · Updated 3 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity · ★19 · Mar 26, 2025 · Updated 10 months ago
- Extract, parse and populate templates from strings · ★27 · Apr 4, 2019 · Updated 6 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec · ★19 · Dec 18, 2022 · Updated 3 years ago
- Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition" · ★19 · Nov 3, 2020 · Updated 5 years ago
- ★28 · Updated this week
- Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations · ★45 · Mar 6, 2023 · Updated 2 years ago
- Fast and incremental explanations for online machine learning models. Works best with the river framework. · ★55 · Dec 26, 2024 · Updated last year
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️ · ★35 · May 13, 2022 · Updated 3 years ago
- Self-contained demo using Kafka, Materialize and Metabase to check what's streaming on Twitch. All you need is Docker and Twitch access t… · ★25 · Mar 22, 2022 · Updated 3 years ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval · ★60 · Jun 20, 2024 · Updated last year
- Online machine learning methods · ★22 · Sep 29, 2021 · Updated 4 years ago
- Search and lookup API for the JUDILIBRE platform. · ★21 · Dec 5, 2025 · Updated 2 months ago