lightonai / ducksearch
Efficient BM25 with DuckDB π¦
β44Updated 3 months ago
Alternatives and similar repositories for ducksearch:
Users that are interested in ducksearch are comparing it to the libraries listed below
- NLP with Rust for Python π¦πβ61Updated 10 months ago
- Tree-based indexes for neural-searchβ30Updated last year
- A library to use `modal` as a backend for `joblib`.β28Updated 2 months ago
- Graph Engine for Exploration and Searchβ40Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.β37Updated last year
- β66Updated 5 months ago
- Use sync mode Playwright interactively, inside a Jupyter notebookβ14Updated last week
- Website for Applied-LLMs workβ22Updated 3 weeks ago
- Locality Sensitive Hashingβ72Updated last year
- hnsw implemented by pythonβ20Updated 5 years ago
- β67Updated 3 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ23Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β80Updated 3 months ago
- Inference engine for GLiNER models, in Rustβ44Updated 2 weeks ago
- Python package for deduplication/entity resolution using active learningβ78Updated 7 months ago
- spaCy entry points for Curated Transformersβ29Updated 6 months ago
- It's a cooler way to store simple linear models.β28Updated 8 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.β34Updated this week
- Pipeline components that support partial_fit.β46Updated 8 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram andβ¦β19Updated 3 weeks ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β128Updated 3 months ago
- Prototyping a question and answer bot over PDFsβ39Updated last year
- Pre-train Static Word Embeddingsβ53Updated this week
- β30Updated 2 years ago
- Time series forecasting with DuckDB and Evidenceβ39Updated 5 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β174Updated 7 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.β108Updated 3 months ago
- A text embedding extension for the Polars Dataframe library.β24Updated 4 months ago
- A text-to-SQL prototype on the northwind sqlite datasetβ12Updated 6 months ago
- Neural Solr = Solr 9 + Mighty Inference + Nodeβ17Updated 2 years ago