lightonai / ducksearch
Efficient BM25 with DuckDB 🦆
☆29Updated last month
Related projects
Alternatives and complementary repositories for ducksearch
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- Tree-based indexes for neural-search☆28Updated 8 months ago
- ☆45Updated 2 weeks ago
- Render notebooks like nbviewer, but using Quarto as the renderer☆56Updated 6 months ago
- ☆66Updated this week
- ☆112Updated this week
- Late Interaction Models Training & Retrieval☆166Updated this week
- minimal pytorch implementation of bm25 (with sparse tensors)☆90Updated 8 months ago
- It's a cooler way to store simple linear models.☆28Updated 4 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆36Updated 7 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆75Updated 4 months ago
- Tools to make language models a bit easier to use☆30Updated last week
- Have UV deal with all your Jupyter deps.☆18Updated 2 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆98Updated 10 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- Track OpenAI compatible requests to a dataset☆57Updated this week
- Website for Applied-LLMs work☆20Updated last month
- Python API for https://vespa.ai, the open big data serving engine☆105Updated this week
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆162Updated 2 months ago
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- A Chrome extension that saves conversations with Claude to GitHub Gists or your clipboard.☆64Updated 3 weeks ago
- HNSW implemented in Python☆19Updated 4 years ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated 2 months ago
- spaCy entry points for Curated Transformers☆25Updated last month
- Efficiently computing & storing token n-grams from large corpora☆15Updated last month
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 8 months ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆21Updated 11 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 9 months ago
- Using various instructor clients to evaluate the quality and capabilities of extractions and reasoning.☆47Updated last month