lightonai / ducksearch
Efficient BM25 with DuckDB 🦆
☆29Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for ducksearch
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- Tree-based indexes for neural-search☆28Updated 8 months ago
- ☆41Updated last week
- Late Interaction Models Training & Retrieval☆158Updated last week
- Tools to make language models a bit easier to use☆30Updated 2 weeks ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆36Updated 6 months ago
- ☆106Updated 2 weeks ago
- ☆64Updated this week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search☆21Updated 11 months ago
- Website for Applied-LLMs work☆20Updated last month
- Render notebooks like nbviewer, but using Quarto as the renderer☆55Updated 5 months ago
- spaCy entry points for Curated Transformers☆24Updated last month
- Python API for https://vespa.ai, the open big data serving engine☆101Updated this week
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 7 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 9 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆158Updated 2 months ago
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- My personal frontpage app☆78Updated this week
- Have UV deal with all your Jupyter deps.☆18Updated 2 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆88Updated 8 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆75Updated 3 months ago
- Vector Database with support for late interaction and token level embeddings.☆52Updated last month
- This is the repo for the container that holds the models for the text2vec-transformers module☆40Updated last week
- Your buddy in the (L)LM space.☆64Updated last month
- Chrome Extension for exploring Hugging Face datasets 🔎☆47Updated last month
- Neural Solr = Solr 9 + Mighty Inference + Node☆16Updated 2 years ago
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated 7 months ago
- Deployment examples for FastHTML☆29Updated last month
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search☆10Updated last year