lightonai / ducksearch
Efficient BM25 with DuckDB π¦
β42Updated 2 months ago
Alternatives and similar repositories for ducksearch:
Users that are interested in ducksearch are comparing it to the libraries listed below
- NLP with Rust for Python π¦πβ61Updated 9 months ago
- Tree-based indexes for neural-searchβ29Updated last year
- A library to use `modal` as a backend for `joblib`.β28Updated last month
- Graph Engine for Exploration and Searchβ40Updated last year
- Python package for deduplication/entity resolution using active learningβ76Updated 6 months ago
- Chrome Extension for exploring Hugging Face datasets πβ49Updated 5 months ago
- spaCy entry points for Curated Transformersβ27Updated 5 months ago
- Pre-train Static Word Embeddingsβ48Updated this week
- β58Updated 4 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.β37Updated 11 months ago
- Have UV deal with all your Jupyter deps.β24Updated 6 months ago
- hnsw implemented by pythonβ19Updated 5 years ago
- It's a cooler way to store simple linear models.β28Updated 7 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β79Updated 2 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β124Updated 2 months ago
- β30Updated 2 years ago
- MoodCatπΌ classifies the mood of English sentences.β14Updated 2 years ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ23Updated last year
- utilities for loading and running text embeddings with onnxβ44Updated 7 months ago
- Use sync mode Playwright interactively, inside a Jupyter notebookβ15Updated 3 months ago
- Playing with Python Bluesky SDKβ14Updated 3 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.β59Updated 10 months ago
- Website for Applied-LLMs workβ21Updated last month
- A production-ready, scalable Indexer for the Jina neural search framework, based on HNSW and PSQLβ29Updated 2 years ago
- Just another sentiment wrapper.β17Updated 3 years ago
- Prototyping a question and answer bot over PDFsβ38Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β171Updated 6 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.β102Updated 2 months ago