lightonai / ducksearchLinks
Efficient BM25 with DuckDB π¦
β49Updated 5 months ago
Alternatives and similar repositories for ducksearch
Users that are interested in ducksearch are comparing it to the libraries listed below
Sorting:
- NLP with Rust for Python π¦πβ62Updated 3 weeks ago
- Plug-and-play document processing pipelines with zero-shot models.β64Updated 3 weeks ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.β37Updated last year
- Pre-train Static Word Embeddingsβ70Updated this week
- Tree-based indexes for neural-searchβ32Updated last year
- A library to use `modal` as a backend for `joblib`.β28Updated 4 months ago
- β70Updated 6 months ago
- spaCy entry points for Curated Transformersβ31Updated last week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ26Updated last year
- β30Updated 2 years ago
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Updated last year
- β57Updated 2 weeks ago
- Python package for extractive NLP using the OpenAI APIβ17Updated 9 months ago
- Use sync mode Playwright interactively, inside a Jupyter notebookβ14Updated 2 months ago
- Graph Engine for Exploration and Searchβ42Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β136Updated last week
- β70Updated 5 months ago
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ99Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β31Updated 9 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.β55Updated 2 weeks ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progrβ¦β31Updated last month
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β108Updated last year
- Small python package to measure OCR quality and other related metrics.β22Updated last year
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, includingβ¦β54Updated last month
- Source code and data for Like a Good Nearest Neighborβ29Updated 4 months ago
- Library for fast text representation and classification.β28Updated last year
- Python package for deduplication/entity resolution using active learningβ80Updated 9 months ago
- Website for Applied-LLMs workβ27Updated last month
- Vector Database with support for late interaction and token level embeddings.β54Updated 8 months ago
- Locality Sensitive Hashingβ71Updated last year