mixedbread-ai / wiki_demo_app
☆11Updated 4 months ago
Related projects
Alternatives and complementary repositories for wiki_demo_app
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆13Updated last month
- Late Interaction Models Training & Retrieval☆161Updated 2 weeks ago
- ☆106Updated 3 weeks ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆158Updated 2 months ago
- ☆64Updated this week
- Mixedbread AI Python SDK☆10Updated 4 months ago
- Showcases how mxbai-embed-large-v1 can be used to produce binary embeddings. Binary embeddings enable 32x storage savings and 40x faster r…☆16Updated 7 months ago
- Python API for https://vespa.ai, the open big data serving engine☆101Updated this week
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆182Updated last month
- Generalist and Lightweight Model for Text Classification☆48Updated 2 months ago
- Neural Search☆344Updated 5 months ago
- ☆200Updated 9 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆101Updated 5 months ago
- Minimal PyTorch implementation of BM25 (with sparse tensors)☆88Updated 8 months ago
- A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.☆175Updated 4 months ago
- ☆23Updated 4 months ago
- ☆130Updated 2 weeks ago
- Let's build better datasets, together!☆202Updated 3 months ago
- Distill a Small Static Model from any Sentence Transformer☆389Updated last week
- Structured generation in Rust☆116Updated this week
- A large-scale multilingual dataset for Information Retrieval. Thorough human annotations across 18 diverse languages.☆167Updated 3 months ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆93Updated this week
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆322Updated last year
- Experiments with inference on Llama☆105Updated 5 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆98Updated 9 months ago
- ☆45Updated 2 years ago
- Chunk your text more accurately using gpt4o-mini☆39Updated 3 months ago
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…☆144Updated last month
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆383Updated 9 months ago
- Efficient vector database for hundreds of millions of embeddings.☆200Updated 5 months ago