mixedbread-ai / binary-embeddings
Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster retrieval.
☆16Updated 7 months ago
Related projects ⓘ
Alternatives and complementary repositories for binary-embeddings
- mixedbread ai python sdk☆10Updated 4 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- ☆48Updated 2 months ago
- Efficiently computing & storing token n-grams from large corpora☆15Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆22Updated 7 months ago
- Training hybrid models for dummies.☆15Updated last week
- Lightweight tools for quick and easy LLM demo's☆25Updated last month
- Efficient few-shot learning with cross-encoders.☆40Updated 8 months ago
- Using short models to classify long texts☆20Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆63Updated 11 months ago
- ☆41Updated 3 months ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 7 months ago
- PyTorch implementation for MRL☆18Updated 8 months ago
- GLiNER model in a FastAPI microservice.☆28Updated 2 weeks ago
- This is the repo for the container that holds the models for the text2vec-transformers module☆40Updated last week
- Vector Database with support for late interaction and token level embeddings.☆52Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆59Updated this week
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆36Updated 6 months ago
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆64Updated 2 weeks ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆158Updated 2 months ago
- Rust bindings for CTranslate2☆13Updated last year
- ☆24Updated 8 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆52Updated last week
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆59Updated last week
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- ☆64Updated this week
- Tools to make language models a bit easier to use☆30Updated 2 weeks ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11Updated 6 months ago