mixedbread-ai / binary-embeddings
Showcases how mxbai-embed-large-v1 can be used to produce binary embeddings. Binary embeddings enable 32x storage savings and 40x faster retrieval.
☆15 Updated 10 months ago
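The 32x storage figure follows from replacing each 32-bit float with a single bit. The sketch below is not taken from the repository; it assumes a 1024-dimensional embedding, a simple sign threshold (bit = 1 when the value is positive), and random vectors standing in for real model output. The helper names `binarize` and `hamming` are hypothetical.

```python
import random
import struct


def binarize(vec):
    """Quantize floats to bits (1 if value > 0, else 0), packed 8 per byte."""
    out = bytearray((len(vec) + 7) // 8)
    for i, x in enumerate(vec):
        if x > 0:
            out[i // 8] |= 1 << (7 - i % 8)  # most significant bit first
    return bytes(out)


def hamming(a, b):
    """Count differing bits between two equal-length packed vectors."""
    return bin(int.from_bytes(a, "big") ^ int.from_bytes(b, "big")).count("1")


DIM = 1024  # assumed embedding dimension
rng = random.Random(0)
doc = [rng.gauss(0, 1) for _ in range(DIM)]    # stand-in for a document embedding
query = [rng.gauss(0, 1) for _ in range(DIM)]  # stand-in for a query embedding

float_bytes = len(struct.pack(f"{DIM}f", *doc))  # 4 bytes per float32 value
binary_bytes = len(binarize(doc))                # 1 bit per value
print(float_bytes, binary_bytes, float_bytes // binary_bytes)  # 4096 128 32

# Retrieval on binary embeddings ranks candidates by Hamming distance
# (smaller means more similar) instead of a float dot product.
print(hamming(binarize(doc), binarize(query)))
```

Hamming distance reduces to XOR plus a bit count, which is one reason binary retrieval can be much faster than float similarity search; the 40x figure quoted above is the repository's claim and is not reproduced by this sketch.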
Alternatives and similar repositories for binary-embeddings:
Users who are interested in binary-embeddings are comparing it to the repositories listed below:
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included. ☆14 Updated 4 months ago
- Check for data drift between two OpenAI multi-turn chat JSONL files. ☆37 Updated 10 months ago
- Efficient few-shot learning with cross-encoders. ☆48 Updated last year
- Mixedbread AI Python SDK ☆10 Updated 7 months ago
- Pre-train Static Word Embeddings ☆47 Updated 3 weeks ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated 11 months ago
- ☆63 Updated 2 months ago
- A library for simplifying fine-tuning with multi-GPU setups in the Hugging Face ecosystem. ☆16 Updated 3 months ago
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp ☆15 Updated this week
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆67 Updated 4 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆57 Updated 11 months ago
- Lightweight tools for quick and easy LLM demos ☆26 Updated 4 months ago
- Generalist and Lightweight Model for Text Classification ☆65 Updated this week
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆22 Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of … ☆22 Updated 4 months ago
- Training hybrid models for dummies. ☆20 Updated last month
- Creating Generative AI Apps which work ☆16 Updated 7 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆63 Updated last month
- 🤗 Hugging Face Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) ☆17 Updated 11 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆31 Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆25 Updated 3 months ago
- NLP with Rust for Python 🦀🐍 ☆61 Updated 8 months ago
- Latent Large Language Models ☆17 Updated 5 months ago
- ☆18 Updated 4 months ago
- ☆62 Updated 6 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆101 Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module ☆48 Updated 2 weeks ago
- A Python wrapper around Hugging Face's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆34 Updated 2 months ago
- ☆24 Updated last year