Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster retrieval.
☆19Mar 23, 2024Updated 2 years ago
Alternatives and similar repositories for binary-embeddings
Users that are interested in binary-embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- mixedbread ai python sdk☆12Jul 1, 2024Updated last year
- ☆14Jun 25, 2024Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Oct 2, 2024Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆210Aug 31, 2024Updated last year
- Crispy reranking models by Mixedbread☆51Sep 17, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for "Binary embedding based retrieval at Tencent"☆45Mar 7, 2024Updated 2 years ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 10 months ago
- ☆10May 11, 2024Updated 2 years ago
- Semantically Search Emojis From the Command Line!☆13Nov 26, 2023Updated 2 years ago
- LLM-only topic extraction and classification☆11Sep 20, 2024Updated last year
- Simple tool for generating tokens with open source transformers and/or calculate per-token surprisal.☆14Apr 15, 2026Updated last month
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- Leveraging LLMs for Post-OCR Correction of Historical Newspapers☆17May 12, 2026Updated last week
- Code for the MTEB Arena☆24Jul 2, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A PyTorch re-implementation of the persona-based neural conversation model proposed by Jiwei Li, Michel Galley, Chris Brockett, Georgios …☆26Apr 30, 2020Updated 6 years ago
- This repo provides methods for building and evaluating Retrieval Augmented Generation (RAG) systems.☆18Sep 25, 2024Updated last year
- Python bindings for the ohsome API☆18Mar 12, 2026Updated 2 months ago
- ☆10Oct 2, 2024Updated last year
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]