Showcases how mxbai-embed-large-v1 can be used to produce binary embeddings. Binary embeddings enable 32x storage savings and 40x faster retrieval.
☆19 · Mar 23, 2024 · Updated 2 years ago
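The 32x storage figure follows from replacing each 32-bit float dimension with a single bit. Below is a minimal sketch of that idea, not the repository's actual code: it assumes 1024-dimensional float32 embeddings, uses random vectors as a stand-in for real model output, and binarizes by thresholding at zero. The helper names `binarize` and `hamming_distance` are hypothetical, chosen for illustration only.

```python
import numpy as np


def binarize(embeddings: np.ndarray) -> np.ndarray:
    """Keep one bit per dimension (1 if the value is > 0) and pack 8 bits per byte."""
    bits = (embeddings > 0).astype(np.uint8)
    return np.packbits(bits, axis=-1)


def hamming_distance(query: np.ndarray, corpus: np.ndarray) -> np.ndarray:
    """Count differing bits between one packed query and each packed corpus row."""
    xor = np.bitwise_xor(corpus, query)
    return np.unpackbits(xor, axis=-1).sum(axis=-1)


# Assumption: random float32 vectors stand in for real embeddings.
rng = np.random.default_rng(0)
float_embeddings = rng.standard_normal((4, 1024)).astype(np.float32)

packed = binarize(float_embeddings)

# 1024 dims * 4 bytes = 4096 bytes per float vector; 1024 bits = 128 bytes packed.
storage_ratio = float_embeddings.nbytes / packed.nbytes

# Distances from the first vector to every vector, itself included.
distances = hamming_distance(packed[0], packed)
```

Retrieval over packed vectors then reduces to XOR and bit counting, which is where the speedup comes from; this sketch does not measure speed and makes no claim about the 40x figure.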
Alternatives and similar repositories for binary-embeddings
Users that are interested in binary-embeddings are comparing it to the libraries listed below
- mixedbread ai python sdk ☆12 · Jul 1, 2024 · Updated last year
- ☆14 · Jun 25, 2024 · Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆209 · Aug 31, 2024 · Updated last year
- Crispy reranking models by Mixedbread ☆50 · Sep 17, 2025 · Updated 6 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆159 · Jul 14, 2025 · Updated 8 months ago
- Semantically Search Emojis From the Command Line! ☆13 · Nov 26, 2023 · Updated 2 years ago
- Simple tool for generating tokens with open source transformers and/or calculate per-token surprisal. ☆14 · May 23, 2025 · Updated 10 months ago
- Rust crate for submitting inference requests to machine learning models ☆15 · May 24, 2024 · Updated last year
- Generating text from RDF data with sequence to sequence models ☆12 · Jul 25, 2018 · Updated 7 years ago
- Leveraging LLMs for Post-OCR Correction of Historical Newspapers ☆15 · Jun 20, 2024 · Updated last year
- Chunk your text using gpt4o-mini more accurately ☆44 · Aug 3, 2024 · Updated last year
- Code for the MTEB Arena ☆24 · Jul 2, 2025 · Updated 8 months ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models ☆11 · Nov 4, 2025 · Updated 4 months ago
- This repo provides methods for building and evaluating Retrieval Augmented Generation (RAG) systems. ☆18 · Sep 25, 2024 · Updated last year
- ☆10 · Oct 2, 2024 · Updated last year
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24] ☆43 · Oct 7, 2024 · Updated last year
- Examples in the MLX framework ☆11 · Sep 23, 2024 · Updated last year
- Blazingly fast Markdown parser for Python written in Rust. ☆40 · Mar 16, 2026 · Updated last week
- ComfyUI-Direct3D‑S2 is now available in ComfyUI, Direct3D‑S2 - Gigascale 3D Generation Made Easy with Spatial Sparse Attention. Direct3D‑… ☆17 · Jun 10, 2025 · Updated 9 months ago
- ☆13 · Apr 25, 2025 · Updated 10 months ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm. ☆16 · May 18, 2023 · Updated 2 years ago
- Word2vec Model Reader for Node.js Client ☆13 · May 8, 2019 · Updated 6 years ago
- My CV ☆12 · Jan 15, 2022 · Updated 4 years ago
- Flexible and transparent Python Boruta implementation ☆15 · Jun 8, 2025 · Updated 9 months ago
- Neural Solr = Solr 9 + Mighty Inference + Node ☆18 · Jun 9, 2022 · Updated 3 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages. ☆36 · Feb 5, 2026 · Updated last month
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆48 · Sep 18, 2024 · Updated last year
- ☆12 · Apr 29, 2024 · Updated last year
- A Keras-based and TensorFlow-backend NLP Models Toolkit. ☆12 · Jul 7, 2022 · Updated 3 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe… ☆18 · Jun 24, 2022 · Updated 3 years ago
- Revamped: Hugo+LoveIt ☆10 · Mar 14, 2026 · Updated last week
- Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models" ☆17 · Mar 29, 2024 · Updated last year
- Transfer.sh command line program, Now file sharing from the command line is easy. ☆13 · Feb 28, 2023 · Updated 3 years ago
- Speech Security and Privacy Compendium - Mini ☆10 · Jun 18, 2024 · Updated last year
- ☆11 · Nov 10, 2020 · Updated 5 years ago
- ☆10 · Oct 15, 2019 · Updated 6 years ago
- Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification ☆10 · May 31, 2022 · Updated 3 years ago
- An open-source extension for previewing what your site's embed would look like when your site is linked in Discord. ☆65 · Dec 7, 2024 · Updated last year
- Triton backend for https://github.com/OpenNMT/CTranslate2 ☆35 · Jul 7, 2023 · Updated 2 years ago