mixedbread-ai/binary-embeddings

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mixedbread-ai/binary-embeddings)

mixedbread-ai / binary-embeddings

Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster retrieval.

☆19

Alternatives and similar repositories for binary-embeddings

Users that are interested in binary-embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mixedbread-ai / python-sdk
View on GitHub
mixedbread ai python sdk
☆12Jul 1, 2024Updated 2 years ago
mixedbread-ai / wiki_demo_app
View on GitHub
☆14Jun 25, 2024Updated 2 years ago
mixedbread-ai / ofen
View on GitHub
WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included
☆17Oct 2, 2024Updated last year
mixedbread-ai / mxbai-rerank
View on GitHub
Crispy reranking models by Mixedbread
☆52Sep 17, 2025Updated 10 months ago
mixedbread-ai / batched
View on GitHub
The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…
☆161Jul 14, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tmalsburg / llm_surprisal
View on GitHub
Simple tool for generating tokens with open source transformers and/or calculate per-token surprisal.
☆14Jul 10, 2026Updated last week
shauli-ravfogel / descriptions
View on GitHub
☆10May 11, 2024Updated 2 years ago
opensensordotdev / inference
View on GitHub
Rust crate for submitting inference requests to machine learning models
☆15May 24, 2024Updated 2 years ago
badrex / rdf2text
View on GitHub
Generating text from RDF data with sequence to sequence models
☆11Jul 25, 2018Updated 7 years ago
mrmps / ai-chunker
View on GitHub
Chunk your text using gpt4o-mini more accurately
☆44Aug 3, 2024Updated last year
utter-project / fairseq
View on GitHub
This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.
☆21Nov 19, 2024Updated last year
yujie-xing / Neural-Persona-based-Conversation-Model-Python-Version
View on GitHub
A PyTorch re-implementation of the persona-based neural conversation model proposed by Jiwei Li, Michel Galley, Chris Brockett, Georgios …
☆26Apr 30, 2020Updated 6 years ago
embeddings-benchmark / arena
View on GitHub
Code for the MTEB Arena
☆25Jul 2, 2025Updated last year
mixedbread-ai / maxsim-cpu
View on GitHub
☆57Jul 10, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GIScience / ohsome-py
View on GitHub
Python bindings for the ohsome API
☆18Updated this week
shubham0204 / tfidf-summarizer.rs
View on GitHub
Simple, efficient and cross-platform TFIDF-based text summarizer in Rust
☆13Apr 12, 2024Updated 2 years ago
chatopera / node-word2vec
View on GitHub
Word2vec Model Reader for Node.js Client
☆13May 8, 2019Updated 7 years ago
monosans / pyromark
View on GitHub
Blazingly fast Markdown parser for Python written in Rust.
☆43Updated this week
hkust-nlp / SynCSE
View on GitHub
This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"
☆40Jun 9, 2023Updated 3 years ago
epfl-ada / 2022
View on GitHub
Materials for Applied Data Analysis CS-401, Fall 2022
☆26Nov 20, 2023Updated 2 years ago
awni / mlx-examples
View on GitHub
Examples in the MLX framework
☆11Sep 23, 2024Updated last year
Rorical / clip-as-service-rs
View on GitHub
A blazing fast CLIP gRPC service in rust.
☆16Aug 9, 2023Updated 2 years ago
uds-lsv / BERT-LNL
View on GitHub
Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification
☆10May 31, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
wzpan / CV
View on GitHub
My CV
☆11Jan 15, 2022Updated 4 years ago
osoken / sqlitecollections
View on GitHub
Python collections that are backended by sqlite3 DB and are compatible with the built-in collections
☆13Jan 26, 2023Updated 3 years ago
qipeng / wikiextractor
View on GitHub
A tool for extracting plain text from Wikipedia dumps
☆15Sep 13, 2018Updated 7 years ago
nusr / react-esbuild-boilerplate
View on GitHub
An extremely fast react boilerplate
☆11May 21, 2024Updated 2 years ago
tgogos / ocr_greek
View on GitHub
resources, links for OCR & greek
☆11Mar 8, 2021Updated 5 years ago
petabi / petal-neighbors
View on GitHub
Nearest neighbor search algorithms including a ball tree and a vantage point tree.
☆12Jun 16, 2026Updated last month
4AI / langml
View on GitHub
A Keras-based and TensorFlow-backend NLP Models Toolkit.
☆12Jul 7, 2022Updated 4 years ago
speechmatics / ctranslate2_triton_backend
View on GitHub
Triton backend for https://github.com/OpenNMT/CTranslate2
☆35Jul 7, 2023Updated 3 years ago
152334H / 152334H.github.io
View on GitHub
Revamped: Hugo+LoveIt
☆10Jul 14, 2026Updated last week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
JohnyTheCarrot / discord-embed-previewer
View on GitHub
An open-source extension for previewing what your site's embed would look like when your site is linked in Discord.
☆70Dec 7, 2024Updated last year
timofurrer / ariseem
View on GitHub
Minimalistic REST API for wake-on-lan
☆11Nov 1, 2017Updated 8 years ago
JungHoyoun / PromptCompressor
View on GitHub
☆12Apr 29, 2024Updated 2 years ago
nii-yamagishilab / SpeechSPC-mini
View on GitHub
Speech Security and Privacy Compendium - Mini
☆10Jun 18, 2024Updated 2 years ago
MayankFawkes / transfer.sh
View on GitHub
Transfer.sh command line program, Now file sharing from the command line is easy.
☆13Feb 28, 2023Updated 3 years ago
vmonaco / kboc
View on GitHub
Code for submissions to the Keystroke Biometrics Ongoing Competition (KBOC)
☆12Dec 21, 2016Updated 9 years ago
tanmayb123 / BertPreTraining
View on GitHub
☆11Nov 10, 2020Updated 5 years ago