AnswerDotAI/ModernBERT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AnswerDotAI/ModernBERT)

AnswerDotAI / ModernBERT

Bringing BERT into modernity via both architecture changes and scaling

☆1,701

Alternatives and similar repositories for ModernBERT

Users that are interested in ModernBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lightonai / pylate
View on GitHub
Late Interaction Models Training & Retrieval
☆875Jul 13, 2026Updated last week
AnswerDotAI / rerankers
View on GitHub
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,624Dec 20, 2025Updated 7 months ago
McGill-NLP / llm2vec
View on GitHub
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
☆1,704Apr 4, 2026Updated 3 months ago
nomic-ai / contrastors
View on GitHub
Train Models Contrastively in Pytorch
☆798Mar 26, 2025Updated last year
huggingface / setfit
View on GitHub
Efficient few-shot learning with Sentence Transformers
☆2,772May 26, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
s-smits / modernbert-finetune
View on GitHub
Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.
☆74Jan 16, 2026Updated 6 months ago
AnswerDotAI / RAGatouille
View on GitHub
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,939May 17, 2025Updated last year
xhluca / bm25s
View on GitHub
Fast BM25 search in Python, powered by Numpy and Numba
☆1,740Jul 7, 2026Updated 2 weeks ago
chandar-lab / NeoBERT
View on GitHub
☆108Jun 2, 2025Updated last year
stanford-futuredata / ColBERT
View on GitHub
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
☆3,902Oct 14, 2025Updated 9 months ago
FlagOpen / FlagEmbedding
View on GitHub
Retrieval and Retrieval-augmented LLMs
☆11,955Apr 22, 2026Updated 2 months ago
argilla-io / distilabel
View on GitHub
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆3,334Jul 13, 2026Updated last week
MinishLab / model2vec
View on GitHub
Fast State-of-the-Art Static Embeddings
☆2,159Jun 6, 2026Updated last month
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆18,892Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
View on GitHub
☆53Feb 10, 2025Updated last year
AnswerDotAI / fastkmeans
View on GitHub
☆101Jul 4, 2025Updated last year
jxmorris12 / cde
View on GitHub
code for training & evaluating Contextual Document Embedding models
☆207May 14, 2025Updated last year
MinishLab / semhash
View on GitHub
Fast Multimodal Semantic Deduplication & Filtering
☆946May 24, 2026Updated last month
embeddings-benchmark / mteb
View on GitHub
MTEB: State-of-the-art evaluation of embeddings across languages and modalities
☆3,363Updated this week
stanfordnlp / dspy
View on GitHub
DSPy: The framework for programming—not prompting—language models
☆36,252Updated this week
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆24,497Updated this week
meta-pytorch / torchtune
View on GitHub
PyTorch native post-training library
☆5,784Updated this week
huggingface / peft
View on GitHub
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆21,415Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
urchade / GLiNER
View on GitHub
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts)
☆3,409Updated this week
Knowledgator / FlashDeBERTa
View on GitHub
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆90Feb 10, 2026Updated 5 months ago
dottxt-ai / outlines
View on GitHub
Structured Outputs
☆14,573Updated this week
huggingface / nanotron
View on GitHub
Minimalistic large language model 3D-parallelism training
☆2,755May 26, 2026Updated last month
facebookresearch / blt
View on GitHub
Code for BLT research paper
☆2,052Nov 3, 2025Updated 8 months ago
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,246Jun 17, 2026Updated last month
bitsandbytes-foundation / bitsandbytes
View on GitHub
Accessible large language models via k-bit quantization for PyTorch.
☆8,333Updated this week
huggingface / datatrove
View on GitHub
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
☆3,214Updated this week
JHU-CLSP / ettin-encoder-vs-decoder
View on GitHub
State-of-the-art paired encoder and decoder models (17M-1B params)
☆74Aug 6, 2025Updated 11 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
huggingface / sentence-transformers
View on GitHub
State-of-the-Art Embeddings, Retrieval, and Reranking
☆18,923Updated this week
JHU-CLSP / mmBERT
View on GitHub
A massively multilingual modern encoder language model
☆145Jan 20, 2026Updated 6 months ago
huggingface / lighteval
View on GitHub
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆2,486Jun 29, 2026Updated 3 weeks ago
axolotl-ai-cloud / axolotl
View on GitHub
Go ahead and axolotl questions
☆12,219Updated this week
mixedbread-ai / mxbai-rerank
View on GitHub
Crispy reranking models by Mixedbread
☆52Sep 17, 2025Updated 10 months ago
huggingface / text-generation-inference
View on GitHub
Large Language Model Text Generation Inference
☆10,876Mar 21, 2026Updated 3 months ago
sgl-project / sglang
View on GitHub
SGLang is a high-performance serving framework for large language models and multimodal models.
☆30,545Updated this week