A massively multilingual modern encoder language model
☆136Jan 20, 2026Updated 2 months ago
Alternatives and similar repositories for mmBERT
Users that are interested in mmBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 8 months ago
- Model implementation for the contextual embeddings project☆43Jun 2, 2025Updated 9 months ago
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated 10 months ago
- ☆17Jan 31, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.☆74Jan 16, 2026Updated 2 months ago
- POSIX: A Prompt Sensitivity Index for Language Models☆13Nov 13, 2024Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 2 years ago
- Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)☆69Sep 30, 2025Updated 5 months ago
- ☆24Jan 30, 2025Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆83Feb 10, 2026Updated last month
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆108Jun 2, 2025Updated 9 months ago
- Literature 📄 and datasets 📚 on automatic populism detection☆19Mar 15, 2025Updated last year
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆40Feb 7, 2026Updated last month
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆20Mar 31, 2025Updated 11 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- Repository for the course "From Embeddings to Transformers: Advanced Text Analysis with Python"☆28Sep 26, 2025Updated 6 months ago
- ☆54Oct 13, 2025Updated 5 months ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for paper ”Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability“☆15Jun 13, 2023Updated 2 years ago
- Semantic Search using FAISS & ElasticSearch☆31Jun 4, 2020Updated 5 years ago
- Test-time compute in information retrieval☆54Jul 8, 2025Updated 8 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Nov 13, 2023Updated 2 years ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 7 months ago
- Query Expension for Better Query Embedding using LLMs☆68Feb 18, 2025Updated last year
- ☆10Oct 2, 2024Updated last year
- R package to wrap the Deutsche Bahn Fahrplan API☆17Feb 4, 2024Updated 2 years ago
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆38Oct 1, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆52Jun 21, 2025Updated 9 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆37Oct 16, 2025Updated 5 months ago
- This repository helps you evaluate your models on the FreshStack benchmark!☆34Dec 9, 2025Updated 3 months ago
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 5 years ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆73Dec 30, 2025Updated 2 months ago
- 🤝 Trade any tensors over the network☆31Sep 27, 2023Updated 2 years ago
- YAST - Yet Another SPLADE or Sparse Trainer☆21Jun 16, 2025Updated 9 months ago