Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.
☆74Jan 16, 2026Updated 4 months ago
Alternatives and similar repositories for modernbert-finetune
Users that are interested in modernbert-finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A massively multilingual modern encoder language model☆140Jan 20, 2026Updated 4 months ago
- Pre-train Static Word Embeddings☆104Updated this week
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆87Feb 10, 2026Updated 3 months ago
- Library for evaluating RAG using Nuclia's models☆18Jul 31, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Using modal.com to process FineWeb-edu data☆20Apr 11, 2026Updated last month
- Bringing BERT into modernity via both architecture changes and scaling☆1,677Mar 1, 2026Updated 2 months ago
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated last month
- BERT&RoBERTa预训练代码,tensorflow和torch两种版本实现☆13Feb 8, 2023Updated 3 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13May 22, 2025Updated last year
- This repository contains the training and evaluation code for llm-jp-modernbert-base.☆17Jun 17, 2025Updated 11 months ago
- ☆17Feb 12, 2025Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- NOAH's Corpus: Part-of-Speech Tagging for Swiss German☆12Jan 6, 2023Updated 3 years ago
- MLX binary vectors and associated algorithms.☆14Mar 13, 2025Updated last year
- Multilingual RAG benchmark.☆10Nov 22, 2024Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆151Jan 7, 2026Updated 4 months ago
- alternative way to calculating self attention☆18May 25, 2024Updated 2 years ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Aug 20, 2025Updated 9 months ago
- German lemmatization with IWNLP as extension for spaCy☆27Apr 13, 2026Updated last month
- GERNERMED++ is a transfer-learning-based open neural NER model for medical entities designed for German data.☆10Oct 20, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- ☆33Nov 21, 2025Updated 6 months ago
- Expert annotated Hallmarks of Cancer Corpus☆21Sep 18, 2018Updated 7 years ago
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu…☆29Jul 27, 2025Updated 10 months ago
- Repository for the paper: "Using deep learning to predict outcomes of legal appeals better than human experts"☆10Aug 1, 2022Updated 3 years ago
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆45May 20, 2026Updated last week
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated 2 years ago
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆15Oct 5, 2023Updated 2 years ago
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"☆35Sep 20, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Smart commit messages☆18Oct 25, 2024Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆160Jul 14, 2025Updated 10 months ago
- String Distance using cython☆13Jan 19, 2020Updated 6 years ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- a python package for loadimg and converting images☆30Feb 18, 2026Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Apr 17, 2025Updated last year
- Code for the MTEB leaderboard☆30Feb 4, 2025Updated last year