Nicolas-BZRD/EuroBERT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Nicolas-BZRD/EuroBERT)

Nicolas-BZRD / EuroBERT

Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including CPU, AMD, and NVIDIA GPUs.

☆70

Alternatives and similar repositories for EuroBERT

Users that are interested in EuroBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

raphaelsty / LeNLP
View on GitHub
NLP with Rust for Python 🦀🐍
☆72Jun 9, 2026Updated last month
webis-de / lightning-ir
View on GitHub
One-stop shop for running and fine-tuning transformer-based language models for retrieval
☆65Jul 9, 2026Updated last week
stephantul / skeletoken
View on GitHub
Datamodels for hugging face tokenizers
☆108Jun 18, 2026Updated last month
lucasb-eyer / dotfiles
View on GitHub
My configuration files, loosely inspired by @sontek
☆39Jul 6, 2026Updated 2 weeks ago
nomic-ai / contrastors
View on GitHub
Train Models Contrastively in Pytorch
☆798Mar 26, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
solr-cool / solr-cool.github.io
View on GitHub
The Solr Package Directory and Sanctuary
☆13May 28, 2026Updated last month
huggingface / dedupe_estimator
View on GitHub
Chunk Dedupe Estimation
☆20Nov 5, 2024Updated last year
theroyallab / llm-prompt-templates
View on GitHub
Prompt Jinja2 templates for LLMs
☆36Jul 9, 2025Updated last year
zhangir-azerbayev / MetaMath
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
mixedbread-ai / baguetter
View on GitHub
Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…
☆210Aug 31, 2024Updated last year
henrikalbihn / gliner-as-a-service
View on GitHub
GLiNER model in a FastAPI microservice.
☆47Dec 11, 2024Updated last year
ibm-granite / granite-embedding-models
View on GitHub
☆77May 14, 2026Updated 2 months ago
biaslyze-dev / biaslyze
View on GitHub
The NLP Bias Identification Toolkit
☆39Sep 8, 2023Updated 2 years ago
penfever / wildchat-50m
View on GitHub
Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.
☆38Apr 1, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
AnswerDotAI / ModernBERT
View on GitHub
Bringing BERT into modernity via both architecture changes and scaling
☆1,701Mar 1, 2026Updated 4 months ago
bigdata-ustc / TechCD
View on GitHub
Source codes and datasets for paper "Leveraging Transferable Knowledge Concept Graph Embedding for Cold-Start Cognitive Diagnosis" (SIGIR…
☆21Mar 15, 2024Updated 2 years ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
View on GitHub
☆53Feb 10, 2025Updated last year
cipher982 / llm-benchmarks
View on GitHub
Benchmarking LLM Inference Speeds
☆14May 17, 2026Updated 2 months ago
zafstojano / policy-gradients
View on GitHub
A minimal hackable implementation of policy gradient methods (GRPO, PPO, REINFORCE)
☆16Feb 20, 2026Updated 5 months ago
huseinzol05 / transformers-continuous-batching
View on GitHub
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
☆29Mar 15, 2025Updated last year
HITsz-TMG / KaLM-Embedding
View on GitHub
Code for KaLM-Embedding models
☆116Jun 30, 2025Updated last year
gangiswag / llm-reranker
View on GitHub
☆63Jan 26, 2025Updated last year
lightonai / fast-plaid
View on GitHub
High-Performance Engine for Multi-Vector Search
☆268May 28, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
amazon-science / CodeSage
View on GitHub
CodeSage: Code Representation Learning At Scale (ICLR 2024)
☆122Oct 27, 2024Updated last year
donglinchen / text_classification
View on GitHub
Build text classifiers using 3 most popular machine learning / deep learning frameworks - Scikit-learn, PyTorch, TensorFlow
☆10Sep 22, 2021Updated 4 years ago
mozilla-ai / visual-dspy
View on GitHub
Visual demo of DSPy's prompt optimization on Gradio
☆15Apr 14, 2025Updated last year
igiannakas / tailscale
View on GitHub
Tailscale fixes
☆12Sep 17, 2024Updated last year
dagster-io / dagster-cloud-hybrid-quickstart
View on GitHub
Template for getting started with Hybrid Dagster Cloud
☆15Sep 19, 2025Updated 10 months ago
SonicCodes / subcloning
View on GitHub
implementation of https://arxiv.org/pdf/2312.09299
☆21Jul 3, 2024Updated 2 years ago
gabrfarina / exp-a-spiel
View on GitHub
Exploitability calculation for imperfect-information game benchmarks
☆37Apr 5, 2025Updated last year
tonywu71 / colpali-cookbooks
View on GitHub
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆356Jun 2, 2025Updated last year
dgarcia-eu / CMSS-Konstanz
View on GitHub
Computational Modelling of Social Systems at the University of Konstanz
☆11Jun 3, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
justinchiu / openlogprobs
View on GitHub
Extract full next-token probabilities via language model APIs
☆248Feb 23, 2024Updated 2 years ago
jxmorris12 / cde
View on GitHub
code for training & evaluating Contextual Document Embedding models
☆207May 14, 2025Updated last year
ByungKwanLee / Distill-R1
View on GitHub
Open-source RL Framework with Online Teacher-Student Distillation
☆22Mar 5, 2026Updated 4 months ago
google-research-datasets / Education-Dialogue-Dataset
View on GitHub
Dataset of conversations, generated by prompting Gemini Ultra. These are conversations between a teacher and a student, where the teacher…
☆35Oct 29, 2024Updated last year
fboerncke / private-prompts-prototype
View on GitHub
Private Prompts Prototype Documentation
☆38Oct 29, 2025Updated 8 months ago
chandar-lab / NeoBERT
View on GitHub
☆108Jun 2, 2025Updated last year
NohTow / PPL-MCTS
View on GitHub
Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22
☆66Oct 25, 2022Updated 3 years ago