allenai/olmo-cookbook

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/allenai/olmo-cookbook)

allenai / olmo-cookbook

OLMost every training recipe you need to perform data interventions with the OLMo family of models.

☆72

Alternatives and similar repositories for olmo-cookbook

Users that are interested in olmo-cookbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

allenai / beaker-gantry
View on GitHub
Gantry provides an API that streamlines running experiments in Beaker
☆33Jul 20, 2026Updated last week
allenai / datamap-rs
View on GitHub
Data mapping framework for rust stuff
☆56Mar 25, 2026Updated 4 months ago
allenai / OLMo-core
View on GitHub
PyTorch building blocks for the OLMo ecosystem
☆1,433Updated this week
allenai / FlexOlmo
View on GitHub
Code and training scripts for FlexOlmo
☆151Apr 20, 2026Updated 3 months ago
allenai / dolma3
View on GitHub
☆81Apr 20, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
mayank31398 / ladder-residual-inference
View on GitHub
☆14Jul 13, 2025Updated last year
taylorai / lm-deluge
View on GitHub
utilities for batched llm calls with retries
☆51Updated this week
allenai / AskOlmo
View on GitHub
☆15Nov 19, 2025Updated 8 months ago
lucidrains / genmimic
View on GitHub
Implementation of GenMimic, "From Generated Human Videos to Physically Plausible Robot Trajectories"
☆18Dec 9, 2025Updated 7 months ago
RadicalNumerics / spear
View on GitHub
Structured Primitives for Efficient Architecture Research
☆20Dec 22, 2025Updated 7 months ago
Zyphra / zcookbook
View on GitHub
Training hybrid models for dummies.
☆31Nov 1, 2025Updated 8 months ago
allenai / hybrid-preferences
View on GitHub
Learning to route instances for Human vs AI Feedback (ACL Main '25)
☆29Jul 23, 2025Updated last year
allenai / cached_path
View on GitHub
A file utility for accessing both local and remote files through a unified interface.
☆47Apr 27, 2026Updated 3 months ago
allenai / OLMo-Eval-Legacy
View on GitHub
Evaluation suite for LLMs
☆378Jul 11, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
BryceZhuo / HybridNorm
View on GitHub
The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization
☆19Mar 7, 2025Updated last year
acl-org / ethics-reading-list
View on GitHub
A list of ethics related resources for researchers and practitioners of Natural Language Processing and Computational Linguistics
☆34Oct 20, 2025Updated 9 months ago
allenai / allennlp-server
View on GitHub
A simple demo server for AllenNLP models.
☆28Jun 26, 2023Updated 3 years ago
allenai / OLMoE
View on GitHub
OLMoE: Open Mixture-of-Experts Language Models
☆1,045Sep 23, 2025Updated 10 months ago
allenai / fluid-benchmarking
View on GitHub
Fluid Language Model Benchmarking
☆29Sep 16, 2025Updated 10 months ago
allenai / asta-bench
View on GitHub
☆124Updated this week
explosion / spacy-curated-transformers
View on GitHub
spaCy entry points for Curated Transformers
☆32Mar 27, 2026Updated 4 months ago
ByteDance-Seed / SAIL
View on GitHub
Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"
☆85Oct 29, 2025Updated 9 months ago
TIGER-AI-Lab / PixelWorld
View on GitHub
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆15Sep 12, 2025Updated 10 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
skhemlani / mReasoner
View on GitHub
mReasoner is a unified computational implementation of the model theory of thinking and reasoning
☆16Aug 17, 2023Updated 2 years ago
allenai / DrawEduMath
View on GitHub
Can VLMs understand students' hand-drawn math work?
☆19Jan 20, 2026Updated 6 months ago
Farseer-Scaling-Law / Farseer
View on GitHub
☆21Jun 12, 2025Updated last year
allenai / dolma
View on GitHub
Data and tools for generating and inspecting OLMo pre-training data.
☆1,528Nov 5, 2025Updated 8 months ago
kuzudb / dspy-kuzu-demo
View on GitHub
Intro to using DSPy with Kuzu to enrich the data within the Nobel Laureate mentorship network
☆16Sep 16, 2025Updated 10 months ago
NanoNets / Nanonets-OCR2
View on GitHub
Evaluations for Nanonets-OCR-1.5
☆18Oct 16, 2025Updated 9 months ago
allenai / duplodocus
View on GitHub
Tooling for exact and MinHash deduplication of large-scale text datasets
☆92Mar 24, 2026Updated 4 months ago
qinjr / RankFlow
View on GitHub
☆12Dec 15, 2022Updated 3 years ago
allenai / awesome-open-source-lms
View on GitHub
Friends of OLMo and their links.
☆363Sep 15, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
View on GitHub
☆53Feb 10, 2025Updated last year
QwenLM / ConsisEval
View on GitHub
☆14Jul 5, 2024Updated 2 years ago
apple / ml-vlsu
View on GitHub
☆14Nov 18, 2025Updated 8 months ago
carekit-apple / CareKitSample
View on GitHub
A sample app that demonstrates how to use CareKit's APIs
☆12Jun 25, 2021Updated 5 years ago
taranjeet / llmformatter
View on GitHub
Get deterministic output in any format like json from any LLM.
☆19Apr 25, 2023Updated 3 years ago
Tencent-Hunyuan / Hunyuan-4B
View on GitHub
☆16Aug 5, 2025Updated 11 months ago
mistralai / mistral-evals
View on GitHub
☆90Nov 21, 2025Updated 8 months ago