OLMost every training recipe you need to perform data interventions with the OLMo family of models.
☆73May 29, 2026Updated 3 weeks ago
Alternatives and similar repositories for olmo-cookbook
Users that are interested in olmo-cookbook are comparing it to the libraries listed below.
- ☆14Jul 13, 2025Updated 11 months ago
- decontamination☆33Mar 4, 2026Updated 3 months ago
- CompChomper is a framework for measuring how LLMs perform at code completion.☆21Apr 29, 2025Updated last year
- Data mapping framework for rust stuff☆54Mar 25, 2026Updated 2 months ago
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆47Updated this week
- ☆15Jul 5, 2024Updated last year
- PyTorch building blocks for the OLMo ecosystem☆1,289Updated this week
- Just a repository that will house some MLPs and their variants, so to avoid having to reimplement them again and again for different proj…☆50Apr 27, 2026Updated last month
- Official Pytorch implementation of Chromatic Graph Transformers☆10Jun 14, 2023Updated 3 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆11Dec 30, 2024Updated last year
- A library for Partially Homomorphic Encryption in Python☆12May 30, 2017Updated 9 years ago
- Don't just regulate gradients like in Muon, regulate the weights too☆32Jul 30, 2025Updated 10 months ago
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems☆15Nov 11, 2023Updated 2 years ago
- ☆12Sep 16, 2024Updated last year
- Rust library for indexing and quickly searching large pretraining corpora☆31Oct 30, 2025Updated 7 months ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆29Jul 23, 2025Updated 10 months ago
- A Descope authentication plugin for Reflex☆29Sep 23, 2025Updated 8 months ago
- Automate the creation of high quality research papers in latex. Powered by Swarms 🤖☆11Dec 1, 2025Updated 6 months ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 3 years ago
- JIT-compiled GPU kernels for quantum chemistry☆35Jan 30, 2026Updated 4 months ago
- A file-backed dictionary for Python☆12Aug 15, 2022Updated 3 years ago
- Granite Kitchen -- "appliances" for use by the Granite Cookbooks such as inference platforms☆29Updated this week
- ☆33Nov 20, 2025Updated 6 months ago
- ☆12Mar 19, 2021Updated 5 years ago
- Building LLMs from scratch following the book from S. Raschka☆34Mar 27, 2025Updated last year
- Official implementation of UnifiedReward & UnifiedReward-Think☆18Jun 18, 2025Updated last year
- Simplest online-softmax notebook for explaining Flash Attention☆17Jan 27, 2026Updated 4 months ago
- A Lean4 script for robustly verifying submitted proofs of theorems and implementations of functions☆46Apr 22, 2026Updated last month
- Conditional Linear Dynamical Systems☆17Oct 7, 2025Updated 8 months ago
- Data and tools for generating and inspecting OLMo pre-training data.☆1,508Nov 5, 2025Updated 7 months ago
- Toolkit for building prompt templates for language models☆12Sep 30, 2022Updated 3 years ago
- Learn LangChain for Generative AI with OpenAI API using Python☆11Feb 15, 2024Updated 2 years ago
- Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022☆18Nov 23, 2022Updated 3 years ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 3 years ago
- ☆16Apr 23, 2026Updated last month
- Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.☆14Feb 8, 2023Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆42Jan 26, 2025Updated last year
- [ICLR2026] The first W4A4KV4 quantized + 50% sparse LLMs!☆32Jan 26, 2026Updated 4 months ago
- u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…☆19Jul 2, 2020Updated 5 years ago