allenai/dolma3

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/allenai/dolma3)

allenai / dolma3

☆81

Alternatives and similar repositories for dolma3

Users that are interested in dolma3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

allenai / duplodocus
View on GitHub
Tooling for exact and MinHash deduplication of large-scale text datasets
☆92Mar 24, 2026Updated 4 months ago
allenai / datamap-rs
View on GitHub
Data mapping framework for rust stuff
☆56Mar 25, 2026Updated 4 months ago
allenai / OLMo-core
View on GitHub
PyTorch building blocks for the OLMo ecosystem
☆1,436Updated this week
allenai / olmes
View on GitHub
Reproducible, flexible LLM evaluations
☆390Mar 24, 2026Updated 4 months ago
allenai / olmo-cookbook
View on GitHub
OLMost every training recipe you need to perform data interventions with the OLMo family of models.
☆72Jul 21, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hkust-nlp / PreSelect
View on GitHub
[ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches
☆66Mar 4, 2025Updated last year
CodeCreator / WebOrganizer
View on GitHub
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
☆83May 2, 2025Updated last year
Zyphra / Zyda_processing
View on GitHub
☆44Jun 19, 2024Updated 2 years ago
allenai / decon
View on GitHub
decontamination
☆35Mar 4, 2026Updated 4 months ago
joehoover / cog-poet-vicuna-13b
View on GitHub
An instruction tuned large language model with extra support for poetry and verse generation
☆25Jun 5, 2023Updated 3 years ago
frenzymath / REAL-Prover
View on GitHub
☆32Jul 22, 2025Updated last year
asFeng / Diffuser
View on GitHub
☆10Jun 14, 2023Updated 3 years ago
princeton-pli / MeCo
View on GitHub
Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"
☆51Jun 30, 2025Updated last year
AbanteAI / LoCoDiff-bench
View on GitHub
☆33Oct 15, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aHapBean / NITP
View on GitHub
[ICML 2026] NITP: Next Implicit Token Prediction for LLM Pre-training
☆34May 26, 2026Updated 2 months ago
beanie00 / self-distillation-analysis
View on GitHub
Codebase for the work “Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?”
☆75Apr 14, 2026Updated 3 months ago
microsoft / data-efficacy
View on GitHub
Data Efficacy for Language Model Training
☆52May 29, 2026Updated 2 months ago
patrik-cihal / lean-graph
View on GitHub
Theorem relational dependencies automatic extraction and visualization as a graph for Lean4.
☆52Feb 15, 2026Updated 5 months ago
srush / tangent
View on GitHub
Source-to-Source Debuggable Derivatives in Pure Python
☆15Jan 23, 2024Updated 2 years ago
rishabhranawat / DataRater
View on GitHub
Open source implementation of DataRater (https://arxiv.org/abs/2505.17895)
☆26Sep 20, 2025Updated 10 months ago
allenai / infinigram-api
View on GitHub
☆102Jul 16, 2026Updated last week
hkust-nlp / KernelGYM
View on GitHub
[KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations [ICML…
☆196Mar 29, 2026Updated 4 months ago
iiis-ai / IterativeQuestionComposing
View on GitHub
[AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)
☆23Oct 2, 2025Updated 9 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Aleph-Alpha-Research / scaling
View on GitHub
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…
☆66Nov 18, 2025Updated 8 months ago
UmeanNever / RankSurprisalRatio
View on GitHub
[ACL 2026 Main] Official Repo for Paper "Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Ali…
☆17Jul 1, 2026Updated 3 weeks ago
wenquanlu / huginn-latent-cot
View on GitHub
[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…
☆20Oct 4, 2025Updated 9 months ago
allenai / olmix
View on GitHub
☆41May 26, 2026Updated 2 months ago
Tele-AI / T1
View on GitHub
☆18Jul 25, 2025Updated last year
prashanthsadasivan / llava-ios
View on GitHub
Run LLaVA on iPhone
☆29Jul 5, 2024Updated 2 years ago
faabian / algebraic-combinatorics
View on GitHub
Automatic textbook formalization of Grinberg Algebraic Combinatorics
☆17Updated this week
siliconflow / siliconcloud-cookbook
View on GitHub
SiliconCloud Cookbook
☆24Mar 8, 2025Updated last year
thunlp / BlockFFN
View on GitHub
Source codes for paper "BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity".
☆19Jan 10, 2026Updated 6 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
cat-state / modded-nanogpt-moe
View on GitHub
☆17Sep 6, 2025Updated 10 months ago
allenai / IFBench
View on GitHub
☆161May 13, 2026Updated 2 months ago
menik1126 / UNComp
View on GitHub
[EMNLP 2025🔥] UNComp: Can Matrix Entropy Uncover Sparsity? -- A Compressor Design from an Uncertainty-Aware Perspective
☆20Jan 7, 2026Updated 6 months ago
Tencent-Hunyuan / Hunyuan-4B
View on GitHub
☆16Aug 5, 2025Updated 11 months ago
Farseer-Scaling-Law / Farseer
View on GitHub
☆21Jun 12, 2025Updated last year
frenzymath / LeanSearch-v2
View on GitHub
☆18May 18, 2026Updated 2 months ago
allenai / open-instruct
View on GitHub
AllenAI's post-training codebase
☆3,813Updated this week