CMU-AIRe/QED-Nano

Training tiny models to prove hard theorems

☆81

Alternatives and similar repositories for QED-Nano

Users interested in QED-Nano are comparing it to the repositories listed below.

ars22 / e3
☆20 · Sep 16, 2025 · Updated 10 months ago
CMU-AIRe / POPE
☆27 · Jan 31, 2026 · Updated 5 months ago
allenai / olmix
☆41 · May 26, 2026 · Updated last month
IanYHWu / rc
Public-facing codebase accompanying: "Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL"
☆36 · Feb 6, 2026 · Updated 5 months ago
cocoa-org / NanoRollout
Scale digital agent rollouts without pain.
☆34 · Jun 18, 2026 · Updated last month
analokmaus / kaggle-aimo2-fast-math-r1
Kaggle AIMO2 solution with token-efficient reasoning LLM recipes
☆50 · Aug 7, 2025 · Updated 11 months ago
RefineBench / refinebench-eval
Official code and dataset for our paper: RefineBench: Evaluating Refinement Capability of Language Models via Checklists
☆17 · Dec 1, 2025 · Updated 7 months ago
facebookresearch / threadweaver
The implementation for ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models
☆67 · Apr 8, 2026 · Updated 3 months ago
chen-hao-chao / mdm-prime-v2
MDM-Prime-v2: Binary Encoding and Index Shuffling Enable Scaling of Diffusion Language Models
☆27 · May 23, 2026 · Updated last month
anadim / smallest-addition-transformer-claude-code
6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.
☆22 · Feb 19, 2026 · Updated 5 months ago
violetxi / ExpRL
☆19 · Jun 16, 2026 · Updated last month
feng-rrRay / Continual-Harness-ARC-AGI-3
Official implementation of Continual Harness (arxiv.org/abs/2605.09998) on ARC-AGI-3
☆33 · Jul 3, 2026 · Updated 2 weeks ago
facebookresearch / darling
Official Implementation of the paper "Jointly Reinforcing Diversity and Quality in Language Model Generations"
☆61 · May 8, 2026 · Updated 2 months ago
HazyResearch / scaling-verification
☆26 · Sep 4, 2025 · Updated 10 months ago
fjzzq2002 / WeightWatch
Official Repository of Paper "Watch the Weights: Unsupervised monitoring and control of fine-tuned LLMs"
☆15 · Sep 25, 2025 · Updated 9 months ago
ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆427 · Updated this week
sheriyuo / ETS
[ICML 2026] ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment
☆19 · May 15, 2026 · Updated 2 months ago
illinoisdata / ElasticNotebook
Enabling Live Migration for Computational Notebooks.
☆13 · Mar 11, 2024 · Updated 2 years ago
ScalingIntelligence / kernelbench-tinker
Tinker ↔ KernelBench Integration enabling RL for GPU Kernel Generation
☆29 · Mar 5, 2026 · Updated 4 months ago
hazan-lab / flash-stu
PyTorch implementation of the Flash Spectral Transform Unit.
☆22 · Sep 19, 2024 · Updated last year
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆28 · Sep 17, 2024 · Updated last year
kuleshov-group / d2
d2: Improved Techniques for Training Reasoning Diffusion Language Models
☆16 · Mar 25, 2026 · Updated 3 months ago
togethercomputer / aurora
☆72 · Apr 30, 2026 · Updated 2 months ago
hallerite / ludic
Ludic – an LLM-RL library for the era of experience
☆67 · Jan 9, 2026 · Updated 6 months ago
lucidrains / neural-grok
Explorations into the proposed NeuralGrok from Zhou et al. of EPFL
☆15 · Oct 18, 2025 · Updated 9 months ago
allenai / SERA
Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.
☆147 · May 25, 2026 · Updated last month
neelsomani / kv-marketplace
Cross-GPU KV Cache Marketplace
☆26 · Nov 12, 2025 · Updated 8 months ago
slime-n / slime-n
A Multi-Policy, Multi-Agent RL Training Framework
☆30 · Jun 16, 2026 · Updated last month
insait-institute / open-proof-corpus
This repository contains the code for the paper The Open Proof Corpus: Building a Large-Scale, Human-Validated Dataset of LLM-Generated P…
☆18 · Aug 4, 2025 · Updated 11 months ago
sanyalsunny111 / Looped-GPT
Minimal and highly hackable implementation of Looped Transformers with GPT
☆25 · Mar 8, 2026 · Updated 4 months ago
PrimeIntellect-ai / renderers
Programmable chat templates for LLM training and inference.
☆133 · Updated this week
lucidrains / HiLAM
Implementation of the Hierarchical Latent Action Model, proposed by Hanjung Kim et al. of Yonsei University
☆17 · May 6, 2026 · Updated 2 months ago
shadowkiller33 / Contrast-Instruction
☆19 · Oct 2, 2023 · Updated 2 years ago
Kripner / nanoproof
Minimal open-source implementation of AlphaProof and HyperTree Proof Search.
☆87 · May 13, 2026 · Updated 2 months ago
goedelcodeprover / Goedel-Code-Prover
☆49 · Apr 12, 2026 · Updated 3 months ago
lucidrains / disco-rl-pytorch
Implementation and explorations into DiscoRL, Discovering state-of-the-art reinforcement learning algorithms, David Silver's last work at…
☆20 · Jun 13, 2026 · Updated last month
s-vco / s-vco
Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images
☆19 · Jun 4, 2025 · Updated last year
axolotl-ai-cloud / grpo_code
A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆41 · Apr 4, 2025 · Updated last year
qzp2018 / UniECS
Official implementation of the CIKM 2025 paper "UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion"
☆21 · Sep 17, 2025 · Updated 10 months ago