zhentingqi/evolm

☆75

Alternatives and similar repositories for evolm

Users that are interested in evolm are comparing it to the libraries listed below.

rosieyzh / openrlhf-pretrain
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆29 • Oct 14, 2025 • Updated 9 months ago
Frostlinx / Socratic-Zero
Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning
☆37 • Oct 26, 2025 • Updated 8 months ago
Interplay-LM-Reasoning / Interplay-LM-Reasoning
[ICML 2026 Spotlight] On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models
☆162 • Jun 8, 2026 • Updated last month
alex-damian / EOS
☆15 • Sep 29, 2022 • Updated 3 years ago
cmu-mind / RISE
☆34 • Oct 31, 2024 • Updated last year
Utaotao / ProFit
☆35 • Jan 20, 2026 • Updated 6 months ago
fjzzq2002 / WeightWatch
Official Repository of Paper "Watch the Weights: Unsupervised monitoring and control of fine-tuned LLMs"
☆15 • Sep 25, 2025 • Updated 9 months ago
epfml / llm-optimizer-benchmark
Benchmarking Optimizers for LLM Pretraining
☆60 • May 3, 2026 • Updated 2 months ago
facebookresearch / PhysicsLM4
Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality
☆356 • May 20, 2026 • Updated 2 months ago
facebookresearch / scalable-curvature
Code for Dayal Kalra's research internship on scalable curvature measures for neural networks.
☆29 • Feb 3, 2026 • Updated 5 months ago
hanningzhang / ER-PRM
☆20 • Dec 14, 2024 • Updated last year
ByteDance-Seed / DATAMASK
Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning
☆21 • Jan 4, 2026 • Updated 6 months ago
LeiLiLab / HardTestGen
☆17 • Jan 27, 2026 • Updated 5 months ago
VITA-Group / ProgressiveDD
[ICLR 2024] "Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality" by Xuxi Chen*, Yu Yang*, Zhangyang Wang, Baha…
☆15 • May 18, 2024 • Updated 2 years ago
yongchao98 / PROMST
Automatic prompt optimization framework for multi-step agent tasks.
☆37 • Nov 12, 2024 • Updated last year
OPTML-Group / DP4TL
[NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…
☆14 • Oct 12, 2023 • Updated 2 years ago
AnonymousNIPS2019 / DeepnetHessian
The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size
☆19 • May 19, 2019 • Updated 7 years ago
tianyi-lab / Moltbook_Socialization
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook
☆18 • Feb 17, 2026 • Updated 5 months ago
sanyalsunny111 / Looped-GPT
Minimal and highly hackable implementation of Looped Transformers with GPT
☆25 • Mar 8, 2026 • Updated 4 months ago
alexOarga / compositional_reasoning
[NeurIPS'25] Generalizable Reasoning through Compositional Energy Minimization
☆28 • Oct 28, 2025 • Updated 8 months ago
feiyang-k / AutoScale
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆14 • Aug 8, 2025 • Updated 11 months ago
kvfrans / matrix-whitening
Code for "What really matters in matrix-whitening optimizers?"
☆25 • Oct 31, 2025 • Updated 8 months ago
RulinShao / massive-serve
Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.
☆26 • Jun 6, 2025 • Updated last year
sunjie279 / SimCT-
☆21 • May 22, 2026 • Updated last month
lili-chen / self-questioning-lm
Self-Questioning Language Models
☆57 • Mar 30, 2026 • Updated 3 months ago
HaoKang-Timmy / LatencySensitiveBench
First Latency-Aware Competitive LLM Agent Benchmark
☆32 • Jun 3, 2025 • Updated last year
huanranchen / LLMLandscape
The loss landscape of Large Language Models resemble basin!
☆41 • Jul 8, 2025 • Updated last year
haidongz-usc / CaesarNeRF
This repo is for CaesarNeRF: Calibrated Semantic Representation for Few-Shot Generalizable Neural Rendering.
☆14 • Mar 6, 2024 • Updated 2 years ago
shawnricecake / Heima
[ICML 2026] Heima
☆75 • May 20, 2026 • Updated 2 months ago
BaohaoLiao / SAGE
Self-Hinting Language Models Enhance Reinforcement Learning
☆26 • Mar 28, 2026 • Updated 3 months ago
adymaharana / d2pruning
☆44 • Oct 13, 2023 • Updated 2 years ago
tim-lawson / mlsae
Multi-Layer Sparse Autoencoders (ICLR 2025)
☆30 • Feb 6, 2026 • Updated 5 months ago
Model-GLUE / Model-GLUE
☆18 • Aug 19, 2024 • Updated last year
yifeiwang77 / Self-Correction
☆20 • Nov 3, 2024 • Updated last year
cvenhoff / steering-thinking-llms
☆38 • Jul 9, 2025 • Updated last year
XiangchengZhang / Diffusion-inference-scaling
Official Implementation for Inference-time Scaling of Diffusion Models through Classical Search
☆33 • Oct 8, 2025 • Updated 9 months ago
Hyun1A / CPE
[ICLR 2025] Official PyTorch Implementation for CPE: Concept Pinpoint Eraser for Text-to-image Diffusion Models via Residual Attention Ga…
☆13 • Apr 7, 2025 • Updated last year
sail-sg / understand-r1-zero
Understanding R1-Zero-Like Training: A Critical Perspective
☆1,267 • Aug 27, 2025 • Updated 10 months ago
YujiaBao / Predict-then-Interpolate
"Predict, then Interpolate: A Simple Algorithm to Learn Stable Classifiers" ICML 2021
☆17 • Jun 1, 2021 • Updated 5 years ago