test-time-training/discover

test-time-training / discover

☆611

Alternatives and similar repositories for discover

Users who are interested in discover are comparing it to the libraries listed below.

ypwang61 / ThetaEvolve
ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…
☆170 · Feb 27, 2026 · Updated 4 months ago
test-time-training / e2e
Official JAX implementation of End-to-End Test-Time Training for Long Context
☆625 · Feb 15, 2026 · Updated 5 months ago
skydiscover-ai / skydiscover
AI-Driven Scientific and Algorithmic Discovery
☆585 · Jun 14, 2026 · Updated last month
NovaSky-AI / SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,088 · Updated this week
Zhiyuan-Zeng / RLVE
[ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
☆225 · Apr 30, 2026 · Updated 2 months ago
caoshiyi / K-Search
Automated High-Performance GPU Kernel Generation
☆120 · Jun 1, 2026 · Updated last month
algorithmicsuperintelligence / openevolve
Open-source implementation of AlphaEvolve
☆6,789 · Updated this week
SakanaAI / ShinkaEvolve
ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution 🧬
☆1,291 · Updated this week
lasgroup / SDPO
Reinforcement Learning via Self-Distillation (SDPO)
☆1,021 · Jul 1, 2026 · Updated 3 weeks ago
idanshen / Self-Distillation
☆662 · Apr 7, 2026 · Updated 3 months ago
SakanaAI / robust-kbench
☆101 · Nov 22, 2025 · Updated 8 months ago
FrontierCS / Frontier-CS
A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.
☆285 · Updated this week
ByteDance-Seed / In-Place-TTT
☆248 · Apr 21, 2026 · Updated 3 months ago
aakaran / reasoning-with-sampling
☆438 · Nov 7, 2025 · Updated 8 months ago
ScalingIntelligence / KernelBench
KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)
☆1,156 · Mar 24, 2026 · Updated 4 months ago
flashinfer-ai / flashinfer-bench
Building the Virtuous Cycle for AI-driven LLM Systems
☆261 · May 1, 2026 · Updated 2 months ago
SakanaAI / ALE-Bench
The official repository of ALE-Bench
☆201 · Jul 16, 2026 · Updated last week
open-tinker / OpenTinker
OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆676 · Mar 21, 2026 · Updated 4 months ago
PRIME-RL / TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
☆1,103 · Apr 15, 2026 · Updated 3 months ago
GMLR-Penn / Multiplex-Thinking
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge
☆131 · May 24, 2026 · Updated 2 months ago
ypwang61 / One-Shot-RLVR
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
☆444 · Mar 11, 2026 · Updated 4 months ago
Human-Agent-Society / CORAL
🔥🔥COLM 2026🔥🔥 CORAL is a robust, lightweight infrastructure for multi-agent autonomous self-evolution, built for autoresearch. W…
☆841 · Updated this week
zksha / alma
ALMA (Automated meta-Learning of Memory designs for Agentic systems) is a framework that meta-learns memory designs to replace human-engi…
☆248 · Apr 8, 2026 · Updated 3 months ago
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆5,727 · Updated this week
Imbernoulli / MLS-Bench
☆73 · Updated this week
verl-project / verl
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
☆22,649 · Updated this week
radixark / miles
Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.
☆1,784 · Updated this week
THUDM / slime
slime is an LLM post-training framework for RL Scaling.
☆7,621 · Updated this week
thinking-machines-lab / tinker-cookbook
Post-training with Tinker
☆3,906 · Updated this week
recursive-org / first-steps-toward-automated-ai-research
Research artifacts from Recursive's automated AI research system
☆184 · Jun 11, 2026 · Updated last month
TIGER-AI-Lab / verl-tool
A version of verl to support diverse tool use [TMLR 2026]
☆1,024 · Jul 15, 2026 · Updated last week
LukeBailey181 / sgs
☆76 · Apr 26, 2026 · Updated 2 months ago
PrimeIntellect-ai / prime-rl
Agentic RL Training at Scale
☆1,723 · Updated this week
a1600012888 / LaCT
Code release for paper "Test-Time Training Done Right"
☆499 · Jan 5, 2026 · Updated 6 months ago
OpenRewardAI / openreward-cookbook
Training and evaluating with OpenReward
☆33 · Apr 28, 2026 · Updated 2 months ago
axon-rl / gem
A Gym for Agentic LLMs
☆502 · Jan 21, 2026 · Updated 6 months ago
VsonicV / es-at-scale
This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"
☆376 · Jun 26, 2026 · Updated 3 weeks ago
linhaowei1 / SLD
[ICLR26] AI-based scaling law discovery
☆31 · Jan 30, 2026 · Updated 5 months ago
PeterGriffinJin / Search-R1
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,150 · Nov 13, 2025 · Updated 8 months ago