Continual-Intelligence / SEAL
Self-Adapting Language Models
☆800 Updated 2 months ago
Alternatives and similar repositories for SEAL
Users who are interested in SEAL are comparing it to the libraries listed below.
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004) ☆635 Updated 3 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆343 Updated 3 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling ☆475 Updated 2 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input ☆873 Updated 3 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model ☆829 Updated last month
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time! ☆1,151 Updated 8 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆456 Updated last month
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆609 Updated last week
- AlphaGo Moment for Model Architecture Discovery. ☆1,087 Updated 2 months ago
- Post-training with Tinker ☆550 Updated this week
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning" ☆186 Updated 2 weeks ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆322 Updated 11 months ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆557 Updated last month
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling ☆588 Updated 3 months ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen… ☆409 Updated 3 weeks ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆701 Updated this week
- ☆175 Updated 2 months ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents ☆1,673 Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆159 Updated last month
- ☆1,273 Updated 3 weeks ago
- Async RL Training at Scale ☆669 Updated this week
- Dream 7B, a large diffusion language model ☆984 Updated last week
- Continuous Thought Machines, because thought takes time and reasoning is a process. ☆1,309 Updated 2 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution ☆478 Updated last week
- Build your own visual reasoning model ☆409 Updated last month
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆977 Updated 2 weeks ago
- An agent benchmark with tasks in a simulated software company. ☆556 Updated 2 weeks ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆436 Updated last month
- OpenAI Frontier Evals ☆885 Updated last week
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards ☆1,161 Updated this week