Continual-Intelligence / SEALLinks
Self-Adapting Language Models
☆1,502Updated 3 months ago
Alternatives and similar repositories for SEAL
Users that are interested in SEAL are comparing it to the libraries listed below
Sorting:
- AlphaGo Moment for Model Architecture Discovery.☆1,114Updated 3 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆667Updated 2 weeks ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,162Updated 9 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆348Updated 4 months ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,732Updated 3 months ago
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆712Updated last month
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆488Updated this week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆913Updated 5 months ago
- dLLM: Simple Diffusion Language Modeling☆529Updated last week
- Official Repository of Absolute Zero Reasoner☆1,747Updated 2 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆477Updated 2 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆644Updated this week
- ☆1,335Updated 2 months ago
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,396Updated last month
- Official implementation of "Continuous Autoregressive Language Models"☆470Updated last week
- ☆702Updated last month
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆940Updated 5 months ago
- ☆1,161Updated 2 weeks ago
- An interface library for RL post training with environments.☆687Updated this week
- Post-training with Tinker☆1,932Updated this week
- Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs☆1,877Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆479Updated 2 months ago
- OpenAI Frontier Evals☆942Updated 2 weeks ago
- ☆271Updated 7 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆843Updated last month
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆447Updated 2 months ago
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆596Updated 5 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆231Updated this week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆275Updated 4 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆568Updated 3 months ago