Continual-Intelligence / SEALLinks
Self-Adapting Language Models
☆1,693Updated 6 months ago
Alternatives and similar repositories for SEAL
Users that are interested in SEAL are comparing it to the libraries listed below
Sorting:
- AlphaGo Moment for Model Architecture Discovery.☆1,131Updated 2 months ago
- codes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆755Updated this week
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,819Updated 5 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆515Updated last month
- dLLM: Simple Diffusion Language Modeling☆1,716Updated this week
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…☆597Updated 3 weeks ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆938Updated 8 months ago
- Official implementation of "Continuous Autoregressive Language Models"☆726Updated 2 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,187Updated last year
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,750Updated last month
- An interface library for RL post training with environments.☆1,112Updated this week
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆966Updated 8 months ago
- Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs☆2,171Updated 4 months ago
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆642Updated last week
- General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.☆2,028Updated this week
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.☆833Updated last month
- This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.☆1,197Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆577Updated 4 months ago
- Agent0 Series: Self-Evolving Agents from Zero Data☆1,028Updated last month
- ☆1,388Updated 4 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆358Updated 7 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆825Updated 2 weeks ago
- ☆1,283Updated 2 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆592Updated last month
- ☆724Updated 2 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆496Updated 5 months ago
- Post-training with Tinker☆2,805Updated this week
- Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.☆1,970Updated last month
- On the Theoretical Limitations of Embedding-Based Retrieval☆622Updated 4 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆255Updated 2 months ago