Self-Adapting Language Models
☆1,711Aug 1, 2025Updated 7 months ago
Alternatives and similar repositories for SEAL
Users who are interested in SEAL are comparing it to the libraries listed below
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents☆1,880Aug 13, 2025Updated 6 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,191Jan 30, 2025Updated last year
- Official Repository of Absolute Zero Reasoner☆1,817Aug 24, 2025Updated 6 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆1,111Jun 8, 2025Updated 8 months ago
- [ICLR 2026] Learning to Reason without External Rewards☆394Jan 26, 2026Updated last month
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆183Jul 23, 2025Updated 7 months ago
- Open-source implementation of AlphaEvolve☆5,525Feb 4, 2026Updated last month
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆39Feb 7, 2026Updated last month
- An MCP server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Jul 3, 2025Updated 8 months ago
- ☆28Jun 5, 2025Updated 9 months ago
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs"☆30Oct 12, 2025Updated 4 months ago
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,788Dec 29, 2025Updated 2 months ago
- ☆144Sep 29, 2025Updated 5 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆865Dec 29, 2025Updated 2 months ago
- The official repository of ALE-Bench☆160Feb 28, 2026Updated last week
- General benchmarking apparatus for running multi-agent systems against benchmarks☆42Jan 29, 2026Updated last month
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆974May 31, 2025Updated 9 months ago
- Large Concept Models: Language modeling in a sentence representation space☆2,338Jan 29, 2025Updated last year
- ☆37Aug 4, 2025Updated 7 months ago
- Hierarchical Reasoning Model Official Release☆12,339Sep 9, 2025Updated 5 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆92Jun 15, 2025Updated 8 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,135Nov 13, 2025Updated 3 months ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆8,695Feb 28, 2026Updated last week
- ☆17Aug 5, 2025Updated 7 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆1,002Feb 23, 2026Updated last week
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆3,111Jul 7, 2025Updated 8 months ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,529Aug 12, 2025Updated 6 months ago
- Official repo of paper LM2☆47Feb 13, 2025Updated last year
- Code for ICML 2024 paper☆35Sep 18, 2025Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆264Feb 12, 2026Updated 3 weeks ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- AlphaGo Moment for Model Architecture Discovery.☆1,135Dec 3, 2025Updated 3 months ago
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬☆12,273Dec 19, 2025Updated 2 months ago
- ☆726Nov 30, 2025Updated 3 months ago
- Training Proactive and Personalized LLM Agents☆102Jan 20, 2026Updated last month
- Official PyTorch implementation for "Large Language Diffusion Models"☆3,643Nov 12, 2025Updated 3 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆62Oct 24, 2025Updated 4 months ago
- Minimal reproduction of DeepSeek R1-Zero☆12,896Feb 27, 2026Updated last week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆373Dec 12, 2024Updated last year