Continual-Intelligence / SEALLinks
Self-Adapting Language Models
☆430Updated this week
Alternatives and similar repositories for SEAL
Users that are interested in SEAL are comparing it to the libraries listed below
Sorting:
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆311Updated 8 months ago
- Build your own visual reasoning model☆381Updated this week
- Tina: Tiny Reasoning Models via LoRA☆258Updated 3 weeks ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆649Updated 2 weeks ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆337Updated 6 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆486Updated last month
- ☆149Updated 2 months ago
- procedural reasoning datasets☆841Updated last week
- Dream 7B, a large diffusion language model☆764Updated last week
- ☆207Updated 3 months ago
- Pretraining code for a large-scale depth-recurrent language model☆782Updated last week
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆498Updated this week
- ☆569Updated 2 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,100Updated 4 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆514Updated this week
- ☆189Updated this week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆219Updated last month
- ☆157Updated last month
- prime-rl is a codebase for decentralized async RL training at scale☆341Updated this week
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆108Updated last month
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆223Updated 3 months ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆545Updated 3 months ago
- Code for the paper: "Learning to Reason without External Rewards"☆295Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆447Updated 8 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling☆574Updated this week
- Code for ExploreTom☆83Updated 6 months ago
- Exploring Applications of GRPO☆230Updated last month
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆315Updated 7 months ago
- LIMO: Less is More for Reasoning☆960Updated 2 months ago
- TTRL: Test-Time Reinforcement Learning☆637Updated 2 weeks ago