Aloriosa / srmt
The original Shared Recurrent Memory Transformer implementation
☆27 Updated last month
Alternatives and similar repositories for srmt
Users that are interested in srmt are comparing it to the libraries listed below
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆96 Updated last month
- ☆20 Updated 2 weeks ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508) ☆35 Updated last week
- ☆23 Updated 3 weeks ago
- ☆10 Updated 2 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows! ☆72 Updated last month
- A testbed for agents and environments that can automatically improve models through data generation. ☆24 Updated 4 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆115 Updated 8 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ☆24 Updated 3 weeks ago
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training ☆36 Updated 2 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods. ☆23 Updated 3 months ago
- ☆13 Updated 7 months ago
- ☆11 Updated 11 months ago
- ☆52 Updated 8 months ago
- ☆66 Updated 3 months ago
- ☆24 Updated 9 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness ☆75 Updated last month
- ☆47 Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated 10 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated last year
- Accompanying material for sleep-time compute paper ☆97 Updated 2 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories ☆27 Updated this week
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆35 Updated last year
- Resa: Transparent Reasoning Models via SAEs ☆39 Updated last month
- Official repo of paper LM2 ☆41 Updated 5 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models ☆38 Updated 2 months ago
- A repository for research on medium sized language models. ☆77 Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models ☆60 Updated 4 months ago
- Official Code Release for "Training a Generally Curious Agent" ☆26 Updated last month
- ☆32 Updated 2 months ago