The original Shared Recurrent Memory Transformer implementation
☆34Jul 11, 2025Updated 8 months ago
Alternatives and similar repositories for srmt
Users that are interested in srmt are comparing it to the libraries listed below.
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)☆32Jun 14, 2025Updated 9 months ago
- POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically desig…☆45Jul 7, 2025Updated 8 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Jun 11, 2025Updated 9 months ago
- Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)☆45May 23, 2025Updated 10 months ago
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 7 months ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 8 months ago
- [AAMAS 2026] Don’t Blind Your VLA: Aligning Visual Representations for OOD Generalization. https://blind-vla-paper.github.io☆61Jan 25, 2026Updated last month
- ☆16Feb 22, 2025Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 10 months ago
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆24Oct 27, 2024Updated last year
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆62Feb 21, 2026Updated last month
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 6 months ago
- ☆11May 18, 2025Updated 10 months ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆32Nov 2, 2025Updated 4 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- THOUGHTSCULPT, a general reasoning and search method for complex tasks☆13Dec 13, 2024Updated last year
- ☆64Jan 12, 2026Updated 2 months ago
- Machine Learning from Human Preferences☆30Feb 13, 2026Updated last month
- ☆18Mar 11, 2026Updated last week
- Mixtral-based Ja-En (En-Ja) Translation model☆20Jan 6, 2025Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 6 months ago
- ☆96Dec 6, 2024Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆97May 16, 2025Updated 10 months ago
- ☆29Jun 5, 2025Updated 9 months ago
- ☆15Apr 11, 2024Updated last year
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆170Oct 20, 2025Updated 5 months ago
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆34May 28, 2025Updated 9 months ago
- ☆34Jul 16, 2025Updated 8 months ago
- ☆43Jan 26, 2026Updated last month
- ☆16Jul 23, 2024Updated last year
- ☆15Jan 24, 2025Updated last year
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- ☆45Nov 1, 2025Updated 4 months ago
- [ICLR2025] Are Large Vision Language Models Good Game Players?☆12Mar 3, 2025Updated last year
- ☆27Nov 5, 2025Updated 4 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation'☆39Dec 30, 2025Updated 2 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆47Mar 3, 2026Updated 2 weeks ago