The original Shared Recurrent Memory Transformer implementation
☆33 · Jul 11, 2025 · Updated 7 months ago
Alternatives and similar repositories for srmt
Users who are interested in srmt are comparing it to the repositories listed below
- ☆28 · Jul 7, 2025 · Updated 7 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆125 · Jun 11, 2025 · Updated 8 months ago
- ☆16 · Feb 22, 2025 · Updated last year
- Natural Language Reinforcement Learning ☆102 · Jul 30, 2025 · Updated 7 months ago
- Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025) ☆45 · May 23, 2025 · Updated 9 months ago
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral) ☆30 · Jun 14, 2025 · Updated 8 months ago
- ☆14 · Jan 24, 2025 · Updated last year
- ☆60 · Jan 12, 2026 · Updated last month
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025). ☆12 · Sep 22, 2025 · Updated 5 months ago
- ☆11 · May 18, 2025 · Updated 9 months ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks ☆13 · Dec 13, 2024 · Updated last year
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi… ☆14 · Jun 6, 2025 · Updated 8 months ago
- entropix style sampling + GUI ☆27 · Oct 30, 2024 · Updated last year
- ☆15 · Apr 11, 2024 · Updated last year
- autoredteam: code for training models that automatically red team other language models ☆15 · Aug 9, 2023 · Updated 2 years ago
- https://x.com/BlinkDL_AI/status/1884768989743882276 ☆28 · May 4, 2025 · Updated 9 months ago
- ☆14 · Mar 28, 2024 · Updated last year
- ☆43 · Jan 26, 2026 · Updated last month
- Source code of "Reinforcement Learning with Token-level Feedback for Controllable Text Generation" (NAACL 2024) ☆17 · Dec 8, 2024 · Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models. ☆96 · May 16, 2025 · Updated 9 months ago
- ☆46 · Sep 27, 2025 · Updated 5 months ago
- CS194-196 Course Project ☆14 · Feb 20, 2025 · Updated last year
- ☆27 · Jun 5, 2025 · Updated 8 months ago
- Approach where the repulsive potential in an MPC pipeline is estimated by a neural model. ☆22 · Feb 23, 2026 · Updated last week
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation ☆34 · May 28, 2025 · Updated 9 months ago
- A comprehensive and efficient long-context model evaluation framework ☆31 · Updated this week
- Code for paper: Unified Text-to-Image Generation and Retrieval ☆16 · Jul 6, 2024 · Updated last year
- Marketplace ML experiment - training without backprop ☆27 · Sep 9, 2025 · Updated 5 months ago
- ☆96 · Dec 6, 2024 · Updated last year
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation ☆28 · Feb 25, 2025 · Updated last year
- UQ: Assessing Language Models on Unsolved Questions ☆30 · Aug 26, 2025 · Updated 6 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆17 · Mar 31, 2025 · Updated 11 months ago
- Collection of LLM completions for reasoning-gym task datasets ☆30 · Jul 4, 2025 · Updated 7 months ago
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL ☆24 · Oct 27, 2024 · Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆49 · Updated this week
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆169 · Oct 20, 2025 · Updated 4 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness ☆91 · Jun 15, 2025 · Updated 8 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality ☆19 · Mar 4, 2025 · Updated 11 months ago
- This is a repository containing example code for how you can use unit tests to protect against security regression. ☆19 · Jun 26, 2017 · Updated 8 years ago