The original Shared Recurrent Memory Transformer implementation
☆35Jul 11, 2025Updated 9 months ago
Alternatives and similar repositories for srmt
Users who are interested in srmt are comparing it to the libraries listed below.
- Approach where the repulsive potential in an MPC pipeline is estimated by a neural model.☆24Mar 5, 2026Updated last month
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)☆32Jun 14, 2025Updated 10 months ago
- POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically desig…☆46Jul 7, 2025Updated 9 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆130Jun 11, 2025Updated 10 months ago
- Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)☆49May 23, 2025Updated 11 months ago
- Natural Language Reinforcement Learning☆102Jul 30, 2025Updated 9 months ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 9 months ago
- ☆23Apr 17, 2026Updated 2 weeks ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 11 months ago
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆25Oct 27, 2024Updated last year
- [IROS-2025] MAPF-GPT-DDG is a scalable decentralized multi-agent pathfinding (MAPF) solver based on imitation learning. It builds upon MA…☆63Feb 21, 2026Updated 2 months ago
- ☆11May 18, 2025Updated 11 months ago
- ☆19Oct 28, 2025Updated 6 months ago
- ☆64Mar 30, 2026Updated last month
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- Machine Learning from Human Preferences☆32Mar 23, 2026Updated last month
- Mixtral-based Ja-En (En-Ja) Translation model☆20Jan 6, 2025Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 7 months ago
- ☆96Dec 6, 2024Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆100May 16, 2025Updated 11 months ago
- The official GitHub page for the survey paper "A Survey of RWKV".☆32Jan 7, 2025Updated last year
- ☆30Jun 5, 2025Updated 10 months ago
- ☆15Apr 11, 2024Updated 2 years ago
- 🔥 [ICLR 2026] Official implementation of Recurrent Action Transformer with Memory, an offline RL agent with memory mechanisms. https://s…☆23Nov 23, 2025Updated 5 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆171Oct 20, 2025Updated 6 months ago
- (ACL2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation☆35May 28, 2025Updated 11 months ago
- [AAMAS 2024] HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding☆14Mar 12, 2024Updated 2 years ago
- ☆45Jan 26, 2026Updated 3 months ago
- lib and multi traj with comments☆11Aug 30, 2022Updated 3 years ago
- A simple program written with Python's requests library for fetching free proxy IPs in bulk, including "download + verify" routines.☆10Jul 29, 2018Updated 7 years ago
- autoredteam: code for training models that automatically red team other language models☆14Aug 9, 2023Updated 2 years ago
- ☆16Jul 23, 2024Updated last year
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 8 months ago
- Code for paper "PoseEmbroider:Towards a 3D, Visual, Semantic-aware Human Pose Representation" (ECCV 2024)☆18Nov 18, 2024Updated last year
- ☆51Dec 18, 2024Updated last year
- This is a repository containing example code for how you can use unit tests to protect against security regression.☆18Jun 26, 2017Updated 8 years ago
- ☆38May 15, 2025Updated 11 months ago