Thisi is the official code base for paper "Think Before You Act: Decision Transformers with Internal Working Memory"
☆23Jul 12, 2024Updated last year
Alternatives and similar repositories for DT_Mem
Users that are interested in DT_Mem are comparing it to the libraries listed below
Sorting:
- ☆16Dec 9, 2023Updated 2 years ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆39Oct 12, 2023Updated 2 years ago
- ICML'2024: HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning☆24Jun 3, 2024Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆70Aug 8, 2022Updated 3 years ago
- Repo to reproduce the First-Explore paper results☆39Dec 25, 2024Updated last year
- RL training scripts for learning an agent using ProcTHOR.☆37Feb 18, 2025Updated last year
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- SineKAN: Kolmogorov-Arnold Networks Using Sinusoidal Activation Functions☆15Dec 19, 2024Updated last year
- ☆11Nov 8, 2023Updated 2 years ago
- ProxyExplainer for Graph Neural Networks☆15Oct 24, 2024Updated last year
- ☆10Oct 11, 2022Updated 3 years ago
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"☆11Aug 10, 2023Updated 2 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- ICLR'2024: Learning Multi-Agent Communication from Graph Modeling Perspective☆53Mar 11, 2024Updated last year
- The official code of TACL 2022, "Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition".☆11Oct 18, 2021Updated 4 years ago
- Multi-Agent Reinforcement Learning for Drones☆17May 14, 2022Updated 3 years ago
- An implementation of the maxflow algorithm by Yuri Boykov and Vladimir Kolmogorov.☆12Nov 26, 2014Updated 11 years ago
- ☆12Oct 24, 2025Updated 4 months ago
- VS Code Clinical Quality Language Extension☆11Updated this week
- word2vec java版本的一个实现☆10Apr 24, 2016Updated 9 years ago
- DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models☆13Nov 2, 2023Updated 2 years ago
- A simple agent powered by LLMs that performs tasks.☆13Apr 25, 2025Updated 10 months ago
- Repo for the walking robot's vision based navigation code☆10Jun 6, 2023Updated 2 years ago
- ☆10Jun 11, 2023Updated 2 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- Code for PolyTask: Learning Unified Policies through Behavior Distillation☆12Oct 13, 2023Updated 2 years ago
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆19Feb 7, 2025Updated last year
- A Lean formalisation of Maryna Viazovska's Fields Medal-winning solution to the sphere packing problem in dimension 8 and 24.☆50Updated this week
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated last year
- Dataset and code to reproduce the results of the paper "Evolving Structures in Complex Systems"☆11Dec 16, 2019Updated 6 years ago
- Repository for (for now) filing bug reports about PLAI.☆14Jul 5, 2025Updated 8 months ago
- [ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks☆30Feb 5, 2026Updated last month
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023.☆10Nov 12, 2023Updated 2 years ago
- Moral Machine Experiment on LLMs☆11Mar 2, 2026Updated last week
- All the tools that allow me to never ever open up Final Cut☆11Feb 16, 2025Updated last year
- Ma thesis @usyd☆10May 14, 2021Updated 4 years ago
- TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning☆11Oct 18, 2022Updated 3 years ago
- ☆11Sep 8, 2024Updated last year