jys5609 / MC-LAVE-RL
ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"
☆30Updated 9 months ago
Related projects: ⓘ
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆64Updated 2 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆49Updated 8 months ago
- Clean, extensible implementation of MACAW [ICML 2021]☆10Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆51Updated 5 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆37Updated 2 months ago
- On-Policy Policy Gradient Algorithms in JAX☆20Updated 7 months ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆17Updated last month
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆49Updated 8 months ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆40Updated 3 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 2 months ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆42Updated 2 years ago
- Official code repository for Prompt-DT.☆93Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆78Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆49Updated 11 months ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆51Updated 3 years ago
- ☆17Updated last year
- Official PyTorch implementation of "Discovering Hierarchical Achievements in Reinforcement Learning via Contrastive Learning" (NeurIPS 20…☆27Updated last month
- ☆46Updated last year
- Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.☆22Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆100Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆22Updated 8 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆65Updated 5 months ago
- ☆65Updated 2 months ago
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆12Updated 7 months ago
- ☆21Updated 8 months ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆32Updated 11 months ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆16Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆71Updated 10 months ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago