jity16 / ACE-Off-Policy-Actor-Critic-with-Causality-Aware-Entropy-Regularization
Official PyTorch implementation of "ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization"
☆18 · Updated 4 months ago
Related projects:
- [ICML'2023] "AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners" ☆45 · Updated 10 months ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements … ☆52 · Updated 3 months ago
- Code for the Behavior Retrieval paper ☆29 · Updated last year
- Codebase for the paper "Reparameterized Policy Learning for Multimodal Trajectory Optimization" ☆24 · Updated last year
- Skill-based Model-based Reinforcement Learning (CoRL 2022) ☆50 · Updated last year
- Official release of CompoSuite, a compositional RL benchmark ☆44 · Updated 7 months ago
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning ☆25 · Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making" (NeurIPS 2022) ☆43 · Updated 6 months ago
- Code for "DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks" ☆14 · Updated 4 months ago
- Official implementation of "Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning" (ICLR'24) ☆20 · Updated 3 weeks ago
- ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Man… ☆15 · Updated last year
- Official repository for the paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022) ☆34 · Updated 11 months ago
- Code for the ICLR 2022 paper "Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL" ☆26 · Updated 2 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆71 · Updated 9 months ago
- Code for "Planning Goals for Exploration" (ICLR 2023 Spotlight). An unsupervised RL agent for hard exploration tasks. ☆71 · Updated 4 months ago
- Code accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024). ☆36 · Updated 7 months ago
- Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning". ☆37 · Updated 5 months ago
- Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation (BUDS) ☆43 · Updated 2 years ago
- Chain-of-Thought Predictive Control ☆54 · Updated last year
- Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation (2023) ☆26 · Updated last year
- Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization (CoRL 2021) ☆28 · Updated 2 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆87 · Updated 3 months ago
- Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021) ☆22 · Updated 2 years ago
- Code for NeurIPS 2023 paper "Active Vision Reinforcement Learning with Limited Visual Observability" ☆48 · Updated 6 months ago
- From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data ☆48 · Updated last year
- KitchenShift: Evaluating Zero-Shot Generalization of Imitation-Based Policy Learning Under Domain Shifts ☆13 · Updated 2 years ago