upiterbarg / hihack
[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)
☆9Updated last year
Related projects ⓘ
Alternatives and complementary repositories for hihack
- ☆42Updated last year
- ☆107Updated 4 years ago
- OGBench: Benchmarking Offline Goal-Conditioned RL☆79Updated 3 weeks ago
- ☆23Updated 2 years ago
- Official release of CompoSuite, a compositional RL benchmark☆46Updated 9 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆95Updated 5 months ago
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆16Updated 5 months ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆21Updated 7 months ago
- Code for 'Mapping State Space using Landmarks for Universal Goal Reaching'.☆16Updated 10 months ago
- Learning for effective and efficient bilevel planning☆96Updated this week
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆31Updated last year
- Goal-Conditioned Reinforcement Learning with JAX☆94Updated this week
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆43Updated 10 months ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆78Updated 2 years ago
- Conservative Q learning in Jax☆51Updated last year
- Reinforcement Learning via Supervised Learning☆68Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆79Updated last year
- ☆53Updated 3 years ago
- Change-Based Exploration Transfer☆36Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆34Updated last year
- Code for the paper "Learning Options via Compression" at NeurIPS 2022☆22Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆63Updated last year
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"☆57Updated 5 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆59Updated last year
- Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation (BUDS)☆44Updated 2 years ago
- ☆69Updated 2 years ago
- ☆14Updated 7 months ago
- ☆52Updated last year
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆83Updated 9 months ago