upiterbarg / hihack
[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)
☆10Updated last year
Alternatives and similar repositories for hihack:
Users that are interested in hihack are comparing it to the libraries listed below
- Learning for effective and efficient bilevel planning☆108Updated this week
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆85Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆100Updated 11 months ago
- Code for 'Mapping State Space using Landmarks for Universal Goal Reaching'.☆16Updated last year
- ☆119Updated 4 years ago
- Official release of CompoSuite, a compositional RL benchmark☆47Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆80Updated last year
- Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation (BUDS)☆48Updated 3 years ago
- ☆47Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆68Updated last year
- ☆56Updated 2 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆80Updated 2 years ago
- Conservative Q learning in Jax☆53Updated 2 years ago
- ☆38Updated 3 years ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated last year
- Reinforcement Learning via Supervised Learning☆71Updated 2 years ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆73Updated 11 months ago
- ☆84Updated 3 years ago
- Change-Based Exploration Transfer☆36Updated 3 years ago
- ☆16Updated last year
- ☆70Updated 2 years ago
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆23Updated last year
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated last year
- ☆22Updated 2 years ago
- ☆24Updated 10 months ago
- A benchmark for offline goal-conditioned RL and offline RL☆157Updated 3 weeks ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆43Updated last year