upiterbarg / hihackLinks
[NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)
☆10Updated last year
Alternatives and similar repositories for hihack
Users that are interested in hihack are comparing it to the libraries listed below
Sorting:
- Learning for effective and efficient bilevel planning☆117Updated 2 weeks ago
- ☆46Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations☆82Updated last year
- ☆16Updated last year
- Goal-Conditioned Reinforcement Learning with JAX☆171Updated last month
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆103Updated last year
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆85Updated last year
- Simple maze environments using mujoco-py☆57Updated last year
- ☆47Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- ☆120Updated 5 years ago
- a simple and scalable agent for training adaptive policies with sequence-based RL☆128Updated 3 weeks ago
- ☆25Updated last year
- Conservative Q learning in Jax☆54Updated 2 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆70Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆85Updated 6 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆58Updated 8 months ago
- Change-Based Exploration Transfer☆35Updated 3 years ago
- Code for 'Mapping State Space using Landmarks for Universal Goal Reaching'.☆16Updated last year
- ☆39Updated 3 years ago
- Official release of CompoSuite, a compositional RL benchmark☆49Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Updated last year
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆64Updated last year
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆75Updated last year
- ☆15Updated last year
- Standalone library of frequently-used wrappers for dm_env environments.☆17Updated 11 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated last year
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆30Updated 4 years ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated last year
- ☆53Updated 3 years ago