twitter-research / hyperbolic-rlLinks
☆55Updated 2 years ago
Alternatives and similar repositories for hyperbolic-rl
Users that are interested in hyperbolic-rl are comparing it to the libraries listed below
Sorting:
- ☆44Updated 9 months ago
- PyTorch Package For Quasimetric Learning☆42Updated 8 months ago
- ☆46Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- ☆31Updated 4 years ago
- General Modules for JAX☆66Updated 3 months ago
- ☆54Updated 8 months ago
- Building blocks for productive research☆59Updated 5 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆105Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆73Updated 10 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 8 months ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated last year
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆82Updated 3 years ago
- Modular Single-file Reinfocement Learning Algorithms Library☆38Updated 2 years ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆28Updated 2 years ago
- ☆31Updated last year
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆107Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆57Updated 2 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆80Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- Baselines for gymnax 🤖☆67Updated 2 years ago
- Reinforcement Learning via Supervised Learning☆71Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆145Updated 2 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated 2 years ago