twitter-research / hyperbolic-rl
☆55Updated 2 years ago
Alternatives and similar repositories for hyperbolic-rl:
Users that are interested in hyperbolic-rl are comparing it to the libraries listed below
- PyTorch Package For Quasimetric Learning☆41Updated 2 months ago
- ☆41Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Sandbox environment for generalizable agent research☆24Updated 2 years ago
- ☆29Updated 3 years ago
- ☆43Updated 4 months ago
- ☆29Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- ☆53Updated 2 months ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆13Updated 3 years ago
- General Modules for JAX☆62Updated 6 months ago
- Generalised UDRL☆37Updated 2 years ago
- An implementation of MuZero in JAX.☆54Updated 2 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆103Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- ☆28Updated 2 years ago
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- A collection of RL algorithms written in JAX.☆95Updated 2 years ago
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆13Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆38Updated 3 months ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated last year
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆15Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆66Updated 2 years ago
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆39Updated 6 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆54Updated 10 months ago
- Building blocks for productive research☆47Updated this week
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 8 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆29Updated 5 months ago