danijar / elementsLinks
Building blocks for productive research
☆67Updated 3 weeks ago
Alternatives and similar repositories for elements
Users that are interested in elements are comparing it to the libraries listed below
Sorting:
- General Modules for JAX☆72Updated 4 months ago
- Fast reinforcement learning research☆61Updated last year
- PyTorch Package For Quasimetric Learning☆45Updated last year
- ☆58Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Updated last year
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- Sandbox environment for generalizable agent research☆27Updated 3 years ago
- ☆46Updated last year
- GPT implementation in Flax☆18Updated 4 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Updated 3 years ago
- Accelerated replay buffers in JAX☆46Updated 3 years ago
- Learning Robust Dynamics Through Variational Sparse Gating☆20Updated 3 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆74Updated last year
- Conservative Q learning in Jax☆57Updated 3 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆41Updated last year
- An implementation of MuZero in JAX.☆57Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Updated 3 years ago
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…☆70Updated 2 years ago
- ☆52Updated 3 years ago
- ☆55Updated 2 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆87Updated 2 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆22Updated last year
- Vectorization techniques for fast population-based training.☆57Updated 3 years ago
- Reinforcement Learning via Supervised Learning☆72Updated 3 years ago
- Implementations of Temporal Difference InfoNCE (TD InfoNCE)☆33Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆57Updated last year
- JAX implementations of core Deep RL algorithms☆83Updated 3 years ago
- Corax: Core RL in JAX☆38Updated last year