google-deepmind / enn_acme
☆31 Updated 2 years ago
Alternatives and similar repositories for enn_acme
Users who are interested in enn_acme are comparing it to the libraries listed below.
- ☆19 Updated 2 years ago
- Generalised UDRL ☆37 Updated 3 years ago
- An implementation of MuZero in JAX. ☆56 Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms ☆46 Updated 4 years ago
- ☆44 Updated 10 months ago
- ☆28 Updated 3 years ago
- ☆36 Updated 2 years ago
- ☆54 Updated 9 months ago
- ☆20 Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning ☆17 Updated 2 years ago
- ☆22 Updated 4 months ago
- General Modules for JAX ☆66 Updated 4 months ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆105 Updated 3 years ago
- Accelerated replay buffers in JAX ☆43 Updated 2 years ago
- Sandbox environment for generalizable agent research ☆26 Updated 2 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- ☆13 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆114 Updated 11 months ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆22 Updated 4 years ago
- Vectorization techniques for fast population-based training. ☆56 Updated 2 years ago
- ☆17 Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆44 Updated last year
- Representation Learning in RL ☆14 Updated 3 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions ☆68 Updated 11 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆57 Updated 3 years ago
- ☆54 Updated 3 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆11 Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster. ☆21 Updated last year
- ☆55 Updated 2 years ago