google-deepmind / nao_top10
☆17 Updated last year
Related projects
Alternatives and complementary repositories for nao_top10
- GPT implementation in Flax ☆18 Updated 2 years ago
- ☆17 Updated 5 months ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 2 years ago
- Accelerated replay buffers in JAX ☆40 Updated 2 years ago
- flexible meta-learning in jax ☆12 Updated last year
- ☆28 Updated 2 years ago
- An implementation of MuZero in JAX. ☆53 Updated 2 years ago
- ☆41 Updated last month
- Reinforcement Learning inside a 3D soccer simulation ☆24 Updated 2 months ago
- ☆13 Updated 4 months ago
- ☆29 Updated 2 years ago
- Simple JAX Graphics Library. ☆23 Updated 2 weeks ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 6 months ago
- Repo to reproduce the First-Explore paper results ☆36 Updated 2 weeks ago
- Vectorization techniques for fast population-based training. ☆54 Updated 2 years ago
- ☆42 Updated 2 years ago
- Generalised UDRL ☆37 Updated 2 years ago
- Baselines for gymnax 🤖 ☆60 Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆105 Updated 2 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Updated last year
- General Modules for JAX ☆58 Updated 3 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 2 years ago
- An Open-Ended Agentic Simulator ☆28 Updated 3 months ago
- Generative cellular automaton-like learning environments for RL. ☆19 Updated last month
- A web based platform for collecting human actions in reinforcement learning environments ☆27 Updated last year
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning ☆11 Updated 4 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆14 Updated 3 weeks ago
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong … ☆10 Updated 3 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆49 Updated 2 years ago
- A2C is a special case of PPO! ☆19 Updated 2 years ago