jbkjr / train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆14Updated 11 months ago
Alternatives and similar repositories for train-procgen-pytorch:
Users that are interested in train-procgen-pytorch are comparing it to the libraries listed below
- ☆19Updated 2 years ago
- Interpreting how transformers simulate agents performing RL tasks☆79Updated last year
- ☆28Updated 2 years ago
- ☆31Updated 2 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Scaling scaling laws with board games.☆48Updated last year
- ☆37Updated 8 months ago
- Redwood Research's transformer interpretability tools☆14Updated 3 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆31Updated 3 years ago
- An environment for learning formal mathematical reasoning from scratch☆66Updated 8 months ago
- ☆36Updated last year
- ☆51Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 7 months ago
- An environment for benchmarking commonsense agents☆28Updated 4 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 10 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆54Updated 2 years ago
- ☆53Updated 5 months ago
- A collection of meta-learning algorithms in Jax☆23Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Updated 2 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- General Modules for JAX☆64Updated last week
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Baselines for gymnax 🤖☆66Updated 2 years ago
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Adaptive Subgoal Search☆19Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration☆32Updated 4 years ago
- Code for the paper "Understanding RL Vision"☆46Updated 2 years ago