floringogianu / atari-agents
Code and links for over 25,000 trained Atari agents
☆94Updated 7 months ago
Alternatives and similar repositories for atari-agents:
Users that are interested in atari-agents are comparing it to the libraries listed below
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆109Updated 7 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- PAIRED in PyTorch 🔥☆58Updated 2 years ago
- Baselines for gymnax 🤖☆66Updated last year
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆111Updated 3 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆150Updated last week
- impact-driven-exploration☆130Updated last year
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆70Updated last year
- ☆298Updated 3 months ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆115Updated 7 months ago
- Evaluating long-term memory of reinforcement learning algorithms☆141Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆160Updated 3 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Synchronized Curriculum Learning for RL Agents☆41Updated last week
- A tool for aggregating and plotting MARL experiment data.☆76Updated 2 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 10 months ago
- ☆74Updated 9 months ago
- ☆218Updated 4 months ago
- ☆44Updated last year
- Benchmarking RL generalization in an interpretable way.☆151Updated 3 weeks ago
- Nethack Learning Environment Wrapper for Language Interface☆36Updated last year
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆201Updated 3 years ago
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆62Updated last year
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆36Updated 2 years ago
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆46Updated 2 years ago