floringogianu / atari-agents
Code and links for over 25,000 trained Atari agents
☆92Updated 3 weeks ago
Related projects:
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆102Updated 3 weeks ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆96Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆129Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆189Updated 2 weeks ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆87Updated last month
- A tool for aggregating and plotting MARL experiment data.☆57Updated 3 weeks ago
- JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- PAIRED in PyTorch 🔥☆56Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy☆78Updated last year
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆64Updated 10 months ago
- impact-driven-exploration☆125Updated 11 months ago
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- Benchmarking RL generalization in an interpretable way.☆128Updated 7 months ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆76Updated last year
- Object Centric Atari games☆43Updated this week
- PyTorch implementation of DreamerV2: Mastering Atari with Discrete World Models☆49Updated 2 years ago
- Baselines for gymnax 🤖☆57Updated last year
- Collection of RL Environments built using Madrona☆23Updated last year
- Learning diverse options through the Laplacian representation.☆22Updated 8 months ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆101Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆157Updated 2 years ago
- A categorised list of Multi-Agent Reinforcement Learning (MARL) papers☆46Updated last year
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆76Updated 2 years ago
- A collection of RL algorithms written in JAX.☆92Updated 2 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon"☆41Updated 2 months ago