schmidtdominik / Rainbow
Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M frames.
☆44 · Dec 11, 2021 · Updated 4 years ago
Alternatives and similar repositories for Rainbow
Users who are interested in Rainbow are comparing it to the libraries listed below.
- Repository for ML Reproducibility Challenge 2020 for the NeurIPS paper, "The Value Equivalence Principle for Model-Based Reinforcement Le… · ☆18 · Apr 13, 2021 · Updated 4 years ago
- Creating fixed-length vectors to describe RL/GA policies · ☆20 · Oct 23, 2021 · Updated 4 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces · ☆15 · Oct 3, 2021 · Updated 4 years ago
- A video game made with PyGame turned into an OpenAI Gym learning environment for reinforcement learning agents. · ☆15 · Jan 3, 2023 · Updated 3 years ago
- A modular implementation of PPO, and soon hopefully other algorithms. · ☆26 · Jan 16, 2024 · Updated 2 years ago
- This is a miniature race car gym-env for RL from states (and images) · ☆28 · Nov 3, 2021 · Updated 4 years ago
- Variational Reinforcement Learning · ☆17 · Jul 25, 2024 · Updated last year
- Evaluating different engineering tricks that make RL work · ☆15 · Jun 3, 2021 · Updated 4 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning · ☆30 · May 26, 2020 · Updated 5 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum games · ☆21 · Jul 30, 2024 · Updated last year
- ☆19 · Jul 18, 2021 · Updated 4 years ago
- Code and links for over 25,000 trained Atari agents · ☆98 · Aug 22, 2024 · Updated last year
- ☆28 · Jan 11, 2021 · Updated 5 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP) · ☆26 · Oct 11, 2022 · Updated 3 years ago
- Codes for Evolving Plastic ANNs · ☆14 · Dec 18, 2022 · Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. · ☆176 · Nov 14, 2024 · Updated last year
- A custom OpenAI Gym environment for solo experimentation. · ☆12 · Apr 14, 2021 · Updated 4 years ago
- POPGym Library in JAX · ☆12 · Apr 15, 2024 · Updated last year
- A TF2.0 implementation of RL baselines. · ☆10 · Sep 24, 2021 · Updated 4 years ago
- Gym wrapper for pysc2 · ☆10 · Sep 16, 2022 · Updated 3 years ago
- A2C is a special case of PPO! · ☆22 · May 20, 2022 · Updated 3 years ago
- ☆26 · Apr 26, 2024 · Updated last year
- a modular reinforcement learning library with JAX agents · ☆27 · Mar 3, 2025 · Updated 11 months ago
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain · ☆11 · Jul 14, 2020 · Updated 5 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent. · ☆14 · May 25, 2023 · Updated 2 years ago
- Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning". · ☆13 · Jan 25, 2023 · Updated 3 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control" · ☆12 · Jul 12, 2021 · Updated 4 years ago
- This repository contains the code of the paper Equivariant Q Learning in Spatial Action Spaces · ☆11 · Nov 4, 2021 · Updated 4 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" · ☆46 · Sep 20, 2023 · Updated 2 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22). · ☆29 · Oct 25, 2022 · Updated 3 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber. · ☆78 · Aug 13, 2020 · Updated 5 years ago
- Toolkit of Causal Model-based Reinforcement Learning. · ☆33 · Jun 5, 2023 · Updated 2 years ago
- Procgen2: A community maintained fork of procgen · ☆12 · Aug 25, 2022 · Updated 3 years ago
- ☆14 · Jun 26, 2019 · Updated 6 years ago
- Docker containers of baseline agents for the Crafter environment · ☆30 · Dec 14, 2021 · Updated 4 years ago
- A PyTorch rewrite of the motion-imitation project copied from google-research · ☆10 · Sep 30, 2022 · Updated 3 years ago
- MuJoCo models for Unitree Robots · ☆12 · Nov 24, 2021 · Updated 4 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations · ☆49 · Apr 1, 2022 · Updated 3 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions · ☆30 · Jun 30, 2020 · Updated 5 years ago