johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75 · Jun 9, 2021 · Updated 4 years ago
Alternatives and similar repositories for revisiting_rainbow
Users who are interested in revisiting_rainbow compare it to the libraries listed below.
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331) ☆25 · Jun 20, 2021 · Updated 4 years ago
- Model-based reinforcement learning in TensorFlow ☆56 · Jul 27, 2021 · Updated 4 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆44 · Jun 14, 2021 · Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- A collection of RL algorithms written in JAX. ☆105 · Jul 5, 2022 · Updated 3 years ago
- Solving reinforcement learning tasks which require language and vision ☆33 · Apr 4, 2023 · Updated 2 years ago
- ☆327 · Dec 19, 2024 · Updated last year
- Ranking Policy Gradient ☆23 · Nov 27, 2019 · Updated 6 years ago
- A JAX implementation of stochastic addition. ☆14 · Aug 15, 2022 · Updated 3 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning ☆598 · Oct 28, 2020 · Updated 5 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆472 · Jul 6, 2023 · Updated 2 years ago
- A2C is a special case of PPO! ☆22 · May 20, 2022 · Updated 3 years ago
- ☆99 · Mar 24, 2023 · Updated 2 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Jun 24, 2020 · Updated 5 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces ☆15 · Oct 3, 2021 · Updated 4 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL ☆44 · Apr 28, 2021 · Updated 4 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆57 · Jan 7, 2026 · Updated last month
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆59 · Aug 4, 2022 · Updated 3 years ago
- (CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning ☆14 · Dec 27, 2022 · Updated 3 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆530 · Nov 22, 2022 · Updated 3 years ago
- Dream to Control: Learning Behaviors by Latent Imagination ☆579 · Sep 10, 2021 · Updated 4 years ago
- This project was moved to: https://github.com/coax-dev/coax ☆161 · Nov 28, 2022 · Updated 3 years ago
- Model-Free-Episodic-Control implementation. ☆17 · Jun 3, 2019 · Updated 6 years ago
- Port of pybullet envs to gymnasium ☆18 · Mar 4, 2025 · Updated 11 months ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds. ☆866 · Aug 12, 2024 · Updated last year
- DrQ: Data regularized Q ☆420 · Jan 13, 2023 · Updated 3 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 · Jan 16, 2019 · Updated 7 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Feb 21, 2020 · Updated 5 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers. ☆19 · May 14, 2019 · Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- GPT implementation in Flax ☆18 · Jan 8, 2022 · Updated 4 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games ☆559 · Jun 26, 2023 · Updated 2 years ago
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021) ☆54 · Jul 7, 2021 · Updated 4 years ago
- ICRL 2020 ☆20 · Feb 18, 2020 · Updated 5 years ago
- ☆80 · Oct 3, 2023 · Updated 2 years ago
- [Experimental] TensorFlow 2 version of stable-baselines, temporary repository ☆45 · Jan 25, 2020 · Updated 6 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆55 · May 15, 2019 · Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 · Sep 13, 2019 · Updated 6 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch. ☆19 · Jun 15, 2018 · Updated 7 years ago