PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN
☆44Oct 4, 2020Updated 5 years ago
Alternatives and similar repositories for Munchausen-RL
Users that are interested in Munchausen-RL are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF…☆34Oct 10, 2020Updated 5 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆92Mar 4, 2023Updated 3 years ago
- Distributed & asynchronous DQN implementation using gRPC and PyTorch.☆10Feb 15, 2021Updated 5 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- Alphazero on GPU thanks to CUDA.jl☆33Aug 30, 2021Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆27Jul 14, 2021Updated 4 years ago
- Evolution of Discrete data with Reinforcement Learning☆13Dec 8, 2019Updated 6 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- ☆13Feb 21, 2024Updated 2 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts.☆14Jun 22, 2021Updated 4 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning☆34Oct 28, 2020Updated 5 years ago
- The Intermediate Goal of the project is to train a GPT like architecture to learn to summarise reddit posts from human preferences, as th…☆12Jul 14, 2021Updated 4 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- This gets the data from the Chainlink Price feeds in an easy way. Can use either an RPC_URL or the Chainlink Subgraph.☆13May 23, 2021Updated 4 years ago
- ☆14Feb 14, 2020Updated 6 years ago
- ☆17Jun 4, 2021Updated 4 years ago
- soft q learning and soft actor critic☆16Dec 23, 2018Updated 7 years ago
- ☆18Sep 7, 2023Updated 2 years ago
- Submission code of UEFDRL team to NeurIPS 2019 MineRL challenge (5th place)☆13Nov 13, 2020Updated 5 years ago
- ☆18Feb 7, 2021Updated 5 years ago
- Power Systems environment for OpenAI Gym☆18Apr 2, 2019Updated 6 years ago
- AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation☆15Jun 22, 2020Updated 5 years ago
- The WaveFunctionCollapse algorithm in Julia.☆22Jan 2, 2019Updated 7 years ago
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆20Oct 5, 2021Updated 4 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)☆13Oct 8, 2018Updated 7 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆19May 14, 2019Updated 6 years ago
- Parameter-Space Saliency Maps for Explainability☆23Mar 21, 2023Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆16Nov 18, 2020Updated 5 years ago
- ♕ A web based and Deep-Reinforcement-Learning-powered open source chess game.☆17Feb 22, 2026Updated 2 weeks ago
- Smart grid pricing by reinforcement learning☆19Dec 19, 2018Updated 7 years ago
- A Reinforcement Learning / Neural Network library, written in Rust.☆21Mar 7, 2021Updated 5 years ago
- Wasserstein Distance guided Adversarial Imitation Learning (WDAIL) with Reward Shape Exploration☆18Feb 9, 2021Updated 5 years ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆25Jun 17, 2025Updated 8 months ago
- Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.☆17Feb 17, 2021Updated 5 years ago
- Code for the paper, "First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization"☆27Jul 5, 2022Updated 3 years ago