ml-jku / rudder
RUDDER: Return Decomposition for Delayed Rewards
☆48Sep 17, 2020Updated 5 years ago
Alternatives and similar repositories for rudder
Users that are interested in rudder are comparing it to the libraries listed below
- A practical step-by-step guide to applying RUDDER☆35Nov 12, 2019Updated 6 years ago
- Code for demonstration example-task in RUDDER blog☆24May 19, 2020Updated 5 years ago
- Code to reproduce results on toy tasks and companion blog for the paper.☆22Jun 8, 2022Updated 3 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 2 years ago
- ☆10Apr 24, 2021Updated 4 years ago
- ☆12Dec 8, 2020Updated 5 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- neuronal message passing☆19Oct 30, 2018Updated 7 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Apr 28, 2021Updated 4 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Jun 15, 2018Updated 7 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Oct 22, 2019Updated 6 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Nov 21, 2023Updated 2 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- ☆22Mar 28, 2025Updated 10 months ago
- ☆21Dec 22, 2020Updated 5 years ago
- ☆39Oct 23, 2025Updated 3 months ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆25Mar 29, 2019Updated 6 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51May 26, 2021Updated 4 years ago
- MADDPG in Ray/RLlib☆24Jul 22, 2020Updated 5 years ago
- Gated Transformer Model for Computer Vision☆25Jul 11, 2021Updated 4 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Mar 6, 2025Updated 11 months ago
- Jupyter notebook exercises of "Reinforcement Learning: An Introduction", Richard S. Sutton and Andrew G. Barto☆27Jul 23, 2024Updated last year
- A library of probabilistic model based RL algorithms in pytorch☆107Apr 14, 2021Updated 4 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- ☆31Jan 16, 2023Updated 3 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆33Apr 14, 2021Updated 4 years ago
- ☆33Dec 8, 2022Updated 3 years ago
- NeurIPS 2022: Tree Mover’s Distance: Bridging Graph Metrics and Stability of Graph Neural Networks☆37Aug 4, 2023Updated 2 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- Intersection Management with iCACC theorem☆10Feb 7, 2020Updated 6 years ago
- TD-Regularized Actor-Critic Methods☆36Dec 26, 2019Updated 6 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆40Dec 8, 2022Updated 3 years ago
- Topic modelling and co-occurrence analysis of the bio-economy☆10Jul 17, 2017Updated 8 years ago
- ☆13Jul 20, 2023Updated 2 years ago
- Contextual Bandit Spectral Representation Learner☆12Oct 25, 2022Updated 3 years ago
- ☆11May 13, 2021Updated 4 years ago
- Faster access to Tesseract-OCR from Python☆13Jun 8, 2021Updated 4 years ago
- Sudoku solver in Golang☆10Sep 6, 2020Updated 5 years ago