ml-jku / rudderView external linksLinks
RUDDER: Return Decomposition for Delayed Rewards
☆48Sep 17, 2020Updated 5 years ago
Alternatives and similar repositories for rudder
Users that are interested in rudder are comparing it to the libraries listed below
Sorting:
- A practical step-by-step guide to applying RUDDER☆35Nov 12, 2019Updated 6 years ago
- Code for demonstration example-task in RUDDER blog☆24May 19, 2020Updated 5 years ago
- Code to reproduce results on toy tasks and companion blog for the paper.☆22Jun 8, 2022Updated 3 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 2 years ago
- ☆10Apr 24, 2021Updated 4 years ago
- ☆12Dec 8, 2020Updated 5 years ago
- Reward Propagation using Graph Convolutional Networks☆13Jun 19, 2021Updated 4 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Jul 16, 2024Updated last year
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Apr 28, 2021Updated 4 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Oct 22, 2019Updated 6 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Jun 15, 2018Updated 7 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Nov 21, 2023Updated 2 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- A Library of MDP algorithms for Artificial Intelligence☆18Jul 16, 2019Updated 6 years ago
- ☆22Mar 28, 2025Updated 10 months ago
- Bayesian Reward Shaping Framework for Deep Reinforcement Learning☆25Mar 29, 2019Updated 6 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51May 26, 2021Updated 4 years ago
- Pytorch implementation of intrinsic curiosity module with proximal policy optimization☆55Dec 20, 2018Updated 7 years ago
- Gated Transformer Model for Computer Vision☆25Jul 11, 2021Updated 4 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆103Mar 6, 2025Updated 11 months ago
- A library of probabilistic model based RL algorithms in pytorch☆107Apr 14, 2021Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- ☆31Jan 16, 2023Updated 3 years ago
- Neural Networks for JAX☆84Sep 24, 2024Updated last year
- ☆33Dec 8, 2022Updated 3 years ago
- Repository for our ICML 2019 paper: Curiosity-Bottleneck☆34Nov 21, 2022Updated 3 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆33Apr 14, 2021Updated 4 years ago
- TD-Regularized Actor-Critic Methods☆36Dec 26, 2019Updated 6 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆40Dec 8, 2022Updated 3 years ago
- Intersection Management with iCACC theorm☆10Feb 7, 2020Updated 6 years ago
- Knowledge-Aware RL agents with Commonsense Reasoning☆79Mar 4, 2022Updated 3 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆35May 14, 2019Updated 6 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop☆40Aug 28, 2021Updated 4 years ago
- A metrics library for the JAX ecosystem☆40Mar 16, 2023Updated 2 years ago
- Article for Special Edition of Information: Machine Learning with Python☆14Jan 8, 2025Updated last year
- ☆11May 13, 2021Updated 4 years ago