Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.
☆37Oct 14, 2020Updated 5 years ago
Alternatives and similar repositories for Regularized-GradientTD
Users that are interested in Regularized-GradientTD are comparing it to the libraries listed below
Sorting:
- ☆23Nov 9, 2021Updated 4 years ago
- Experiment utility code, specifically designed for use with Compute Canada.☆11Jan 27, 2025Updated last year
- Reinforcement learning algorithms☆41Feb 27, 2019Updated 7 years ago
- ☆13May 30, 2019Updated 6 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- ☆27Mar 11, 2025Updated 11 months ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- ☆10Apr 24, 2021Updated 4 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld☆13Jul 13, 2020Updated 5 years ago
- This repository contains the code used in the paper Evaluating the Performance of Reinformcent Learning Algorithms☆27Aug 14, 2021Updated 4 years ago
- ☆15Jun 1, 2023Updated 2 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Apr 1, 2022Updated 3 years ago
- A Python Toolkit for Managing a Large Number of Experiments☆31Feb 9, 2024Updated 2 years ago
- Retrieve information from DBLP and update BibTex files automatically☆54Jun 4, 2022Updated 3 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆103Mar 24, 2023Updated 2 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- ☆15Apr 5, 2023Updated 2 years ago
- An efficient remote-onboard architecture for real-time Reinforcement Learning☆18Jun 28, 2024Updated last year
- Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023☆21Nov 4, 2024Updated last year
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.☆19Apr 3, 2018Updated 7 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Jul 18, 2025Updated 7 months ago
- ☆328Dec 19, 2024Updated last year
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Randomized Value Functions via Multiplicative Normalizing Flows☆17Jan 1, 2023Updated 3 years ago
- Implementation of Russo and Van Roy work on Information Directed Sampling (2017)☆21Jan 18, 2019Updated 7 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- krazy grid world☆25Mar 2, 2020Updated 6 years ago
- Unified notation for Markov Decision Processes PO(MDP)s☆24Apr 27, 2018Updated 7 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Jan 12, 2019Updated 7 years ago
- Extreme Learning Machine implementation in Python☆45Jun 22, 2014Updated 11 years ago
- Solving The Malmo Collaborative AI Challenge☆59Jul 23, 2017Updated 8 years ago
- Neuro-evolution for OpenAI Gym environments☆59Feb 26, 2021Updated 5 years ago
- Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning".☆27Jul 6, 2023Updated 2 years ago
- ☆32Mar 19, 2024Updated last year
- some common TD Learning algorithms☆66Mar 6, 2020Updated 5 years ago