rlai-lab/Regularized-GradientTD

Code repo for the paper "Gradient Temporal-Difference Learning with Regularized Corrections".

☆38

Alternatives and similar repositories for Regularized-GradientTD

Users who are interested in Regularized-GradientTD are comparing it to the libraries listed below.

sinaghiassian / OffpolicyAlgorithms
☆23 · Nov 9, 2021 · Updated 4 years ago
andnp / PyExpUtils
Experiment utility code, specifically designed for use with Compute Canada.
☆11 · Jan 27, 2025 · Updated last year
emmaajordan / EvaluationOfRLAlgs
This repository contains the code used in the paper "Evaluating the Performance of Reinforcement Learning Algorithms".
☆27 · Aug 14, 2021 · Updated 4 years ago
atavakol / action-hypergraph-networks
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23 · Jun 22, 2021 · Updated 5 years ago
andnp / rl-control-template
☆27 · Mar 11, 2025 · Updated last year
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49 · Apr 1, 2022 · Updated 4 years ago
rldotai / rl-algorithms
Reinforcement learning algorithms
☆41 · Feb 27, 2019 · Updated 7 years ago
pokaxpoka / netrand
Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020
☆57 · Apr 27, 2020 · Updated 6 years ago
mingen-pan / Reinforcement-Learning-Q-learning-Gridworld-Pytorch
A project that uses PyTorch to implement reinforcement learning on a simple game, Gridworld
☆14 · Jul 13, 2020 · Updated 6 years ago
AliKhalili / tensefinder
Python library for finding English tenses in sentences
☆12 · Jun 9, 2019 · Updated 7 years ago
mkschleg / GVFN
☆10 · Apr 24, 2021 · Updated 5 years ago
liziniu / HyperDQN
Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)
☆12 · Nov 28, 2023 · Updated 2 years ago
pierrelux / rlbook
A graduate-level introduction to reinforcement learning as a framework for modeling, optimization, and control, connecting dynamic models…
☆18 · Dec 9, 2025 · Updated 7 months ago
kenjyoung / MinAtar
☆333 · Dec 19, 2024 · Updated last year
keynans / HypeRL
Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)
☆26 · Jun 9, 2021 · Updated 5 years ago
abbyvansoest / maxent
☆14 · May 30, 2019 · Updated 7 years ago
makokal / MDPN
Unified notation for Markov Decision Processes PO(MDP)s
☆24 · Apr 27, 2018 · Updated 8 years ago
boschresearch / DD_OPG
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11 · Jun 12, 2019 · Updated 7 years ago
rltheorybook / rltheorybook.github.io
☆29 · Jun 27, 2026 · Updated 3 weeks ago
apexrl / autombpo
Implementation of the NeurIPS 2021 paper "On Effective Scheduling of Model-based Reinforcement Learning"
☆13 · Nov 16, 2021 · Updated 4 years ago
pomonam / NoisyNaturalGradient
TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".
☆61 · Jan 12, 2019 · Updated 7 years ago
sail-sg / offbench
☆16 · Jun 1, 2023 · Updated 3 years ago
Shallow-Updates-for-Deep-RL / Shallow_Updates_for_Deep_RL
Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"
☆18 · Nov 2, 2017 · Updated 8 years ago
robintyh1 / icml2021-pengqlambda
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15 · Jul 23, 2021 · Updated 5 years ago
marcharper / pyed
Computes trajectories for evolutionary dynamics.
☆15 · Oct 6, 2020 · Updated 5 years ago
qgallouedec / lge
☆33 · Mar 19, 2024 · Updated 2 years ago
ShangtongZhang / rebib
Retrieve information from DBLP and update BibTeX files automatically
☆53 · Jun 4, 2022 · Updated 4 years ago
rraileanu / auto-drac
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆104 · Mar 24, 2023 · Updated 3 years ago
thanhnguyentang / mmdrl
Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354
☆30 · Jul 14, 2021 · Updated 5 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40 · Jul 18, 2025 · Updated last year
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆30 · Sep 24, 2019 · Updated 6 years ago
manantomar / Mirror-Descent-Policy-Optimization
Mirror Descent Policy Optimization
☆43 · Oct 31, 2020 · Updated 5 years ago
brain-research / mirage-rl
Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.
☆17 · Aug 2, 2018 · Updated 7 years ago
chrodan / tdlearn
some common TD Learning algorithms
☆66 · Mar 6, 2020 · Updated 6 years ago
ermongroup / best-arm-delayed
Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018.
☆20 · Apr 3, 2018 · Updated 8 years ago
bstadie / krazyworld
krazy grid world
☆26 · Mar 2, 2020 · Updated 6 years ago
jind11 / utterance-rewriting
This repository releases the code and data for utterance rewriting in open-domain dialogues.
☆18 · Feb 24, 2023 · Updated 3 years ago
mlii / mvrl
Multi-view Reinforcement Learning
☆11 · Feb 9, 2020 · Updated 6 years ago
martinseilair / dm_control2gym
OpenAI Gym Wrapper for DeepMind Control Suite
☆74 · Nov 30, 2021 · Updated 4 years ago