xuanlinli17/iclr2021_rlreg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xuanlinli17/iclr2021_rlreg)

xuanlinli17 / iclr2021_rlreg

Regularization Matters in Policy Optimization

☆21

Alternatives and similar repositories for iclr2021_rlreg

Users that are interested in iclr2021_rlreg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rlseminar / rlseminar.github.io
View on GitHub
Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.
☆21Nov 17, 2023Updated 2 years ago
nnaisense / MAGE
View on GitHub
Learning Action-Value Gradients in Model-based Policy Optimization
☆32Sep 7, 2021Updated 4 years ago
koulanurag / dream-and-search
View on GitHub
Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"
☆12Jul 12, 2021Updated 5 years ago
LeungSamWai / Drop-Activation
View on GitHub
The official implementation of paper "Drop-Activation: Implicit Parameter Reduction and Harmonious Regularization".
☆10May 30, 2019Updated 7 years ago
zzyunzhi / asynch-mb
View on GitHub
(CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning
☆14Dec 27, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mcgillmrl / robot_learning
View on GitHub
ROS package for robot learning
☆17Oct 16, 2019Updated 6 years ago
pairlab / d2rl
View on GitHub
Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"
☆40Jan 22, 2021Updated 5 years ago
ketatam / Exploring-Munchausen-Reinforcement-Learning
View on GitHub
PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces
☆15Oct 3, 2021Updated 4 years ago
stanford-iprl-lab / GRAC
View on GitHub
implementation of our self-guided and self-regularized actor-critic algorithm
☆29Jan 1, 2023Updated 3 years ago
robintyh1 / onpolicybaselines
View on GitHub
on-policy optimization baselines for deep reinforcement learning
☆32Apr 3, 2020Updated 6 years ago
gbup-group / EAN-efficient-attention-network
View on GitHub
The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.
☆20Jun 16, 2023Updated 3 years ago
Qrange-group / Mirror-Gradient
View on GitHub
WWW'24, Mirror Gradient (MG) makes multimodal recommendation models approach flat local minima easier compared to models with normal trai…
☆17Nov 1, 2024Updated last year
leor-c / REM
View on GitHub
Improving Token-Based World Models with Parallel Observation Prediction (ICML 2024)
☆14Feb 23, 2026Updated 5 months ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
microsoft / conservative-uncertainty-estimation-random-priors
View on GitHub
Source code for paper Conservative Uncertainty Estimation By Fitting Prior Networks (ICLR 2020)
☆22Nov 28, 2022Updated 3 years ago
YyzHarry / SV-RL
View on GitHub
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34Feb 1, 2020Updated 6 years ago
qgallouedec / lge
View on GitHub
☆33Mar 19, 2024Updated 2 years ago
facebookresearch / reward-estimator-corl
View on GitHub
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆23Oct 26, 2018Updated 7 years ago
kaixin96 / mixreg
View on GitHub
Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization
☆34Oct 22, 2020Updated 5 years ago
hsvgbkhgbv / Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learning
View on GitHub
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning
☆10Dec 10, 2018Updated 7 years ago
Santara / stochastic_value_gradient
View on GitHub
Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]
☆25Jan 15, 2022Updated 4 years ago
sparisi / td-reg
View on GitHub
TD-Regularized Actor-Critic Methods
☆37Dec 26, 2019Updated 6 years ago
nhynes / abc
View on GitHub
SeqGAN but with more bells and whistles
☆24Feb 15, 2018Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
simpler-env / ManiSkill2_real2sim
View on GitHub
SAPIEN ManiSkill2 environments for Real2Sim manipulation policy evaluation
☆20Oct 19, 2024Updated last year
johanobandoc / revisiting_rainbow
View on GitHub
Revisiting Rainbow
☆76Jun 9, 2021Updated 5 years ago
RonanFR / UCRL
View on GitHub
☆27May 17, 2019Updated 7 years ago
basilevh / dissecting-image-crops
View on GitHub
When can you tell whether an image has been cropped or not?
☆29Sep 19, 2021Updated 4 years ago
misterdev / flatland-marl
View on GitHub
Flatland Multi Agent Reinforcement Learning
☆16Aug 1, 2020Updated 5 years ago
google-research / episodic-curiosity
View on GitHub
Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability
☆205Oct 2, 2020Updated 5 years ago
LeungSamWai / Finite-expression-method
View on GitHub
☆21Feb 16, 2026Updated 5 months ago
Universal-Control / ppt_learning
View on GitHub
A unified robotic manipulation learning framework
☆24Sep 4, 2025Updated 10 months ago
tmoer / multimodal_varinf
View on GitHub
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆35May 24, 2018Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
uber-research / D3G
View on GitHub
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
wataruhashimoto52 / svgd_tf
View on GitHub
Implementation of Stein Variational Gradient Descent with TensorFlow 2.0
☆12Sep 11, 2019Updated 6 years ago
thiagopbueno / tf-mdp
View on GitHub
Probabilistic planning in continuous state-action MDPs in TensorFlow.
☆13Jun 21, 2022Updated 4 years ago
BY571 / D4PG
View on GitHub
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…
☆24Apr 7, 2021Updated 5 years ago
flatwhatson / doom.d
View on GitHub
No Rest for the Living
☆13Nov 13, 2022Updated 3 years ago
jmichaux / intrinsic-motivation
View on GitHub
Using multiple sensor modalities to improve exploration for robotic manipulation tasks with sparse rewards
☆10Sep 17, 2019Updated 6 years ago
alexlee-gk / slac
View on GitHub
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
☆154Oct 26, 2020Updated 5 years ago