google-research/policy-learning-landscape

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-research/policy-learning-landscape)

google-research / policy-learning-landscape

Explore the optimization landscape for direct policy learning reinforcement learning.

☆51

Alternatives and similar repositories for policy-learning-landscape

Users that are interested in policy-learning-landscape are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / reward-estimator-corl
View on GitHub
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆23Oct 26, 2018Updated 7 years ago
zafarali / policy-gradient-methods
View on GitHub
Modular PyTorch implementation of policy gradient methods
☆24Nov 15, 2018Updated 7 years ago
zafarali / emdp
View on GitHub
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49Apr 1, 2022Updated 4 years ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
ShibiHe / Q-Optimality-Tightening
View on GitHub
This is my implementation of the Optimality Tightening
☆37Apr 26, 2017Updated 9 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
supratikp / HOOF
View on GitHub
Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583
☆19Oct 22, 2019Updated 6 years ago
zamlz / dlcampjeju2018-I2A-cube
View on GitHub
Applying Imagination-Augmented Agents for Deep Reinforcement Learning to the Rubik's Cube
☆16Jul 26, 2018Updated 7 years ago
camillol / MacTorcs
View on GitHub
Mac port of Torcs, The Open Racing Car Simulator
☆11Jun 16, 2010Updated 16 years ago
XiaoxiaoGuo / atari_uct
View on GitHub
Upper Confidence Tree Planner for ATARI games
☆19Mar 9, 2016Updated 10 years ago
brain-research / mirage-rl
View on GitHub
Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.
☆17Aug 2, 2018Updated 7 years ago
facebookresearch / slbo
View on GitHub
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆94Sep 13, 2019Updated 6 years ago
illidanlab / cdrl
View on GitHub
Collaborative Deep Reinforcement Learning
☆32Jul 29, 2017Updated 8 years ago
evgenii-nikishin / omd
View on GitHub
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
☆43Jun 14, 2021Updated 5 years ago
montrealrobotics / iv_rl
View on GitHub
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆40Jul 18, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lcalem / reproduction-soft-qlearning-mutual-information
View on GitHub
Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.
☆10Jan 10, 2019Updated 7 years ago
Breakend / DeepReinforcementLearningThatMatters
View on GitHub
Accompanying code for "Deep Reinforcement Learning that Matters"
☆154Sep 22, 2017Updated 8 years ago
shamanez / VUSFA-Variational-Universal-Successor-Features-Approximator
View on GitHub
This repository contains implementations of the paper VUSFA
☆14Mar 31, 2021Updated 5 years ago
uber-research / ape-x
View on GitHub
This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"
☆190Mar 18, 2019Updated 7 years ago
veronicachelu / temporal_abstraction
View on GitHub
Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…
☆24Nov 29, 2018Updated 7 years ago
DuaneNielsen / rnd
View on GitHub
Exploration by Random Network Distillation
☆15Dec 30, 2018Updated 7 years ago
astier / model-free-episodic-control
View on GitHub
Model-Free-Episodic-Control implementation.
☆17Jun 3, 2019Updated 7 years ago
ermongroup / CalibratedModelBasedRL
View on GitHub
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆54May 15, 2019Updated 7 years ago
cshenton / auto-encoding-variational-bayes
View on GitHub
Replication of "Auto-Encoding Variational Bayes" (Kingma & Welling, 2013)
☆20Mar 8, 2018Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
oxwhirl / treeqn
View on GitHub
☆93Nov 15, 2019Updated 6 years ago
higgsfield / Imagination-Augmented-Agents
View on GitHub
Building Agents with Imagination: pytorch step-by-step implementation
☆213Feb 22, 2019Updated 7 years ago
ling-pan / RES
View on GitHub
☆25Feb 21, 2022Updated 4 years ago
ruizhaogit / mep
View on GitHub
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24May 30, 2019Updated 7 years ago
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated 2 years ago
alexrutar / banditvis
View on GitHub
A Python 3 Bandit Visualization Package
☆11Oct 16, 2017Updated 8 years ago
dibyaghosh / dnc
View on GitHub
Code for "Divide-and-Conquer Reinforcement Learning"
☆63Jan 8, 2019Updated 7 years ago
koulanurag / dream-and-search
View on GitHub
Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"
☆12Jul 12, 2021Updated 5 years ago
sparisi / td-reg
View on GitHub
TD-Regularized Actor-Critic Methods
☆37Dec 26, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kevindegila / flask-joey
View on GitHub
A Simple Flask App to interact with your Machine Translation Model
☆13Feb 26, 2020Updated 6 years ago
aviralkumar2907 / BEAR
View on GitHub
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆164Jul 17, 2020Updated 6 years ago
citizenhicks / openai_DDDQN
View on GitHub
☆14Mar 24, 2021Updated 5 years ago
yudasong / briee
View on GitHub
Representation Learning in RL
☆13Jun 1, 2022Updated 4 years ago
akhilthomas17 / reinforced_visual_slam
View on GitHub
Reinforcement for state of the art visual slam algorithms with Deep Learning based solutions. Part of my master thesis at University of F…
☆10Jul 29, 2018Updated 7 years ago
verlab / SceneUnderstanding_CIARP_2017
View on GitHub
☆12Apr 23, 2018Updated 8 years ago
mabirck / AttentionTRL
View on GitHub
Attentional Mechanism incorporated in Asynchronous Advantage Actor Critic a3c/a2c deep mind
☆10Jan 9, 2018Updated 8 years ago