RLAgent/state-marginal-matching

Efficient Exploration via State Marginal Matching (2019)

☆70

Alternatives and similar repositories for state-marginal-matching

Users that are interested in state-marginal-matching are comparing it to the libraries listed below.

ben-eysenbach / sac
Soft Actor-Critic
☆161 · Mar 13, 2018 · Updated 8 years ago
ruizhaogit / EnergyBasedPrioritization
Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)
☆35 · Nov 28, 2018 · Updated 7 years ago
junhyukoh / self-imitation-learning
ICML 2018 Self-Imitation Learning
☆277 · Apr 18, 2020 · Updated 6 years ago
tgangwani / SelfImitationDiverse
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
☆20 · Nov 26, 2020 · Updated 5 years ago
voot-t / guide-actor-critic
Keras implementation of guide actor-critic for continuous control
☆11 · Mar 12, 2018 · Updated 8 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 · Jun 24, 2020 · Updated 6 years ago
illidanlab / rpg
Ranking Policy Gradient
☆23 · Nov 27, 2019 · Updated 6 years ago
KamyarGh / rl_swiss
☆66 · May 25, 2020 · Updated 6 years ago
vub-ai-lab / bdpi
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25 · Sep 9, 2019 · Updated 6 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21 · Feb 24, 2023 · Updated 3 years ago
mcgillmrl / robot_learning
ROS package for robot learning
☆17 · Oct 16, 2019 · Updated 6 years ago
ben-eysenbach / mnm
Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"
☆21 · Oct 6, 2021 · Updated 4 years ago
orybkin / lexa-benchmark
☆42 · May 11, 2022 · Updated 4 years ago
dyne-submission / dynamics-aware-embeddings
☆16 · Sep 25, 2019 · Updated 6 years ago
seungyulhan / disc
☆10 · Aug 17, 2022 · Updated 3 years ago
RuohanW / RED
Implementation of Random Expert Distillation
☆29 · May 11, 2019 · Updated 7 years ago
youngwoon / transition
Official code for the paper "Learning Transition Policies for Composing Complex Skills" (ICLR 2019)
☆77 · Apr 29, 2019 · Updated 7 years ago
pathak22 / modular-assemblies
[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"
☆120 · Dec 13, 2019 · Updated 6 years ago
zchuning / repo
Resilient Model-Based RL by Regularizing Posterior Predictability
☆22 · Mar 4, 2024 · Updated 2 years ago
denisyarats / drq
DrQ: Data regularized Q
☆422 · Jan 13, 2023 · Updated 3 years ago
denisyarats / dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
☆229 · May 19, 2024 · Updated 2 years ago
alexlee-gk / slac
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
☆154 · Oct 26, 2020 · Updated 5 years ago
stepjam / BPP
Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning
☆27 · Feb 8, 2022 · Updated 4 years ago
MishaLaskin / curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
☆605 · Oct 28, 2020 · Updated 5 years ago
cjlovering / Towards-Interpretable-Reinforcement-Learning-Using-Attention-Augmented-Agents-Replication
☆22 · Oct 4, 2019 · Updated 6 years ago
zzyunzhi / vds
Code for Automatic Curriculum Learning through Value Disagreement
☆32 · Jun 15, 2020 · Updated 6 years ago
bhyang / replab
https://sites.google.com/view/replab/
☆25 · Mar 24, 2023 · Updated 3 years ago
haje01 / distper
Distributed Prioritized Experience Replay
☆10 · Aug 8, 2018 · Updated 7 years ago
pathak22 / exploration-by-disagreement
[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement
☆131 · Jun 11, 2019 · Updated 7 years ago
Coac / never-give-up
PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies
☆57 · Jan 22, 2021 · Updated 5 years ago
apexrl / bmpo
Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>
☆23 · Mar 24, 2023 · Updated 3 years ago
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆30 · Sep 24, 2019 · Updated 6 years ago
mugen-org / MUGEN_coinrun
A repository for the updated version of CoinRun used to collect MUGEN, a multimodal video-audio-text dataset. This repo contains scripts …
☆13 · Jul 13, 2022 · Updated 4 years ago
openai / EPG
Code for the paper "Evolved Policy Gradients"
☆253 · Nov 22, 2018 · Updated 7 years ago
avisingh599 / cog
[CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
☆35 · Oct 28, 2020 · Updated 5 years ago
rraileanu / idaac
☆55 · Feb 28, 2024 · Updated 2 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆164 · Jul 17, 2020 · Updated 6 years ago
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆46 · Oct 29, 2020 · Updated 5 years ago