junhyukoh / self-imitation-learning
ICML 2018 Self-Imitation Learning
☆278 · Apr 18, 2020 · Updated 5 years ago
Alternatives and similar repositories for self-imitation-learning
Users who are interested in self-imitation-learning are comparing it to the libraries listed below.
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning. ☆67 · Nov 4, 2018 · Updated 7 years ago
- [ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration ☆203 · May 30, 2018 · Updated 7 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆435 · Nov 28, 2023 · Updated 2 years ago
- NIPS 2017 Value Prediction Network ☆167 · Jan 12, 2018 · Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters" ☆155 · Sep 22, 2017 · Updated 8 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration" ☆206 · Nov 22, 2018 · Updated 7 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019) ☆20 · Nov 26, 2020 · Updated 5 years ago
- Code for hierarchical imitation learning and reinforcement learning ☆301 · Mar 14, 2018 · Updated 7 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay" ☆190 · Mar 18, 2019 · Updated 6 years ago
- Code for the paper "Evolved Policy Gradients" ☆253 · Nov 22, 2018 · Updated 7 years ago
- Code for the paper "Generative Adversarial Imitation Learning" ☆729 · Nov 22, 2018 · Updated 7 years ago
- Efficient Exploration via State Marginal Matching (2019) ☆69 · Jun 30, 2019 · Updated 6 years ago
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures. ☆1,019 · Mar 13, 2019 · Updated 6 years ago
- Implementation of PPO in Pytorch ☆41 · Dec 6, 2017 · Updated 8 years ago
- An official TensorFlow implementation of "Neural Program Synthesis from Diverse Demonstration Videos" (ICML 2018) by Shao-Hua Sun, Hyeonw… ☆102 · Mar 24, 2023 · Updated 2 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me… ☆246 · Sep 30, 2022 · Updated 3 years ago
- Code for "One-Shot Visual Imitation Learning via Meta-Learning" ☆292 · Oct 8, 2018 · Updated 7 years ago
- Publicly releasable baselines for the Retro contest ☆129 · Nov 22, 2018 · Updated 7 years ago
- Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch ☆875 · Dec 27, 2022 · Updated 3 years ago
- Code for the paper "Exploration by Random Network Distillation" ☆931 · Oct 1, 2020 · Updated 5 years ago
- Building Agents with Imagination: pytorch step-by-step implementation ☆209 · Feb 22, 2019 · Updated 6 years ago
- SplitNet implemented based on ResNet-50 trained on ImageNet-22K ☆16 · Jun 18, 2018 · Updated 7 years ago
- ☆62 · Jun 22, 2018 · Updated 7 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity" ☆117 · Dec 13, 2019 · Updated 6 years ago
- Noisy Networks for Exploration ☆187 · Jan 28, 2018 · Updated 8 years ago
- Implementation of Random Expert Distillation ☆29 · May 11, 2019 · Updated 6 years ago
- PyTorch implementation of Memory Augmented Self-Play ☆52 · Oct 26, 2020 · Updated 5 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction ☆80 · Jan 5, 2019 · Updated 7 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments" ☆309 · Apr 13, 2023 · Updated 2 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package ☆268 · Oct 24, 2019 · Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings ☆96 · Jun 8, 2018 · Updated 7 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics ☆374 · Oct 15, 2021 · Updated 4 years ago
- Soft Actor-Critic ☆1,206 · Nov 29, 2023 · Updated 2 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning ☆1,471 · Dec 7, 2022 · Updated 3 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom. ☆225 · Mar 29, 2017 · Updated 8 years ago
- Implementation of "Action-Conditional Video Prediction using Deep Networks in Atari Games" ☆114 · Feb 8, 2016 · Updated 10 years ago
- Inferring beliefs about dynamics from behavior ☆30 · May 24, 2018 · Updated 7 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard ☆102 · Jun 18, 2019 · Updated 6 years ago