ICML 2018 Self-Imitation Learning
☆277Apr 18, 2020Updated 5 years ago
Alternatives and similar repositories for self-imitation-learning
Users that are interested in self-imitation-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆67Nov 4, 2018Updated 7 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- [ICLR 2018] TensorFlow code for zero-shot visual imitation by self-supervised exploration☆203May 30, 2018Updated 7 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆436Nov 28, 2023Updated 2 years ago
- NIPS 2017 Value Prediction Network☆167Jan 12, 2018Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆207Nov 22, 2018Updated 7 years ago
- Code for the paper "Generative Adversarial Imitation Learning"☆731Nov 22, 2018Updated 7 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- Code for the paper "Evolved Policy Gradients"☆254Nov 22, 2018Updated 7 years ago
- Code for hierarchical imitation learning and reinforcement learning☆301Mar 14, 2018Updated 8 years ago
- A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.☆1,022Mar 13, 2019Updated 7 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Mar 18, 2019Updated 7 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Publicly releasable baselines for the Retro contest☆130Nov 22, 2018Updated 7 years ago
- Code for the paper "Exploration by Random Network Distillation"☆934Oct 1, 2020Updated 5 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"☆310Apr 13, 2023Updated 2 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆248Sep 30, 2022Updated 3 years ago
- Code for "One-Shot Visual Imitation Learning via Meta-Learning"☆291Oct 8, 2018Updated 7 years ago
- SplitNet implemented based on ResNet-50 trained on ImageNet-22K☆16Jun 18, 2018Updated 7 years ago
- An official TensorFlow implementation of "Neural Program Synthesis from Diverse Demonstration Videos" (ICML 2018) by Shao-Hua Sun, Hyeonw…☆102Mar 24, 2023Updated 3 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- ☆62Jun 22, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆80Jan 5, 2019Updated 7 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Implementation of PPO in Pytorch☆41Dec 6, 2017Updated 8 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆375Oct 15, 2021Updated 4 years ago
- Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch☆877Dec 27, 2022Updated 3 years ago
- Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings☆96Jun 8, 2018Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.☆224Mar 29, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Collection of reinforcement learning algorithms☆2,884Jun 17, 2024Updated last year
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆117Dec 13, 2019Updated 6 years ago
- Soft Actor-Critic☆1,230Nov 29, 2023Updated 2 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,473Dec 7, 2022Updated 3 years ago
- Building Agents with Imagination: pytorch step-by-step implementation☆211Feb 22, 2019Updated 7 years ago
- Code for the paper "Meta-Learning Shared Hierarchies"☆618Jul 6, 2023Updated 2 years ago
- Stein Variational Policy Gradient for REINFORCE☆18Jul 12, 2017Updated 8 years ago