Official implementation of ICML paper Imitating Latent Policies from Observation
☆75May 13, 2019Updated 6 years ago
Alternatives and similar repositories for ILPO
Users that are interested in ILPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch official implementation for Imitating Unknown Policies via Exploration.☆14Oct 3, 2023Updated 2 years ago
- Implementation of Behavioral Cloning from Observationmentation☆16Nov 28, 2019Updated 6 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20May 7, 2025Updated 11 months ago
- Learning to Imitate Behaviors from Raw Video via Context Translation☆53Dec 11, 2017Updated 8 years ago
- [WIP] Playing Hard Exploration Games by Watching YouTube (Aytar et al., 2018)☆12Jan 31, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- MImE - Manipulation Imitation Environments☆14Feb 1, 2022Updated 4 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- This is a project for creating and using IL datasets based on HuggingFace weights with multithreads for performance, and benchmarking☆13Mar 10, 2026Updated last month
- Official code for "Task-Embedded Control Networks for Few-Shot Imitation Learning".☆46Nov 29, 2019Updated 6 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆19Mar 1, 2021Updated 5 years ago
- A collection of notebooks to show examples of using robosuite v1.0☆10Sep 6, 2020Updated 5 years ago
- Probabilistic inference for models of behaviour☆13Mar 5, 2026Updated last month
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code☆17Aug 23, 2024Updated last year
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Learning from Trajectories via Subgoal Discovery☆12Dec 10, 2020Updated 5 years ago
- Adversarial Imitation Via Variational Inverse Reinforcement Learning☆96Dec 30, 2019Updated 6 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"☆11Apr 8, 2025Updated last year
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆16May 28, 2025Updated 11 months ago
- ☆25Jan 2, 2019Updated 7 years ago
- Code for "One-Shot Visual Imitation Learning via Meta-Learning"☆290Oct 8, 2018Updated 7 years ago
- ☆330Dec 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Jan 27, 2026Updated 3 months ago
- ☆14Oct 7, 2022Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Bayesian model reduction for probabilistic machine learning☆11Jul 3, 2025Updated 10 months ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆26Oct 11, 2022Updated 3 years ago
- ☆28Jul 28, 2022Updated 3 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆84Apr 4, 2021Updated 5 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Jul 18, 2022Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆171Jun 23, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022)☆38Jun 3, 2023Updated 2 years ago
- Code for "Unsupervised State Representation Learning in Atari"☆257Nov 2, 2023Updated 2 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated last year
- Dynamic Algorithm Configuration☆21Jan 22, 2020Updated 6 years ago
- Official code for Slot-Transformer for Videos (STEVE)☆51Jan 9, 2023Updated 3 years ago
- https://sites.google.com/view/replab/☆25Mar 24, 2023Updated 3 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 8 years ago