Source code for "Variational Imitation Learning with Diverse-quality Demonstrations" (ICML 2020). This GitHub repository includes the Python code and datasets used in the experiments.
☆20 · Aug 16, 2021 · Updated 4 years ago
Alternatives and similar repositories for vild_code
Users that are interested in vild_code are comparing it to the libraries listed below
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)" ☆11 · Oct 2, 2018 · Updated 7 years ago
- [IJCAI 2021] Robust Adversarial Imitation Learning via Adaptively-Selected Demonstrations ☆16 · Feb 17, 2023 · Updated 3 years ago
- Multi-Modal Imitation Learning in Partially Observable Environments ☆13 · Sep 5, 2020 · Updated 5 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat… ☆51 · Dec 8, 2022 · Updated 3 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator" ☆12 · Nov 24, 2021 · Updated 4 years ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration" ☆49 · Jun 18, 2019 · Updated 6 years ago
- Learning Virtual Grasp with Failed Demonstrations via Bayesian Inverse Reinforcement Learning (IROS 2019) ☆14 · Nov 4, 2019 · Updated 6 years ago
- ICML'20: Intrinsic Reward Driven Imitation Learning via Generative Model ☆15 · Nov 5, 2021 · Updated 4 years ago
- Wasserstein Distance guided Adversarial Imitation Learning (WDAIL) with Reward Shape Exploration ☆18 · Feb 9, 2021 · Updated 5 years ago
- Code for Paper "State Alignment-based Imitation Learning". Under maintenance ☆17 · May 1, 2020 · Updated 5 years ago
- ☆21 · Dec 17, 2020 · Updated 5 years ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021. ☆27 · May 4, 2021 · Updated 4 years ago
- PyTorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020) ☆20 · Feb 29, 2020 · Updated 6 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆28 · Jun 20, 2019 · Updated 6 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF" ☆33 · Dec 14, 2023 · Updated 2 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO ☆31 · Jun 24, 2018 · Updated 7 years ago
- Repo for Bring Your Own Vision-Language-Action (VLA) model, arXiv 2024 ☆36 · Jan 22, 2025 · Updated last year
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables ☆76 · Mar 16, 2023 · Updated 2 years ago
- Generate expert demonstrations; GAIL (Generative Adversarial Imitation Learning); IRL (Inverse Reinforcement Learning) ☆32 · Aug 11, 2021 · Updated 4 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆35 · Jan 5, 2023 · Updated 3 years ago
- ☆35 · Mar 10, 2025 · Updated 11 months ago
- Generalised UDRL ☆37 · May 12, 2022 · Updated 3 years ago
- Risk-sensitive Inverse Reinforcement Learning ☆11 · Sep 11, 2019 · Updated 6 years ago
- Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning" ☆13 · Oct 7, 2023 · Updated 2 years ago
- code for polite ☆11 · Feb 28, 2024 · Updated 2 years ago
- Implementation for the CVPR 2019 paper "Graphical Contrastive Losses for Scene Graph Parsing" ☆12 · Nov 11, 2019 · Updated 6 years ago
- Introduction to Control Engineering with Python, Revised 2nd Edition ☆12 · Aug 22, 2024 · Updated last year
- Planning, inverse planning, and inference in planning, using PDDL and Gen. ☆41 · Jul 1, 2024 · Updated last year
- Simple implementation of dynamic movement primitives (DMP) in Python ☆11 · Jun 23, 2013 · Updated 12 years ago
- Goal-conditioned reinforcement learning like 🔥 ☆13 · Feb 3, 2024 · Updated 2 years ago
- Online Spatial Concept and Lexical Acquisition with Simultaneous Localization and Mapping ☆10 · Sep 11, 2020 · Updated 5 years ago
- On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignment) ☆11 · Nov 20, 2017 · Updated 8 years ago
- ☆14 · Jan 27, 2026 · Updated last month
- ☆22 · Jan 12, 2026 · Updated last month
- [RAL 2025] MTIL: Encoding Full History with Mamba for Temporal Imitation Learning ☆27 · Nov 17, 2025 · Updated 3 months ago
- Robust Reinforcement Learning Benchmark ☆12 · Sep 22, 2024 · Updated last year
- Evidential Calibration ☆11 · Mar 8, 2022 · Updated 3 years ago
- A small library of 3D related utilities used in my research. ☆10 · Mar 5, 2022 · Updated 3 years ago
- Optimized DQN for Caffe ☆11 · Dec 18, 2015 · Updated 10 years ago