Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
☆15Dec 10, 2020Updated 5 years ago
Alternatives and similar repositories for ubisoft-laforge-asaf
Users that are interested in ubisoft-laforge-asaf are comparing it to the libraries listed below.
- PyTorch implementation of the AAMAS 2021 paper "Energy-Based Imitation Learning"☆11Oct 8, 2021Updated 4 years ago
- ☆11Jun 8, 2020Updated 5 years ago
- Apprenticeship Learning with Inverse Reinforcement Learning☆29Aug 14, 2021Updated 4 years ago
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch☆35Jan 3, 2021Updated 5 years ago
- ☆10Mar 11, 2021Updated 5 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- ☆12Nov 18, 2023Updated 2 years ago
- Replication of the paper "Adaptive dropout for training deep neural networks" using Lasagne.☆12Sep 27, 2016Updated 9 years ago
- ☆66May 25, 2020Updated 5 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Implicit Normalizing Flows + Reinforcement Learning☆62May 31, 2019Updated 6 years ago
- ☆24Oct 26, 2021Updated 4 years ago
- ☆12Aug 30, 2024Updated last year
- A machine learning and deep learning project for playing Gobang (five in a row) by controlling a robotic arm☆12Apr 16, 2023Updated 2 years ago
- Code for paper "Model-free Safe Control for Zero-Violation Reinforcement Learning" at Conference on Robot Learning (CoRL) 2021.☆10Nov 1, 2021Updated 4 years ago
- ☆51Nov 26, 2019Updated 6 years ago
- NetVLAD Example on colab☆12Jan 10, 2021Updated 5 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆11Sep 16, 2021Updated 4 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control"☆12Apr 23, 2021Updated 4 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 8 years ago
- A group of utilities useful for members of UTCS.☆13Nov 19, 2016Updated 9 years ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆12Jun 12, 2025Updated 9 months ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- Implementation of safety augmented value estimation from demonstrations (SAVED)☆24Jul 13, 2019Updated 6 years ago
- [ICLR 2025] "Understanding Constraint Inference in Safety-Critical Inverse Reinforcement Learning"☆13Nov 30, 2025Updated 3 months ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆16Feb 28, 2023Updated 3 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- Learning to play Atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- ☆12Mar 23, 2018Updated 8 years ago
- Behavior cloning from observation☆38Dec 14, 2020Updated 5 years ago
- ☆17Dec 23, 2025Updated 3 months ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.☆12Jun 20, 2017Updated 8 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- Predicting the medal table of the Summer Games☆12Jul 6, 2023Updated 2 years ago
- Reproduction of the paper "Soft Q-Learning with Mutual Information Regularization" CoRL 2019.☆10Jan 10, 2019Updated 7 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago