Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
☆15Dec 10, 2020Updated 5 years ago
Alternatives and similar repositories for ubisoft-laforge-asaf
Users that are interested in ubisoft-laforge-asaf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- ☆12Jun 8, 2020Updated 6 years ago
- Apprenticeship Learning with Inverse Reinforcement Learning☆28Aug 14, 2021Updated 4 years ago
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch☆35Jan 3, 2021Updated 5 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆31Jul 1, 2019Updated 6 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- ☆11Nov 18, 2023Updated 2 years ago
- Replication of the paper "Adaptive dropout for training deep neural networks" using Lasagne.☆12Sep 27, 2016Updated 9 years ago
- ☆66May 25, 2020Updated 6 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- ☆24Oct 26, 2021Updated 4 years ago
- Reinforcement Learning Project☆12Jan 16, 2017Updated 9 years ago
- Pytorch implementation of InfoGAIL and WGAIL☆19Oct 7, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- My PhD thesis, titled "Reasonably Programmable Syntax"☆15Aug 28, 2018Updated 7 years ago
- This is a project based on machine learning and deep learning method for playing Gobang by controlling mechanical arm(利用机械臂下五子棋)☆13Apr 16, 2023Updated 3 years ago
- ☆18Oct 28, 2023Updated 2 years ago
- PyTorch code for TAPAS-GMM.☆15Nov 21, 2024Updated last year
- Code for paper "Model-free Safe Control for Zero-Violation Reinforcement Learning" at Conference on Robot Learning (CoRL) 2021.☆10Nov 1, 2021Updated 4 years ago
- ☆51Nov 26, 2019Updated 6 years ago
- NetVLAD Example on colab☆12Jan 10, 2021Updated 5 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 9 years ago
- A group of utilities useful for members of UTCS.☆13Nov 19, 2016Updated 9 years ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆12Jun 12, 2025Updated last year
- Implementation of safety augmented value estimation from demonstrations (SAVED)☆24Jul 13, 2019Updated 6 years ago
- Code to reproduce paper results (or as close as possible, depending on data-availability). Each publication has a Jupyter notebook. Mostl…☆12Mar 8, 2024Updated 2 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆16Feb 28, 2023Updated 3 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation☆10Jun 20, 2021Updated 4 years ago
- learning to play atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Mar 23, 2018Updated 8 years ago
- behavior cloning from observation☆38Dec 14, 2020Updated 5 years ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.☆12Jun 20, 2017Updated 8 years ago
- ☆16Feb 26, 2024Updated 2 years ago
- Predicting the medal table of the Summer Games☆12Jul 6, 2023Updated 2 years ago
- ☆13Apr 11, 2022Updated 4 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago