Source code to the AAAI21 publication Augmenting Policy Learning with Routines Discovered from a Single Demonstration
☆17Jan 7, 2021Updated 5 years ago
Alternatives and similar repositories for AAAI21-RoutineAugmentedPolicyLearning
Users that are interested in AAAI21-RoutineAugmentedPolicyLearning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆11Oct 22, 2020Updated 5 years ago
- The official code of paper "OMS-DPM: Optimizing Model Schedule for Diffusion Probabilistic Model" accepted by ICML 2023☆24Oct 11, 2023Updated 2 years ago
- Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions☆13May 22, 2023Updated 3 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- This is the dataset generation code for ADEPT (Approximate Derenderer, Extended Physics, and Tracking). http://physadept.csail.mit.edu/☆15Sep 26, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆49Apr 22, 2013Updated 13 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- ☆11Mar 23, 2022Updated 4 years ago
- reveal-md is great project. Improve your presentation even more with custom user scripts. Here is the place to find them.☆15Dec 7, 2023Updated 2 years ago
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding☆44Mar 15, 2024Updated 2 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆17Jul 7, 2020Updated 5 years ago
- Deep Reinforcement Learning by using Phasic Policy Gradient in Pytorch & Tensorflow☆20Oct 5, 2021Updated 4 years ago
- Probabilistic logic language for inference, planning and learning in static and dynamic domains☆15Feb 27, 2017Updated 9 years ago
- Code for RepNAS☆14Dec 21, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆15Feb 25, 2020Updated 6 years ago
- automated planning toolbox☆15Jun 5, 2017Updated 9 years ago
- Code for Paper "State Alignment-based Imitation Learning". Under maintenance☆17May 1, 2020Updated 6 years ago
- Wasserstein Distance guided Adversarial Imitation Learning (WDAIL) with Reward Shape Exploration☆19Feb 9, 2021Updated 5 years ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆20Feb 29, 2020Updated 6 years ago
- ☆26Jun 13, 2023Updated 3 years ago
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- ☆16Mar 15, 2024Updated 2 years ago
- Planning through backpropagation using TensorFlow.☆16Oct 29, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Notebooks for managing NeurIPS 2014 and analysing the NeurIPS experiment.☆13May 22, 2024Updated 2 years ago
- The official code for our ECCV22 oral paper: tracking objects as pixel-wise distributions.☆159Sep 21, 2022Updated 3 years ago
- Official TensorFlow implementation for "Supervised Domain Adaptation: A Graph Embedding Perspective and a Rectified Experimental Protocol…☆17Mar 25, 2023Updated 3 years ago
- ☆36Feb 18, 2026Updated 4 months ago
- ACPBench: Reasoning about Action, Change, and Planning. A benchmark designed to evaluate the fundamental reasoning abilities in the dom…☆33Feb 11, 2026Updated 4 months ago
- Local Attention - Flax module for Jax☆22May 26, 2021Updated 5 years ago
- Weakly Supervised Text-to-SQL Parsing through Question Decomposition☆23Nov 22, 2023Updated 2 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- ☆68Sep 28, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Verify naive = datafrog-opt, in rust/polonius☆16Jun 26, 2025Updated last year
- Code for WACV 2023 paper "Out-of-distribution Detection via Frequency-regularized Generative Models" by Mu Cai and Yixuan Li☆11May 1, 2023Updated 3 years ago
- Code for ICCV2021 paper: Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images☆15Jan 24, 2023Updated 3 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping☆97Mar 10, 2025Updated last year
- Annotated Minecraft dataset for machine learning☆13Nov 13, 2015Updated 10 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Jun 24, 2018Updated 8 years ago
- A benchmark library.☆15Oct 3, 2020Updated 5 years ago