Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21
☆22May 20, 2021Updated 4 years ago
Alternatives and similar repositories for SSRR
Users that are interested in SSRR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆51Dec 8, 2022Updated 3 years ago
- ☆21Dec 17, 2020Updated 5 years ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"☆50Jun 18, 2019Updated 6 years ago
- ☆18Apr 20, 2025Updated last year
- Almost Surely Stable Deep Dynamics [NeurIPS 2020]☆12Dec 8, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of Behavioral Cloning from Observationmentation☆16Nov 28, 2019Updated 6 years ago
- ☆10Oct 3, 2023Updated 2 years ago
- ☆42Apr 19, 2026Updated last week
- Public implementation of Heterogeneous Policy Networks (HetNet) from AAMAS'22 -- Paper Title: Learning Efficient Diverse Communication fo…☆21Apr 23, 2024Updated 2 years ago
- The official repo for the CoRL 2022 paper 'Learning Control Admissibility Models with Graph Neural Networks for Multi-Agent Navigation'☆10Oct 8, 2022Updated 3 years ago
- Official codebase for Sirius: Robot Learning on the Job☆66Oct 26, 2023Updated 2 years ago
- Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation (BUDS)☆57Dec 2, 2021Updated 4 years ago
- [ICML 2021] Learning to Weight Imperfect Demonstrations☆20Nov 4, 2022Updated 3 years ago
- ☆16May 1, 2011Updated 14 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Jan 30, 2021Updated 5 years ago
- ☆40Oct 30, 2021Updated 4 years ago
- ☆12Jul 15, 2020Updated 5 years ago
- Inverse Constrained Reinforcement Learning (ICML 2021)☆28Aug 18, 2021Updated 4 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- Code for joint neural network training☆20May 30, 2019Updated 6 years ago
- Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21☆95Apr 28, 2024Updated 2 years ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆242Nov 22, 2020Updated 5 years ago
- Code for CORL'18 paper "Risk-Aware Active Inverse Reinforcement Learning"☆16Dec 20, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.☆13Nov 4, 2021Updated 4 years ago
- Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"☆23Sep 7, 2025Updated 7 months ago
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning☆32Jan 26, 2023Updated 3 years ago
- Dyna built on R-exprs (First Prototype)☆17Mar 7, 2022Updated 4 years ago
- Reading great papers in the history of artificial intelligence and machine learning☆10Oct 26, 2022Updated 3 years ago
- Factored Interactive POMDP solver based on symbolic Perseus.☆11Aug 12, 2025Updated 8 months ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- Fast QP Solver in JAX☆29Aug 29, 2024Updated last year
- A framework for evaluating LLMs in Atari games☆15Apr 21, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- Soft Actor-Critic☆158Mar 13, 2018Updated 8 years ago
- 🔥 open-ss2: a third-party open-source implementation of Figure AI's Helix "System 1, System 2" VLA model for high-rate, dexterous humano…☆11Mar 18, 2025Updated last year
- Teleoperation of LEAP Hand using Apple Vision Pro☆35Dec 24, 2024Updated last year
- Codebase of NeurIPS 2022 paper ''Planning for Sample Efficient Imitation Learning''☆41Oct 25, 2022Updated 3 years ago
- Spectral Method for Multiple Experts Inverse Reinforcement Learning☆14Sep 6, 2014Updated 11 years ago
- Official Implementation of ICLR2025 Paper: Songyuan Zhang, Oswin So, Mitchell Black, Chuchu Fan: "Discrete GCBF Proximal Policy Optimizat…☆25May 14, 2025Updated 11 months ago