Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21
☆22May 20, 2021Updated 5 years ago
Alternatives and similar repositories for SSRR
Users that are interested in SSRR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat…☆51Dec 8, 2022Updated 3 years ago
- ☆21Dec 17, 2020Updated 5 years ago
- [ICML 2019] Implementation of "Imitation Learning from Imperfect Demonstration"☆50Jun 18, 2019Updated 7 years ago
- ☆18Jun 20, 2026Updated last week
- Almost Surely Stable Deep Dynamics [NeurIPS 2020]☆12Dec 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of Behavioral Cloning from Observationmentation☆16Nov 28, 2019Updated 6 years ago
- ☆10Oct 3, 2023Updated 2 years ago
- ☆43Apr 19, 2026Updated 2 months ago
- The official repo for the CoRL 2022 paper 'Learning Control Admissibility Models with Graph Neural Networks for Multi-Agent Navigation'☆10Oct 8, 2022Updated 3 years ago
- Official codebase for Sirius: Robot Learning on the Job☆66Oct 26, 2023Updated 2 years ago
- Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation (BUDS)☆57Dec 2, 2021Updated 4 years ago
- [ICML 2021] Learning to Weight Imperfect Demonstrations☆20Nov 4, 2022Updated 3 years ago
- ☆13Jan 30, 2021Updated 5 years ago
- ☆40Oct 30, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Jul 15, 2020Updated 5 years ago
- Inverse Constrained Reinforcement Learning (ICML 2021)☆28Aug 18, 2021Updated 4 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- Public implementation of "Multi-Agent Graph-Attention Communication and Teaming" from AAMAS'21☆99Apr 28, 2024Updated 2 years ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆241Nov 22, 2020Updated 5 years ago
- generative models on toys☆12Sep 10, 2024Updated last year
- Code for CORL'18 paper "Risk-Aware Active Inverse Reinforcement Learning"☆17Dec 20, 2018Updated 7 years ago
- Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.☆13Nov 4, 2021Updated 4 years ago
- ☆12Apr 1, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TACO-RL: Latent Plans for Task-Agnostic Offline Reinforcement Learning☆32Jan 26, 2023Updated 3 years ago
- a jax benchmark for ad hoc teamwork☆21Jun 7, 2026Updated 3 weeks ago
- Dyna built on R-exprs (First Prototype)☆17Mar 7, 2022Updated 4 years ago
- Apply major Reinforcement Learning algorithms (DQN,PPO,A2C) to CarRacing-v0 from GymAI environment.☆28Jan 4, 2022Updated 4 years ago
- Reading great papers in the history of artificial intelligence and machine learning☆10Oct 26, 2022Updated 3 years ago
- Factored Interactive POMDP solver based on symbolic Perseus.☆11Aug 12, 2025Updated 10 months ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- [NeurIPS '25] FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed☆29Mar 9, 2026Updated 3 months ago
- A framework for evaluating LLMs in Atari games☆15Apr 21, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- Soft Actor-Critic☆159Mar 13, 2018Updated 8 years ago
- 🔥 open-ss2: a third-party open-source implementation of Figure AI's Helix "System 1, System 2" VLA model for high-rate, dexterous humano…☆11Mar 18, 2025Updated last year
- Codebase of NeurIPS 2022 paper ''Planning for Sample Efficient Imitation Learning''☆41Oct 25, 2022Updated 3 years ago
- [MICCAI 2024 workshop] Official implementation of "SemiT-SAM: Building a Visual Foundation Model for Tooth Instance Segmentation on Panor…☆15Nov 13, 2024Updated last year
- Spectral Method for Multiple Experts Inverse Reinforcement Learning☆14Sep 6, 2014Updated 11 years ago
- Official Implementation of ICLR2025 Paper: Songyuan Zhang, Oswin So, Mitchell Black, Chuchu Fan: "Discrete GCBF Proximal Policy Optimizat…☆29May 14, 2025Updated last year