IrisLi17 / onpolicy_algorithm
☆17Updated 2 years ago
Alternatives and similar repositories for onpolicy_algorithm
Users that are interested in onpolicy_algorithm are comparing it to the libraries listed below
- Benchmarking Repository for robosuite + SAC☆66Updated 4 years ago
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar…☆143Updated 2 years ago
- Paper Collection for Imitation Learning in RL.☆154Updated 3 years ago
- This is the repo of "RL-ViGen: A Reinforcement Learning Benchmark for Visual Generalization"☆111Updated 11 months ago
- [ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.☆83Updated 5 months ago
- ☆121Updated 2 years ago
- ☆55Updated 2 years ago
- ☆44Updated last year
- Skill-based Model-based Reinforcement Learning (CoRL 2022)☆64Updated 3 years ago
- Code for https://jangirrishabh.github.io/lookcloser/☆42Updated 2 years ago
- ☆70Updated last year
- Code for Transfering Hierarchical Structure with Dual Meta Imitation Learning.☆17Updated 4 years ago
- ☆56Updated 4 years ago
- Official codebase for Manipulation Primitive-augmented reinforcement Learning (MAPLE)☆96Updated 3 years ago
- ☆11Updated 3 years ago
- ICML'20: Intrinsic Reward Driven Imitation Learning via Generative Model☆15Updated 4 years ago
- ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Man…☆21Updated 2 years ago
- official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)☆117Updated last year
- [ICML2025] Official implementation of Efficient Online Reinforcement Learning for Diffusion Policies appearing in ICML 2025.☆46Updated last week
- A PyTorch implementation of Implicit Behavioral Cloning☆111Updated 3 years ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning" contains scripts to reproduce experiments.☆133Updated 4 years ago
- Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation (BUDS)☆57Updated 4 years ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆78Updated last year
- Public implementation of "Learning from Suboptimal Demonstration via Self-Supervised Reward Regression" from CoRL'21☆23Updated 4 years ago
- Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"☆179Updated 2 years ago
- A benchmark for offline goal-conditioned RL and offline RL☆324Updated 3 weeks ago
- ☆25Updated 3 years ago
- ☆132Updated 5 years ago
- Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport☆83Updated 2 years ago
- ☆60Updated last week