CUN-bjy / policy-distillation-baselines
PyTorch implementation of Policy Distillation for control, with well-trained teachers provided via stable_baselines3.
☆56 Updated 3 years ago
Alternatives and similar repositories for policy-distillation-baselines:
Users who are interested in policy-distillation-baselines are comparing it to the libraries listed below:
- This repo relates to the survey paper "Goal-Conditioned Reinforcement Learning: Problems and Solutions". We collect widely used benchmar… ☆117 Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch. ☆65 Updated last year
- Source files to replicate experiments in my ICLR 2022 paper. ☆67 Updated 7 months ago
- ☆14 Updated 5 years ago
- behavior cloning from observation ☆35 Updated 4 years ago
- PyTorch code accompanying the paper "Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning" (NeurIPS 2020 spot… ☆38 Updated last year
- Skill-based Model-based Reinforcement Learning (CoRL 2022) ☆55 Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL ☆127 Updated last year
- RL Algorithms for Visual Continuous Control ☆33 Updated last year
- Advantage weighted Actor Critic for Offline RL ☆50 Updated 2 years ago
- ☆47 Updated last year
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021 ☆31 Updated 2 years ago
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3 ☆29 Updated last year
- CORRO code ☆35 Updated 2 years ago
- A PyTorch implementation of Implicit Q-Learning ☆71 Updated 3 years ago
- Official release of CompoSuite, a compositional RL benchmark ☆47 Updated last year
- A multi-subtask reinforcement learning method where complex tasks can be decomposed into low-level subtasks. ☆31 Updated 3 years ago
- Simple maze environments using mujoco-py ☆54 Updated last year
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch ☆33 Updated 4 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023) ☆76 Updated 2 months ago
- ☆69 Updated 2 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019) ☆63 Updated 5 years ago
- ☆26 Updated last year
- Model-based Policy Gradients ☆30 Updated 4 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat… ☆49 Updated 2 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆51 Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆158 Updated 3 months ago
- Benchmarking Repository for robosuite + SAC ☆59 Updated 3 years ago
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆64 Updated 8 months ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018. ☆14 Updated 5 years ago