Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.
☆62May 25, 2021Updated 5 years ago
Alternatives and similar repositories for policy-distillation-baselines
Users that are interested in policy-distillation-baselines are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Nov 22, 2019Updated 6 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 5 years ago
- Avenue is a simulator designed to test and prototype reinforcement learning algorithms. Avenue is a ServiceNow Research project that was …☆14Jul 15, 2022Updated 3 years ago
- Repository for the IROS 2024 Paper "In-Hand Following of Deformable Linear Objects Using Dexterous Fingers with Tactile Sensing"☆21Dec 24, 2025Updated 6 months ago
- ☆22May 20, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆118Jan 16, 2024Updated 2 years ago
- Unity로 멀티 에이전트 강화학습(MARL) 수행하기 위한 프레임 워크 제공☆24Apr 17, 2022Updated 4 years ago
- Official implementation of the project HuDOR: Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards project. Website:…☆35Apr 10, 2025Updated last year
- ☆17Oct 12, 2023Updated 2 years ago
- Quadcopter control with RL☆16Nov 8, 2021Updated 4 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 7 years ago
- Simulation system for path planning evaluation☆12Dec 13, 2025Updated 6 months ago
- ☆35Jan 29, 2023Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Discrete Event Simulation using Simpy to run model based and model free deep reinforcement learning dispatch policies in a stochastic que…☆21Nov 18, 2018Updated 7 years ago
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX☆74Jul 23, 2021Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Jun 30, 2020Updated 6 years ago
- ☆73Sep 23, 2024Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆91Oct 15, 2023Updated 2 years ago
- ☆25Oct 9, 2024Updated last year
- A project copied from google-research which named motion-imitation was rewrited with PyTorch☆10Sep 30, 2022Updated 3 years ago
- A modified benchmark for designing and controlling 2D Voxel-based Soft Robots☆40Nov 18, 2023Updated 2 years ago
- road-map & paper review for Reinforcement Learning☆47May 30, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- ☆10Nov 4, 2019Updated 6 years ago
- Model-Based Offline Reinforcement Learning☆51Jan 13, 2021Updated 5 years ago
- A mini racetrack world for developing and testing robots with AWS RoboMaker and Gazebo simulations.☆15Sep 8, 2020Updated 5 years ago
- Learning Perceptive Bipedal Locomotion over Irregular Terrain☆25Jun 29, 2023Updated 3 years ago
- ☆57Oct 10, 2025Updated 8 months ago
- Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…☆41Apr 17, 2024Updated 2 years ago
- Official release of CompoSuite, a compositional RL benchmark☆51Jan 27, 2024Updated 2 years ago
- Optimization: principles and algorithms - Michel Bierlaire - EPFL Press - 2015☆14Aug 29, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆28Apr 26, 2024Updated 2 years ago
- Model-based Policy Gradients☆32Mar 12, 2020Updated 6 years ago
- gym RL environment in which a mujoco simulation of Agility Robotics' Cassie robot is rewarded for walking/running forward as fast as poss…☆35Nov 17, 2023Updated 2 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 5 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆11Apr 7, 2021Updated 5 years ago