Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.
☆61May 25, 2021Updated 5 years ago
Alternatives and similar repositories for policy-distillation-baselines
Users that are interested in policy-distillation-baselines are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Nov 22, 2019Updated 6 years ago
- Avenue is a simulator designed to test and prototype reinforcement learning algorithms. Avenue is a ServiceNow Research project that was …☆14Jul 15, 2022Updated 3 years ago
- Repository for the IROS 2024 Paper "In-Hand Following of Deformable Linear Objects Using Dexterous Fingers with Tactile Sensing"☆20Dec 24, 2025Updated 5 months ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- ☆22May 20, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆116Jan 16, 2024Updated 2 years ago
- Unity로 멀티 에이전트 강화학습(MARL) 수행하기 위한 프레임 워크 제공☆24Apr 17, 2022Updated 4 years ago
- Official implementation of the project HuDOR: Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards project. Website:…☆34Apr 10, 2025Updated last year
- Quadcopter control with RL☆16Nov 8, 2021Updated 4 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 6 years ago
- ☆35Jan 29, 2023Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX☆73Jul 23, 2021Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Jun 30, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆71Sep 23, 2024Updated last year
- ☆20Dec 2, 2024Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆90Oct 15, 2023Updated 2 years ago
- A project copied from google-research which named motion-imitation was rewrited with PyTorch☆10Sep 30, 2022Updated 3 years ago
- A modified benchmark for designing and controlling 2D Voxel-based Soft Robots☆40Nov 18, 2023Updated 2 years ago
- road-map & paper review for Reinforcement Learning☆47May 30, 2021Updated 4 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- ☆10Nov 4, 2019Updated 6 years ago
- Model-Based Offline Reinforcement Learning☆51Jan 13, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A mini racetrack world for developing and testing robots with AWS RoboMaker and Gazebo simulations.☆15Sep 8, 2020Updated 5 years ago
- Learning Perceptive Bipedal Locomotion over Irregular Terrain☆25Jun 29, 2023Updated 2 years ago
- ☆56Oct 10, 2025Updated 7 months ago
- Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…☆41Apr 17, 2024Updated 2 years ago
- Painless distributed training with torch☆12Apr 28, 2026Updated 3 weeks ago
- Official release of CompoSuite, a compositional RL benchmark☆50Jan 27, 2024Updated 2 years ago
- ☆26Apr 26, 2024Updated 2 years ago
- Optimization: principles and algorithms - Michel Bierlaire - EPFL Press - 2015☆15Aug 29, 2018Updated 7 years ago
- Model-based Policy Gradients☆32Mar 12, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- gym RL environment in which a mujoco simulation of Agility Robotics' Cassie robot is rewarded for walking/running forward as fast as poss…☆35Nov 17, 2023Updated 2 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- Official Code for Teacher Assistant-Based Knowledge Distillation Extracting Multi-level Features on Single Channel Sleep EEG (IJCAI 2023)☆11Nov 4, 2023Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆34Jun 9, 2025Updated 11 months ago
- A UI world clock and Timer (working) and Alarm UI☆16May 13, 2020Updated 6 years ago
- [NeurIPS 2024] GACL: Exemplar-Free Generalized Analytic Continual Learning☆17Nov 5, 2024Updated last year