PyTorch implementation of Policy Distillation for control, with well-trained teacher policies provided via stable_baselines3.
☆59 · May 25, 2021 · Updated 4 years ago
Alternatives and similar repositories for policy-distillation-baselines
Users that are interested in policy-distillation-baselines are comparing it to the libraries listed below.
- ☆15 · Nov 22, 2019 · Updated 6 years ago
- Keras implementation of TD3 (Twin Delayed DDPG) with an optional PER (Prioritized Experience Replay) on the OpenAI Gym framework ☆11 · May 29, 2021 · Updated 4 years ago
- Avenue is a simulator designed to test and prototype reinforcement learning algorithms. Avenue is a ServiceNow Research project that was … ☆14 · Jul 15, 2022 · Updated 3 years ago
- Repository for the IROS 2024 Paper "In-Hand Following of Deformable Linear Objects Using Dexterous Fingers with Tactile Sensing" ☆18 · Dec 24, 2025 · Updated 3 months ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693) ☆18 · Jul 6, 2023 · Updated 2 years ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI. ☆115 · Jan 16, 2024 · Updated 2 years ago
- Provides a framework for performing multi-agent reinforcement learning (MARL) with Unity ☆24 · Apr 17, 2022 · Updated 3 years ago
- Official implementation of HuDOR: Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards. Website:… ☆33 · Apr 10, 2025 · Updated last year
- ☆17 · Oct 12, 2023 · Updated 2 years ago
- Quadcopter control with RL ☆16 · Nov 8, 2021 · Updated 4 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018. ☆16 · Jun 5, 2019 · Updated 6 years ago
- Simulation system for path planning evaluation ☆14 · Dec 13, 2025 · Updated 4 months ago
- ☆35 · Jan 29, 2023 · Updated 3 years ago
- Arduino Uno Wifi REST Server for the Jura S95 CoffeeMaker – Acts as a backend for Homebridge / Siri ☆14 · Nov 12, 2016 · Updated 9 years ago
- Code for Continual Learning of Control Primitives ☆18 · Nov 11, 2020 · Updated 5 years ago
- StarCraft II Multi-Agent Challenge: QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX ☆74 · Jul 23, 2021 · Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions ☆30 · Jun 30, 2020 · Updated 5 years ago
- ☆70 · Sep 23, 2024 · Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆87 · Oct 15, 2023 · Updated 2 years ago
- ☆24 · Oct 9, 2024 · Updated last year
- A PyTorch rewrite of the motion-imitation project from google-research ☆10 · Sep 30, 2022 · Updated 3 years ago
- A modified benchmark for designing and controlling 2D Voxel-based Soft Robots ☆39 · Nov 18, 2023 · Updated 2 years ago
- Roadmap and paper review for Reinforcement Learning ☆47 · May 30, 2021 · Updated 4 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU. ☆18 · Jan 16, 2023 · Updated 3 years ago
- Model-Based Offline Reinforcement Learning ☆51 · Jan 13, 2021 · Updated 5 years ago
- TensorFlow code for paper: Learning Grid-like Units with Vector Representation of Self-Position and Matrix Representation of Self-Motion ☆19 · Jan 4, 2019 · Updated 7 years ago
- ☆54 · Oct 10, 2025 · Updated 6 months ago
- Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning… ☆41 · Apr 17, 2024 · Updated last year
- Painless distributed training with torch ☆12 · Apr 1, 2026 · Updated last week
- ☆26 · Apr 26, 2024 · Updated last year
- Optimization: principles and algorithms - Michel Bierlaire - EPFL Press - 2015 ☆15 · Aug 29, 2018 · Updated 7 years ago
- Model-based Policy Gradients ☆32 · Mar 12, 2020 · Updated 6 years ago
- gym RL environment in which a mujoco simulation of Agility Robotics' Cassie robot is rewarded for walking/running forward as fast as poss… ☆35 · Nov 17, 2023 · Updated 2 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices ☆23 · Jun 22, 2021 · Updated 4 years ago
- Official Code for Teacher Assistant-Based Knowledge Distillation Extracting Multi-level Features on Single Channel Sleep EEG (IJCAI 2023) ☆11 · Nov 4, 2023 · Updated 2 years ago
- ☆14 · Apr 29, 2025 · Updated 11 months ago
- ☆34 · Jun 9, 2025 · Updated 10 months ago
- ☆11 · Apr 7, 2021 · Updated 5 years ago
- A UI world clock and Timer (working) and Alarm UI ☆16 · May 13, 2020 · Updated 5 years ago