Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.
☆59May 25, 2021Updated 4 years ago
Alternatives and similar repositories for policy-distillation-baselines
Users that are interested in policy-distillation-baselines are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Nov 22, 2019Updated 6 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 4 years ago
- Avenue is a simulator designed to test and prototype reinforcement learning algorithms. Avenue is a ServiceNow Research project that was …☆14Jul 15, 2022Updated 3 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- ☆22May 20, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆115Jan 16, 2024Updated 2 years ago
- Unity로 멀티 에이전트 강화학습(MARL) 수행하기 위한 프레임 워크 제공☆24Apr 17, 2022Updated 4 years ago
- Official implementation of the project HuDOR: Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards project. Website:…☆33Apr 10, 2025Updated last year
- ☆17Oct 12, 2023Updated 2 years ago
- Quadcopter control with RL☆16Nov 8, 2021Updated 4 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 6 years ago
- Simulation system for path planning evaluation☆14Dec 13, 2025Updated 4 months ago
- ☆35Jan 29, 2023Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX☆74Jul 23, 2021Updated 4 years ago
- An LED panel resembling that at bus stop in Seoul☆13Jun 4, 2021Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Jun 30, 2020Updated 5 years ago
- ☆70Sep 23, 2024Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆89Oct 15, 2023Updated 2 years ago
- ☆20Dec 2, 2024Updated last year
- ☆25Oct 9, 2024Updated last year
- A project copied from google-research which named motion-imitation was rewrited with PyTorch☆10Sep 30, 2022Updated 3 years ago
- A modified benchmark for designing and controlling 2D Voxel-based Soft Robots☆39Nov 18, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- road-map & paper review for Reinforcement Learning☆47May 30, 2021Updated 4 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- ☆10Nov 4, 2019Updated 6 years ago
- Model-Based Offline Reinforcement Learning☆51Jan 13, 2021Updated 5 years ago
- Learning Perceptive Bipedal Locomotion over Irregular Terrain☆25Jun 29, 2023Updated 2 years ago
- Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…☆41Apr 17, 2024Updated 2 years ago
- Official release of CompoSuite, a compositional RL benchmark☆50Jan 27, 2024Updated 2 years ago
- ☆26Apr 26, 2024Updated 2 years ago
- Optimization: principles and algorithms - Michel Bierlaire - EPFL Press - 2015☆15Aug 29, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Model-based Policy Gradients☆32Mar 12, 2020Updated 6 years ago
- gym RL environment in which a mujoco simulation of Agility Robotics' Cassie robot is rewarded for walking/running forward as fast as poss…☆35Nov 17, 2023Updated 2 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- Official Code for Teacher Assistant-Based Knowledge Distillation Extracting Multi-level Features on Single Channel Sleep EEG (IJCAI 2023)☆11Nov 4, 2023Updated 2 years ago
- [NeurIPS 2024] GACL: Exemplar-Free Generalized Analytic Continual Learning☆17Nov 5, 2024Updated last year
- PyTorch implementation of our paper Real-Time Reinforcement Learning (NeurIPS 2019)☆76May 3, 2020Updated 6 years ago