Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.
☆61May 25, 2021Updated 5 years ago
Alternatives and similar repositories for policy-distillation-baselines
Users that are interested in policy-distillation-baselines are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Nov 22, 2019Updated 6 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 5 years ago
- Repository for the IROS 2024 Paper "In-Hand Following of Deformable Linear Objects Using Dexterous Fingers with Tactile Sensing"☆20Dec 24, 2025Updated 5 months ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- ☆22May 20, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆116Jan 16, 2024Updated 2 years ago
- Unity로 멀티 에이전트 강화학습(MARL) 수행하기 위한 프레임 워크 제공☆24Apr 17, 2022Updated 4 years ago
- Official implementation of the project HuDOR: Bridging the Human to Robot Dexterity Gap through Object-Oriented Rewards project. Website:…☆34Apr 10, 2025Updated last year
- ☆17Oct 12, 2023Updated 2 years ago
- Quadcopter control with RL☆16Nov 8, 2021Updated 4 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 7 years ago
- Simulation system for path planning evaluation☆12Dec 13, 2025Updated 6 months ago
- ☆35Jan 29, 2023Updated 3 years ago
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- StarCraft II Multi Agent Challenge : QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX☆74Jul 23, 2021Updated 4 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Jun 30, 2020Updated 5 years ago
- ☆73Sep 23, 2024Updated last year
- ☆21Dec 2, 2024Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆91Oct 15, 2023Updated 2 years ago
- ☆25Oct 9, 2024Updated last year
- A project copied from google-research which named motion-imitation was rewrited with PyTorch☆10Sep 30, 2022Updated 3 years ago
- A modified benchmark for designing and controlling 2D Voxel-based Soft Robots☆40Nov 18, 2023Updated 2 years ago
- road-map & paper review for Reinforcement Learning☆46May 30, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- ☆10Nov 4, 2019Updated 6 years ago
- Jupyter notebook with the code of a probabilistic neural network in PyTorch☆12Jan 17, 2020Updated 6 years ago
- Model-Based Offline Reinforcement Learning☆51Jan 13, 2021Updated 5 years ago
- A mini racetrack world for developing and testing robots with AWS RoboMaker and Gazebo simulations.☆15Sep 8, 2020Updated 5 years ago
- Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…☆41Apr 17, 2024Updated 2 years ago
- Painless distributed training with torch☆12Apr 28, 2026Updated last month
- Official release of CompoSuite, a compositional RL benchmark☆51Jan 27, 2024Updated 2 years ago
- ☆27Apr 26, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Model-based Policy Gradients☆32Mar 12, 2020Updated 6 years ago
- gym RL environment in which a mujoco simulation of Agility Robotics' Cassie robot is rewarded for walking/running forward as fast as poss…☆35Nov 17, 2023Updated 2 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- ☆14Apr 29, 2025Updated last year
- ☆12Jul 6, 2023Updated 2 years ago
- ☆34Jun 9, 2025Updated last year