dion-jy/policy-distillation-baselines

PyTorch implementation of Policy Distillation for control, with well-trained teachers provided via stable_baselines3.

☆62

Alternatives and similar repositories for policy-distillation-baselines

Users that are interested in policy-distillation-baselines are comparing it to the libraries listed below.

Mee321 / policy-distillation
☆15 · Nov 22, 2019 · Updated 6 years ago
dion-jy / gym-td3-keras
Keras implementation of TD3 (Twin Delayed DDPG) with an optional PER (Prioritized Experience Replay) on the OpenAI Gym framework
☆11 · May 29, 2021 · Updated 5 years ago
philipjball / ReadyPolicyOne
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18 · Jul 6, 2023 · Updated 3 years ago
thethaibinh / agile_flight
Simulation system for path planning evaluation
☆13 · Dec 13, 2025 · Updated 7 months ago
Shanghai-Digital-Brain-Laboratory / DB-Football
A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.
☆118 · Jan 16, 2024 · Updated 2 years ago
diversepsro / diverse_psro
☆22 · May 20, 2021 · Updated 5 years ago
jk96491 / PredatorPrey
Provides a framework for running multi-agent reinforcement learning (MARL) in Unity
☆24 · Apr 17, 2022 · Updated 4 years ago
akshitj1 / uav-mujoco
Quadcopter control with RL
☆16 · Nov 8, 2021 · Updated 4 years ago
fiberleif / POfD
Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.
☆16 · Jun 5, 2019 · Updated 7 years ago
CarperAI / Algorithm-Distillation-RLHF
☆35 · Jan 29, 2023 · Updated 3 years ago
zhouzypaul / wsrl
JAX implementation of WSRL and RL baselines | ICLR 2025
☆145 · Feb 26, 2026 · Updated 4 months ago
jk96491 / SMAC
StarCraft II Multi-Agent Challenge: QMIX, COMA, LIIR, QTRAN, Central V, ROMA, RODE, DOP, Graph MIX
☆73 · Jul 23, 2021 · Updated 5 years ago
siddharthverma314 / clcp-neurips-2020
Code for Continual Learning of Control Primitives
☆18 · Nov 11, 2020 · Updated 5 years ago
hengyuan-hu / ibrl
☆74 · Sep 23, 2024 · Updated last year
Mingrui-Yu / DLO_following
Repository for the IROS 2024 Paper "In-Hand Following of Deformable Linear Objects Using Dexterous Fingers with Tactile Sensing"
☆22 · Dec 24, 2025 · Updated 7 months ago
ElisevanderPol / PRAE
Plannable Approximations to MDP Homomorphisms: Equivariance under Actions
☆30 · Jun 30, 2020 · Updated 6 years ago
hletrd / LED_bus_panel
An LED panel resembling those at bus stops in Seoul
☆14 · Jun 4, 2021 · Updated 5 years ago
rll-research / teachable
☆17 · Oct 12, 2023 · Updated 2 years ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆92 · Oct 15, 2023 · Updated 2 years ago
dion-jy / rl-paper-review
Roadmap & paper review for Reinforcement Learning
☆47 · May 30, 2021 · Updated 5 years ago
Yuxing-Wang-THU / ModularEvoGym
A modified benchmark for designing and controlling 2D Voxel-based Soft Robots
☆41 · Nov 18, 2023 · Updated 2 years ago
fmaxgarcia / Meta-MDP
☆10 · Nov 4, 2019 · Updated 6 years ago
SwapnilPande / MOReL
Model-Based Offline Reinforcement Learning
☆51 · Jan 13, 2021 · Updated 5 years ago
Adaptive-RL / AdaRL-code
Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…
☆41 · Apr 17, 2024 · Updated 2 years ago
perrin-isir / gym-cassie-run
gym RL environment in which a mujoco simulation of Agility Robotics' Cassie robot is rewarded for walking/running forward as fast as poss…
☆35 · Nov 17, 2023 · Updated 2 years ago
MahanFathi / Model-Based-RL
Model-based Policy Gradients
☆32 · Mar 12, 2020 · Updated 6 years ago
gisbi-kim / robot-intelligence-lectures
Graduate lecture slides on VLA & JEPA
☆16 · May 2, 2026 · Updated 2 months ago
atavakol / action-hypergraph-networks
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23 · Jun 22, 2021 · Updated 5 years ago
omron-sinicx / action-constrained-RL-benchmark
☆28 · Apr 26, 2024 · Updated 2 years ago
Lifelong-ML / CompoSuite
Official release of CompoSuite, a compositional RL benchmark
☆51 · Jan 27, 2024 · Updated 2 years ago
DiLi-Lab / ScanDL
☆14 · Apr 29, 2025 · Updated last year
polixir / causal-mbrl
Toolkit of Causal Model-based Reinforcement Learning.
☆33 · Jun 5, 2023 · Updated 3 years ago
ezhan94 / calibratable-style-consistency
☆11 · Jun 5, 2023 · Updated 3 years ago
HychaoWang / SleepKD
Official Code for Teacher Assistant-Based Knowledge Distillation Extracting Multi-level Features on Single Channel Sleep EEG (IJCAI 2023)
☆11 · Nov 4, 2023 · Updated 2 years ago
heechulbae / simulation
Discrete Event Simulation using Simpy to run model based and model free deep reinforcement learning dispatch policies in a stochastic que…
☆21 · Nov 18, 2018 · Updated 7 years ago
joenghl / HYPO
☆14 · Dec 29, 2023 · Updated 2 years ago
KU-LIM-Lab / DRL-course
☆22 · May 14, 2021 · Updated 5 years ago
seongyeon1 / oh-my-slides
A Claude Code plugin that generates animation-rich HTML presentations from natural language prompts. 20 curated design presets, PPTX expo…
☆16 · May 15, 2026 · Updated 2 months ago
jjalcaraz-upct / network-slicing
Network slicing gym environment and a model-based RL agent with kernels
☆68 · Nov 19, 2024 · Updated last year