☆15Nov 22, 2019Updated 6 years ago
Alternatives and similar repositories for policy-distillation
Users who are interested in policy-distillation are comparing it to the libraries listed below.
- PyTorch implementation of Policy Distillation for control, with well-trained teachers provided via stable_baselines3.☆59May 25, 2021Updated 4 years ago
- Source code for Pathfinding in Stochastic Environments paper.☆15Oct 27, 2022Updated 3 years ago
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"☆17Sep 27, 2019Updated 6 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 6 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆22Jun 6, 2018Updated 7 years ago
- Reproducing Policy Distillation (DeepMind paper ICLR 2016)☆22Feb 17, 2020Updated 6 years ago
- PyTorch implementation of GAIL and PPO reinforcement learning algorithms☆26May 7, 2021Updated 4 years ago
- System Programming That Stimulates Your Brain (뇌를 자극하는 시스템 프로그래밍)☆13Mar 2, 2023Updated 3 years ago
- PyTorch IMPALA implementation☆27Aug 31, 2019Updated 6 years ago
- RL Algorithms for Visual Continuous Control☆36May 31, 2023Updated 2 years ago
- Model-based Policy Gradients☆32Mar 12, 2020Updated 5 years ago
- Cornell House Agent Learning Environment☆47Jun 22, 2022Updated 3 years ago
- My Body Is A Cage☆41Apr 13, 2021Updated 4 years ago
- Automatically completes Beijing's Youth Study (青年大学习) using GitHub Actions☆10Nov 5, 2022Updated 3 years ago
- In this project, we give Python and C++ code for Ring Polymer Molecular Dynamics (RPMD) to calculate the time correlation function(…☆12Dec 31, 2017Updated 8 years ago
- Train an RL agent to play multiple Atari games at once☆69Jun 6, 2016Updated 9 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- Deep Reinforcement Learning based Autonomous Driving Agents☆10Jul 7, 2022Updated 3 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- ☆14Apr 29, 2025Updated 10 months ago
- Our first-year mathematics graduate school notes☆10Dec 20, 2021Updated 4 years ago
- ☆10Aug 18, 2022Updated 3 years ago
- Active Learning of Abstract Plan Feasibility☆12Feb 10, 2023Updated 3 years ago
- A tutorial on using Kubeflow to build and train a model and to deploy model serving.☆14Nov 22, 2022Updated 3 years ago
- Code for PolyTask: Learning Unified Policies through Behavior Distillation☆12Oct 13, 2023Updated 2 years ago
- Solving the card game 6 nimmt! with reinforcement learning☆14Dec 31, 2021Updated 4 years ago
- Implementation of an ordinal classification paper using logistic regression and SVM☆12Jun 10, 2017Updated 8 years ago
- (AAAI23) Riemannian Local Mechanism for SPD Neural Networks☆10Mar 11, 2024Updated last year
- Code associated with the project http://predimportance.mit.edu/☆12Aug 7, 2020Updated 5 years ago
- This repository is the official implementation of Low-Rank Modular Reinforcement Learning via Muscle Synergy.☆11Oct 27, 2022Updated 3 years ago
- OpenAI Gym environment of the Missile Command Atari game.☆14May 23, 2023Updated 2 years ago
- ☆11Dec 23, 2025Updated 2 months ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Can't we do time-series forecasting with attention?☆10Apr 30, 2021Updated 4 years ago
- Resilient Steel Structures Laboratory (RESSLab) Python Library☆11Dec 26, 2022Updated 3 years ago
- ☆10Jan 29, 2021Updated 5 years ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- Explore Fibonacci, Galois, and State Space Linear Feedback Shift Register (LFSR) sequence generators☆12Dec 29, 2020Updated 5 years ago
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning☆10Dec 8, 2022Updated 3 years ago