Reproducing Policy Distillation (DeepMind paper ICLR 2016)
☆22Feb 17, 2020Updated 6 years ago
Alternatives and similar repositories for policydistillation
Users who are interested in policydistillation are comparing it to the libraries listed below
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- ☆15Nov 22, 2019Updated 6 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Machine Learning Course Project Skoltech 2018☆109Feb 11, 2019Updated 7 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- ☆31Nov 21, 2018Updated 7 years ago
- ☆19Feb 18, 2024Updated 2 years ago
- StarCraft: BroodWars OpenAI Gym environment☆84Jan 8, 2019Updated 7 years ago
- Docker image for Left 4 Dead 2 (L4D2) server.☆11Oct 5, 2020Updated 5 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- A first bare-bones parallelized implementation of Go-Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Slimebound character mod for Slay the Spire☆14Jun 30, 2020Updated 5 years ago
- Paste by simulating keyboard input; copy text by recognizing images with OCR☆11May 26, 2023Updated 2 years ago
- A program to convert a given regular expression to a Nondeterministic Finite Automaton (NFA)☆10Feb 3, 2019Updated 7 years ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"☆10Mar 27, 2018Updated 7 years ago
- CarND Capstone☆10Apr 2, 2018Updated 7 years ago
- Active Learning of Abstract Plan Feasibility☆12Feb 10, 2023Updated 3 years ago
- a feature frontend for VINS☆10Aug 27, 2018Updated 7 years ago
- An unfinished implementation of ESKF based stereo VIO algorithm☆11Jan 6, 2018Updated 8 years ago
- ROMFS file system firmware parsing and extraction☆12Dec 24, 2023Updated 2 years ago
- Data and code for "Probing Spurious Correlations in Popular Event-Based Rumor Detection Benchmarks" (ECML-PKDD 2022)☆11Jun 12, 2023Updated 2 years ago
- QPSK-Modem☆11Oct 7, 2012Updated 13 years ago
- Standalone utility to encrypt files with ice encryption, that doesn't depend on Steam.☆10Aug 28, 2013Updated 12 years ago
- A web framework developed in C++ (features still being implemented)☆10Nov 3, 2019Updated 6 years ago
- a simple vpn forked from android sdk and xiaoxia.org/2012/02/21/udpip-vpn☆13Dec 20, 2014Updated 11 years ago
- Real valued neural networks (RVNN) and complex valued neural networks (CVNN) (Akira Hirose, 2012).☆11Jul 17, 2017Updated 8 years ago
- ☆10Aug 18, 2022Updated 3 years ago
- NIPS 2017 Value Prediction Network☆167Jan 12, 2018Updated 8 years ago
- NILC-USP at SemEval-2017 Task 4: A Multi-view Ensemble for Twitter Sentiment Analysis☆10Feb 19, 2017Updated 9 years ago
- Official repository for the paper "Automating Continual Learning"☆18Jun 11, 2025Updated 8 months ago
- Official release of CompoSuite, a compositional RL benchmark☆50Jan 27, 2024Updated 2 years ago
- LeetCode solution notes in Python; for details, visit http://leetcode.xyu.ink/☆11Sep 7, 2021Updated 4 years ago
- A fork of ns3 LTE module for reinforcement learning experiments☆13Feb 20, 2017Updated 9 years ago
- An image super-resolution model implemented in TensorFlow☆11Nov 25, 2019Updated 6 years ago
- bilibili yeah!!!!☆13Jan 16, 2021Updated 5 years ago
- ☆12May 21, 2017Updated 8 years ago
- Python wrapper for MuJoCo physics simulation.☆12Feb 14, 2019Updated 7 years ago