Reproducing Policy Distillation (DeepMind paper ICLR 2016)
☆22Feb 17, 2020Updated 6 years ago
Alternatives and similar repositories for policydistillation
Users that are interested in policydistillation are comparing it to the libraries listed below.
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- ☆15Nov 22, 2019Updated 6 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- ☆10Aug 18, 2022Updated 3 years ago
- Machine Learning Course Project Skoltech 2018☆109Feb 11, 2019Updated 7 years ago
- Design good curriculums for deep reinforcement learning☆14May 18, 2016Updated 9 years ago
- rlcourse-march-17-hugobb created by GitHub Classroom☆16Jul 3, 2024Updated last year
- A Julia interface to the PATH solver☆15Jan 26, 2021Updated 5 years ago
- Code to reproduce the results in the paper "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- Inverse Reinforcement learning proof-of-concept using the Guided Cost/Reward Learning approach☆10Mar 23, 2020Updated 6 years ago
- A first bare-bones parallel implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Multi-agent coordination using game theory and nonlinear opinion dynamics - CDC 2023☆14Nov 29, 2023Updated 2 years ago
- ☆85May 29, 2019Updated 6 years ago
- Implementation of Attentive Multi Task Deep Reinforcement Learning Architecture in TensorFlow☆15Apr 5, 2019Updated 6 years ago
- Code for Automatic Curriculum Learning through Value Disagreement☆31Jun 15, 2020Updated 5 years ago
- [ICLR'20] Learning to Learn by Zeroth-Order Oracle☆14Feb 7, 2020Updated 6 years ago
- AlphaGo Zero Reinforcement Learning Sokoban Solver☆11Jun 20, 2018Updated 7 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with OpenAI Gym and TensorFlow☆17Sep 5, 2018Updated 7 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- From simulation to real world using deep generative models☆18Sep 30, 2018Updated 7 years ago
- Control with Deep Reinforcement Learning☆16Sep 14, 2023Updated 2 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- PyTorch code for the arXiv paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆57Apr 3, 2018Updated 7 years ago
- ☆29Nov 10, 2025Updated 4 months ago
- Group project "Algorithms for large-scale optimal transport". Implement ADMMs and Sinkhorn's Algorithms.☆11Jan 28, 2019Updated 7 years ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"☆10Mar 27, 2018Updated 7 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- Implementation of the paper 'Stochastic Wasserstein Barycenters'☆11Oct 17, 2018Updated 7 years ago
- ☆19Feb 18, 2024Updated 2 years ago
- This repository is for a project about Distributed TDMA for Mobile UWB Network Localization☆15Jun 1, 2021Updated 4 years ago
- Computing mixed-strategy Nash Equilibria for games involving multiple players☆25Jan 16, 2025Updated last year
- ROMFS filesystem firmware parsing and extraction☆12Dec 24, 2023Updated 2 years ago
- DeepSeek R1 distilled into smaller OSS models☆17Dec 2, 2025Updated 3 months ago
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration.☆11Mar 5, 2021Updated 5 years ago
- Trust Region Policy Optimization with Generalized Advantage Estimator☆16Nov 15, 2018Updated 7 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago