Reproducing Policy Distillation (DeepMind paper ICLR 2016)
☆22Feb 17, 2020Updated 6 years ago
Alternatives and similar repositories for policydistillation
Users that are interested in policydistillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- ☆15Nov 22, 2019Updated 6 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- ☆10Aug 18, 2022Updated 3 years ago
- Machine Learning Course Project Skoltech 2018☆109Feb 11, 2019Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Design good curriculums for deep reinforcement learning☆14May 18, 2016Updated 9 years ago
- rlcourse-march-17-hugobb created by GitHub Classroom☆16Jul 3, 2024Updated last year
- Planning with inferred internal states of other players in general-sum differential games.☆17May 3, 2022Updated 4 years ago
- This course introduced me to three cutting-edge technologies for privacy-preserving AI: Federated Learning, Differential Privacy, and Enc…☆11Sep 2, 2019Updated 6 years ago
- A Julia interface to the PATH solver☆15Jan 26, 2021Updated 5 years ago
- A2C for GVG-AI☆22Nov 7, 2018Updated 7 years ago
- Inverse Reinforcement learning proof-of-concept using the Guided Cost/Reward Learning approach☆10Mar 23, 2020Updated 6 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Multi-agent coordination using game theory and nonlinear opinion dynamics - CDC 2023☆14Nov 29, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆85May 29, 2019Updated 6 years ago
- Implementation of Attentive Multi Task Deep Reinforcement Learning Architecture in Tensorflow☆15Apr 5, 2019Updated 7 years ago
- [ICLR'20] Learning to Learn by Zeroth-Order Oracle☆14Feb 7, 2020Updated 6 years ago
- AlphaGo Zero Reinforcement Learning Sokoban Solver☆11Jun 20, 2018Updated 7 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- Pytorch code for Arxiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆57Apr 3, 2018Updated 8 years ago
- ☆29Nov 10, 2025Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆42Oct 31, 2012Updated 13 years ago
- Group project "Algorithms for large-scale optimal transport". Implement ADMMs and Sinkhorn's Algorithms.☆11Jan 28, 2019Updated 7 years ago
- Supporting code for "Parallel Streaming Wasserstein Barycenters"☆11Nov 14, 2017Updated 8 years ago
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 8 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago
- PoPS algorithm☆15Dec 8, 2022Updated 3 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- Implementation of the paper 'Stochastic Wasserstein Barycenters'☆11Oct 17, 2018Updated 7 years ago
- ☆19Feb 18, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This reposotory is for a project about Distributed TDMA for Mobile UWB Network Localization☆15Jun 1, 2021Updated 4 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- Simple GStreamer test programs for learning puporses.☆13Jul 27, 2013Updated 12 years ago
- DeepSeek R1 distilled into smaller OSS models☆17Dec 2, 2025Updated 5 months ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 7 years ago
- Gstreamer, Qt, RTSP server☆15Sep 7, 2018Updated 7 years ago
- 使用Python的LeetCode解题笔记,详情访问 http://leetcode.xyu.ink/☆11Sep 7, 2021Updated 4 years ago