Reproducing Policy Distillation (DeepMind paper ICLR 2016)
☆22Feb 17, 2020Updated 6 years ago
Alternatives and similar repositories for policydistillation
Users that are interested in policydistillation are comparing it to the libraries listed below.
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- ☆15Nov 22, 2019Updated 6 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Machine Learning Course Project Skoltech 2018☆109Feb 11, 2019Updated 7 years ago
- Design good curriculums for deep reinforcement learning☆14May 18, 2016Updated 10 years ago
- rlcourse-march-17-hugobb created by GitHub Classroom☆16Jul 3, 2024Updated last year
- Planning with inferred internal states of other players in general-sum differential games.☆17May 3, 2022Updated 4 years ago
- Core interface to design, solve, and simulate trajectory games.☆21Dec 6, 2024Updated last year
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- A first bare-bones parallelized implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Multi-agent coordination using game theory and nonlinear opinion dynamics - CDC 2023☆14Nov 29, 2023Updated 2 years ago
- ☆85May 29, 2019Updated 6 years ago
- Implementation of Attentive Multi Task Deep Reinforcement Learning Architecture in TensorFlow☆15Apr 5, 2019Updated 7 years ago
- Code for Automatic Curriculum Learning through Value Disagreement☆31Jun 15, 2020Updated 5 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- From simulation to real world using deep generative models☆18Sep 30, 2018Updated 7 years ago
- Control with Deep Reinforcement Learning☆16Sep 14, 2023Updated 2 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- PyTorch code for arXiv Paper: Learning to learn: Meta-Critic Networks for Sample-Efficient Learning☆57Apr 3, 2018Updated 8 years ago
- A reinforcement learning algorithm controller for a satellite using the orekit library☆20Feb 20, 2022Updated 4 years ago
- ☆30Nov 10, 2025Updated 6 months ago
- Supporting code for "Parallel Streaming Wasserstein Barycenters"☆11Nov 14, 2017Updated 8 years ago
- Python implementation of the paper "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation"☆10Mar 27, 2018Updated 8 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 7 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- Implementation of the paper 'Stochastic Wasserstein Barycenters'☆11Oct 17, 2018Updated 7 years ago
- ☆19Feb 18, 2024Updated 2 years ago
- This repository is for a project about Distributed TDMA for Mobile UWB Network Localization☆15Jun 1, 2021Updated 4 years ago
- Computing mixed-strategy Nash Equilibria for games involving multiple players☆25Jan 16, 2025Updated last year
- Simple GStreamer test programs for learning purposes.☆13Jul 27, 2013Updated 12 years ago
- ROMFS file system firmware parsing and extraction☆12Dec 24, 2023Updated 2 years ago
- DeepSeek R1 distilled into smaller OSS models for hobbyists☆17Dec 2, 2025Updated 5 months ago
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration.☆11Mar 5, 2021Updated 5 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 7 years ago
- CFG-GAN: Composite functional gradient learning of generative adversarial models☆15Jul 9, 2020Updated 5 years ago
- The CLI & python API for the well-known project gpt-academic.☆19Sep 22, 2024Updated last year
- GStreamer, Qt, RTSP server☆15Sep 7, 2018Updated 7 years ago