☆15Nov 22, 2019Updated 6 years ago
Alternatives and similar repositories for policy-distillation
Users that are interested in policy-distillation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for Pathfinding in Stochastic Environments paper.☆15Oct 27, 2022Updated 3 years ago
- Planning with inferred internal states of other players in general-sum differential games.☆17May 3, 2022Updated 4 years ago
- Machine Learning Course Project Skoltech 2018☆109Feb 11, 2019Updated 7 years ago
- Core interface to design, solve, and simulate trajectory games.☆21Dec 6, 2024Updated last year
- An easy to understand implementation of the paper "Model-Based Reinforcement Learning for Atari"☆18Sep 27, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A Julia interface to the PATH solver☆15Jan 26, 2021Updated 5 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- [IEEE IV 22'] Code for 'Improved Deep Reinforcement Learning with Expert Demonstrationsfor Urban Autonomous Driving'☆14Jun 17, 2021Updated 4 years ago
- ☆39Jan 8, 2020Updated 6 years ago
- Train an RL agent to play multiple Atari games at once☆69Jun 6, 2016Updated 9 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Oct 22, 2019Updated 6 years ago
- Officially unofficial PyTorch code for the NIPS paper 'Natural-Parameter Networks: A Class of Probabilistic Neural Networks'☆11Sep 28, 2021Updated 4 years ago
- CS525/DS595 Course Project - Deep Reinforcement Learning for Decision Making in Autonomous Driving☆14Dec 12, 2022Updated 3 years ago
- Multi-agent coordination using game theory and nonlinear opinion dynamics - CDC 2023☆14Nov 29, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Deep Reinforcement Learning navigation of autonomous vehicles. Implementation of deep-Q learning, dyna-Q learning, Q-learning agents incl…☆12Oct 29, 2024Updated last year
- A research project that leverages reinforcement learning and game theory in self-driving cars☆18Jun 6, 2021Updated 4 years ago
- Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018).☆16May 29, 2018Updated 7 years ago
- Implementation of Attentive Multi Task Deep Reinforcement Learning Architecture in Tensorflow☆15Apr 5, 2019Updated 7 years ago
- Reimplementation of Policy Optimization with Demonstrations (POfD) from ICML 2018.☆16Jun 5, 2019Updated 6 years ago
- ☆17May 25, 2024Updated last year
- Single Episode Policy Transfer in Reinforcement Learning☆17Jun 13, 2022Updated 3 years ago
- The code has been implemented in Carla Simulator with the help of Double DQN to train an agent how to drive autonomously using different …☆16Aug 20, 2019Updated 6 years ago
- Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…☆41Apr 17, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with Open AI Gym and TensorFlow☆17Sep 5, 2018Updated 7 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆22Jun 6, 2018Updated 7 years ago
- Control with Deep Reinforcement Learning☆16Sep 14, 2023Updated 2 years ago
- Code for PolyTask: Learning Unified Policies through Behavior Distillation☆11Oct 13, 2023Updated 2 years ago
- fix8 (Fixate) is an Open-Source GUI Tool for Working with Eye Tracking Data in Reading Tasks.☆16Feb 15, 2026Updated 2 months ago
- Code release for LiReN: Lifelong Autonomous Improvement of Robot Foundation Models in the Wild☆11Jan 28, 2025Updated last year
- self-driving car supervised by Valeo, this project aims to develop a self-driving car model using imitation learning technique on Carla u…☆14Sep 8, 2022Updated 3 years ago
- Pupillometry software for EPSRC CVD☆14Jul 6, 2023Updated 2 years ago
- An MPC algorithm which supports polytopic state and action constraints, using CEM optimisation.☆18Oct 1, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- SQL Server图书管理系统的数据库☆10Apr 25, 2019Updated 7 years ago
- The repository is intended as a support tool for the report of the project "Sim to Real transfer of Reinforcement Learning Policies in Ro…☆13Mar 19, 2023Updated 3 years ago
- ☆13Apr 25, 2023Updated 3 years ago
- Code to reproduce the Arena environment experiments from Direct Behavior Specification via Constrained Reinforcement Learning.☆22Sep 10, 2022Updated 3 years ago
- A Framework for Safe and Accelerated Reinforcement Learning-based Radio Resource Management☆20Oct 1, 2022Updated 3 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Apr 28, 2019Updated 7 years ago
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks"☆15Jul 28, 2021Updated 4 years ago