Implementation of NeurIPS 2018 paper "Meta-Gradient Reinforcement Learning"
☆22Jul 19, 2022Updated 3 years ago
Alternatives and similar repositories for meta_gradient_RL
Users that are interested in meta_gradient_RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Integrate AutoRL into DQN to implement a single traffic signal control system.☆16Nov 16, 2023Updated 2 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆40Jul 18, 2025Updated 10 months ago
- solver for discrete Mixed Observable Markov Decision Processes☆11Oct 30, 2020Updated 5 years ago
- ☆11Jan 13, 2026Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Efficient Exploration through Bayesian Deep Q-Networks☆37Feb 14, 2018Updated 8 years ago
- Code for my publication: Deep Learning Predictive Band Switching in Wireless Networks. Paper accepted for publication to IEEE Transaction…☆17Jul 2, 2020Updated 5 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- An environment for table-carrying, a joint-action cooperative task.☆10Jan 8, 2024Updated 2 years ago
- ☆13Jun 3, 2022Updated 3 years ago
- ☆20Jul 6, 2025Updated 10 months ago
- gym-auv repository upgraded to Stable-Baselines 3☆12Aug 24, 2023Updated 2 years ago
- The Manta v1 software architecture for Autonomous Underwater Vehicles (AUVs) - Master's thesis☆10Aug 11, 2022Updated 3 years ago
- A python API for plane detection in point clouds☆12Apr 22, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆28Mar 7, 2024Updated 2 years ago
- ⚡️ Shockingly fast imitation learning algorithms via combining online and offline data engines. ⚡️☆13Sep 1, 2025Updated 8 months ago
- MetaPlanner is an open source automated treatment planning method that performs meta-optimization of treatment planning hyperparameters. …☆14Nov 7, 2023Updated 2 years ago
- Berkeley DeepDrive Drone Dataset☆12Apr 15, 2025Updated last year
- pytorch implementation of grok☆11May 18, 2026Updated last week
- [IEEE-TITS] Official implementation of paper "A Survey on the Application of Large Language Models in Scenario-Based Testing of Automated…☆33Jan 23, 2026Updated 4 months ago
- quadruped simulation using unitree a1 in pybullet, controller code from stanford pupper☆15May 19, 2021Updated 5 years ago
- Latent Dynamics Mixture, NeurIPS 2021☆18Oct 25, 2022Updated 3 years ago
- A simple RNN meta-learner☆10Dec 17, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆14Jul 23, 2023Updated 2 years ago
- ☆30Jan 29, 2024Updated 2 years ago
- Contrastive Distillation for Incremental Class Learning in Semantic Segmentation☆14Dec 13, 2021Updated 4 years ago
- An autonomous driving agent with a Safety model and the ATtention mechanism in a multi-task framework.☆15Jan 13, 2023Updated 3 years ago
- A convolutional autoencoder for feature extraction, with an SVM for image classification.☆10Jan 30, 2019Updated 7 years ago
- NASA Project; Plastic Marine Debris Classification-Machine Learning Software☆16Oct 12, 2021Updated 4 years ago
- Shared autonomy via deep reinforcement learning☆80Mar 24, 2023Updated 3 years ago
- Transfer Learning in Reinforcement Learning using Stable-Baseline3 | Transfer Reinforcement Learning for Differing Action Spaces via Q-Ne…☆22Feb 27, 2022Updated 4 years ago
- A packet loss detection and location solution based on AM-PM and INT, suitable for Mininet environment, written in P4 language.☆18May 23, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Control inverted pendulum by LQR in OpenAI Gym☆12Oct 2, 2024Updated last year
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆29Dec 28, 2017Updated 8 years ago
- Implementation of Dueling Network Architectures for Deep Reinforcement Learning paper with Pytorch☆14Sep 26, 2020Updated 5 years ago
- Official documentation of I-24 MOTION data products☆21Oct 16, 2023Updated 2 years ago
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)☆27Jan 21, 2026Updated 4 months ago
- Tensorflow implementation for Robust Adversarial Reinforcement Learning: https://arxiv.org/pdf/1703.02702.pdf☆28Mar 7, 2018Updated 8 years ago
- Implementation of Deep Q-learning from Demonstrations using Keras and a Retro Gym environment.☆14Jul 16, 2018Updated 7 years ago