LucasAlegre / mbcd
Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"
☆10Aug 7, 2023Updated 2 years ago
Alternatives and similar repositories for mbcd
Users who are interested in mbcd are comparing it to the repositories listed below:
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- Posted at AAAI 2023☆11Sep 4, 2025Updated 5 months ago
- ☆16Jun 30, 2022Updated 3 years ago
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆23Jun 24, 2023Updated 2 years ago
- ☆18Jul 10, 2022Updated 3 years ago
- Creating fixed-length vectors to describe RL/GA policies☆20Oct 23, 2021Updated 4 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆27Feb 24, 2023Updated 2 years ago
- Code and data for decision making under strategic behavior, NeurIPS 2020 & Management Science 2024.☆29Feb 28, 2024Updated last year
- ☆27Jun 23, 2020Updated 5 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- ☆16Feb 1, 2026Updated 2 weeks ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- A generative deep learning model based on GAN architecture was implemented to generate synthetic network data (benign and malicious) alik…☆10Oct 23, 2021Updated 4 years ago
- GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents☆19Jan 26, 2026Updated 2 weeks ago
- A project for simulating the rescue after a disaster☆10Dec 4, 2020Updated 5 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- ☆18Jul 17, 2025Updated 6 months ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- ☆11Nov 13, 2025Updated 3 months ago
- factory.ai FACTORY_API_KEY switch and query☆27Dec 6, 2025Updated 2 months ago
- Alpha mining with DEAP-based genetic programming.☆11Jul 7, 2023Updated 2 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- Accepted at WWW 25 Industrial Track (oral)☆16Jun 6, 2025Updated 8 months ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- This repository contains the code of the paper SO(2)-Equivariant Reinforcement Learning (ICLR 2022) and On-Robot Learning With Equivarian…☆38Oct 20, 2022Updated 3 years ago
- The ROS interface as well as the Python packages for ProSeCo Planning☆10Jun 17, 2024Updated last year
- ☆11Sep 30, 2022Updated 3 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- A heterogeneity-aware, energy-efficient federated learning scheduling system based on reinforcement learning☆14Apr 13, 2024Updated last year
- ☆11Jun 28, 2022Updated 3 years ago
- Another Wheel to parse json☆11Mar 13, 2020Updated 5 years ago
- ☆14Feb 26, 2025Updated 11 months ago
- This is the numerical approach proposed in the paper "Optimal Incentives to Mitigate Epidemics: A Stackelberg Mean Field Game Approach" b…☆12Nov 22, 2021Updated 4 years ago
- ☆10Feb 28, 2019Updated 6 years ago
- ☆11Jul 10, 2025Updated 7 months ago
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 4 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- a cute lavender themed hud, designed to be "calming"☆11Jun 6, 2025Updated 8 months ago
- modified datasets for remote sensing image caption☆11Apr 23, 2019Updated 6 years ago