The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning'
☆17Oct 6, 2024Updated last year
Alternatives and similar repositories for STAS
Users that are interested in STAS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official code for our paper entitled "Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning".☆10Aug 19, 2025Updated 8 months ago
- Official code for "DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data".☆26Oct 31, 2024Updated last year
- Pytorch implementation of AREL☆16Dec 20, 2021Updated 4 years ago
- [AAAI-25] Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning.☆32May 29, 2025Updated 11 months ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Nov 22, 2025Updated 5 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)☆15Aug 15, 2025Updated 8 months ago
- [ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc…☆24May 29, 2024Updated last year
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated 2 years ago
- ☆37Nov 15, 2025Updated 5 months ago
- Repository for "Known Unknowns: Uncertainty Quality in Bayesian Neural Networks" paper.☆12Mar 3, 2017Updated 9 years ago
- ☆29Oct 2, 2023Updated 2 years ago
- Ramp metering agent trained by TD3 algorithm using SUMO☆19Oct 15, 2023Updated 2 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- ☆27Nov 7, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The implementation of our IROS submission manuscript paper InteractionNet. Coming soon.☆25Mar 13, 2024Updated 2 years ago
- ☆12Jul 15, 2020Updated 5 years ago
- [NeurIPS 2023] Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model☆20Dec 9, 2023Updated 2 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- Codes of GoMARL accompanying the paper "Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning"(NeurIPS 2023). G…☆31Aug 14, 2024Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆41Feb 18, 2025Updated last year
- ☆13Jul 2, 2020Updated 5 years ago
- Adaptive Machine Learning-Based Stock Prediction using Financial Time Series Technical Indicators☆10Dec 21, 2019Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Inertial Measurement Unit (IMU) Driver based on on the MPU6050☆10Mar 23, 2020Updated 6 years ago
- Implementation of proof of concept quantum enhanced reinforced learning algorithm, able to find the sequence of quantum gates needed to a…☆15Mar 29, 2022Updated 4 years ago
- Federated Deep Reinforcement Learning for Swarm Robotic Systems☆10Jun 2, 2022Updated 3 years ago
- Code for the NeurIPS 2023 Paper: Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Sta…☆30Oct 29, 2023Updated 2 years ago
- A Deep Reinforcement Learning model for high volume and frequency Forex Portfolio Management☆13Jan 11, 2023Updated 3 years ago
- ST-GAT: Spatio-Temporal Graph Attention Network for TrafficFlow Prediction☆13Dec 1, 2022Updated 3 years ago
- Bulk and single-cell Multi-Omics ground truth Simulator in R☆12Feb 10, 2026Updated 2 months ago
- ICML'2024: HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning☆24Jun 3, 2024Updated last year
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LaTeX Template Satisfying McMaster Thesis Formatting Requirements☆16May 6, 2014Updated 12 years ago
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Mar 9, 2017Updated 9 years ago
- Open sourced implementation of a prototype for Hyperledger Fabric chaincode execution with OP-TEE. This work is part of the master thesis…☆14Sep 10, 2019Updated 6 years ago
- ☆11May 24, 2020Updated 5 years ago
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆45Oct 14, 2023Updated 2 years ago
- ROS implementation of meta planning + FaSTrack!☆16Sep 23, 2020Updated 5 years ago
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆28Feb 8, 2020Updated 6 years ago