The code for paper 'STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning'
☆17Oct 6, 2024Updated last year
Alternatives and similar repositories for STAS
Users that are interested in STAS are comparing it to the libraries listed below.
- This is the official code for our paper entitled "Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning".☆10Aug 19, 2025Updated 9 months ago
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"☆20Mar 18, 2025Updated last year
- Pytorch implementation of AREL☆16Dec 20, 2021Updated 4 years ago
- [NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularizat…☆43Mar 3, 2024Updated 2 years ago
- Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)☆15Aug 15, 2025Updated 9 months ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated 2 years ago
- Repository for "Known Unknowns: Uncertainty Quality in Bayesian Neural Networks" paper.☆12Mar 3, 2017Updated 9 years ago
- ☆38Nov 15, 2025Updated 6 months ago
- Automated Continuous Data Quality Measurement☆12Nov 15, 2023Updated 2 years ago
- Ramp metering agent trained by TD3 algorithm using SUMO☆19Oct 15, 2023Updated 2 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- The implementation of our IROS submission manuscript paper InteractionNet. Coming soon.☆25Mar 13, 2024Updated 2 years ago
- ☆12Jul 15, 2020Updated 5 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- [NeurIPS 2023] Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model☆20Dec 9, 2023Updated 2 years ago
- Multi-view Reinforcement Learning☆11Feb 9, 2020Updated 6 years ago
- Codes of GoMARL accompanying the paper "Automatic Grouping for Efficient Cooperative Multi-Agent Reinforcement Learning"(NeurIPS 2023). G…☆32Aug 14, 2024Updated last year
- Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.☆41Feb 18, 2025Updated last year
- ☆13Jul 2, 2020Updated 5 years ago
- Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models☆256Oct 4, 2023Updated 2 years ago
- Inertial Measurement Unit (IMU) Driver based on the MPU6050☆10Mar 23, 2020Updated 6 years ago
- Official code for "Self-Distilled Agentic Reinforcement Learning"☆138Updated this week
- Implementation of proof of concept quantum enhanced reinforced learning algorithm, able to find the sequence of quantum gates needed to a…☆15Mar 29, 2022Updated 4 years ago
- Federated Deep Reinforcement Learning for Swarm Robotic Systems☆10Jun 2, 2022Updated 3 years ago
- Code for the NeurIPS 2023 Paper: Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Sta…☆30Oct 29, 2023Updated 2 years ago
- A Deep Reinforcement Learning model for high volume and frequency Forex Portfolio Management☆13Jan 11, 2023Updated 3 years ago
- Bulk and single-cell Multi-Omics ground truth Simulator in R☆12Feb 10, 2026Updated 3 months ago
- ST-GAT: Spatio-Temporal Graph Attention Network for Traffic Flow Prediction☆13Dec 1, 2022Updated 3 years ago
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆36Aug 20, 2025Updated 9 months ago
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- LaTeX Template Satisfying McMaster Thesis Formatting Requirements☆16May 6, 2014Updated 12 years ago
- ☆13Feb 10, 2021Updated 5 years ago
- ☆11May 24, 2020Updated 6 years ago
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆45Oct 14, 2023Updated 2 years ago
- ROS implementation of meta planning + FaSTrack!☆16Sep 23, 2020Updated 5 years ago
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆28Feb 8, 2020Updated 6 years ago
- ☆13Aug 15, 2020Updated 5 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Heterogeneous Multi-Robot Reinforcement Learning☆72Nov 10, 2025Updated 6 months ago