xmu-rl-3dv/RiskQ

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xmu-rl-3dv/RiskQ)

xmu-rl-3dv / RiskQ

☆15

Alternatives and similar repositories for RiskQ

Users that are interested in RiskQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xmu-rl-3dv / ResQ
View on GitHub
The source code of "ResQ: A Residual Q Function-based Approach for Multi-Agent Reinforcement Learning Value Factorization. NeurIPS 2022"
☆19Oct 17, 2022Updated 3 years ago
xmu-rl-3dv / DoF
View on GitHub
☆18Feb 24, 2025Updated last year
KaiYan289 / RL_as_Vitamin_for_Online_Decision_Transformers
View on GitHub
☆16Dec 5, 2024Updated last year
benellis3 / pymarl2
View on GitHub
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
☆19Aug 20, 2023Updated 2 years ago
j3soon / dfac
View on GitHub
[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
☆31Jun 1, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
LXXXXR / Kaleidoscope
View on GitHub
[NeurIPS' 24] The PyTorch implementation of our paper: "Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learnin…
☆21Oct 10, 2024Updated last year
LXXXXR / ICES
View on GitHub
[ICML' 24] The PyTorch implementation of our paper: "Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforc…
☆25May 29, 2024Updated 2 years ago
RaghuHemadri / Multi-Agent-Reinforcement-Learning-Survey-Papers
View on GitHub
☆35Jul 14, 2021Updated 5 years ago
ling-pan / OMAR
View on GitHub
☆55Jul 21, 2022Updated 4 years ago
saizhang0218 / TMC
View on GitHub
Pytorch implementation of "Succinct and Robust Multi-Agent Communication With Temporal Message Control"
☆27Dec 6, 2020Updated 5 years ago
Pavankunchala / Reinforcement-learning-with-verifable-rewards-Learnings
View on GitHub
RLVR Testing and Training
☆21Aug 28, 2025Updated 11 months ago
junaiddk / transmix
View on GitHub
TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning
☆11Oct 18, 2022Updated 3 years ago
TonghanWang / EITI-EDTI
View on GitHub
Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)
☆34Mar 16, 2020Updated 6 years ago
EzgiKorkmaz / generalization-reinforcement-learning
View on GitHub
A Survey Analyzing Generalization in Deep Reinforcement Learning
☆36Oct 31, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
WeihaoTan / gym-macro-overcooked
View on GitHub
☆16May 11, 2023Updated 3 years ago
secury / optidice
View on GitHub
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
☆16Aug 3, 2023Updated 2 years ago
zzq-bot / offline-marl-framework-offpymarl
View on GitHub
Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.
☆35Oct 7, 2024Updated last year
SafeRL-Lab / Robust-RL-Baselines
View on GitHub
Robust Reinforcement Learning Benchmark
☆13Sep 22, 2024Updated last year
zoeyuchao / MPE-pytorch
View on GitHub
This is MPE-pytorch, fix some bugs.
☆11Apr 26, 2020Updated 6 years ago
maoliyuan / ODICE-Pytorch
View on GitHub
official implementation of ODICE
☆19Jan 31, 2024Updated 2 years ago
cwj22 / BeT-AIL
View on GitHub
☆13Mar 18, 2024Updated 2 years ago
KAIST-AILab / gmmil
View on GitHub
Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"
☆11Oct 2, 2018Updated 7 years ago
john-hewitt / truncation-sampling
View on GitHub
Codebase describing experiments in Truncation Sampling as Language Model Desmoothing
☆13Dec 6, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zichuan-liu / NA2Q
View on GitHub
[ICML'23] Official PyTorch Implementation of NA2Q, and a comprehensive benchmark based on pymarl
☆24Jan 14, 2024Updated 2 years ago
jinala / multi-agent-neurosym-transformers
View on GitHub
Neurosymbolic transformers for multi-agent communication.
☆23Oct 22, 2020Updated 5 years ago
uoe-agents / PO-GPL
View on GitHub
Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"
☆15Mar 1, 2023Updated 3 years ago
rtavenar / MatchAndDeform
View on GitHub
☆12Feb 29, 2024Updated 2 years ago
JernejPuc / sidegame-py
View on GitHub
SiDeGame - Simplified Defusal Game
☆12Apr 17, 2025Updated last year
222464 / TeensyAtariPlayingAgent
View on GitHub
An agent for playing Atari games running on a Teensy microcontroller
☆14Nov 11, 2022Updated 3 years ago
retna319 / SMNN
View on GitHub
Scalable Monotonic Neural Networks
☆12Mar 14, 2024Updated 2 years ago
tjuHaoXiaotian / Qfamily_for_MatrixGame
View on GitHub
We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.
☆16Jul 18, 2022Updated 4 years ago
NrLabFreiburg / inverse-q-learning
View on GitHub
☆15Oct 16, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
antonio-f / Dynamic-Programming
View on GitHub
Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…
☆11Apr 3, 2019Updated 7 years ago
lab-v2 / pyreason-rl-sim
View on GitHub
☆15Sep 28, 2023Updated 2 years ago
anthonybrunel / FLYBO
View on GitHub
A Unified Benchmark Environment for Autonomous Flying Robots
☆14Jul 12, 2024Updated 2 years ago
rajevv / Multi_L2D
View on GitHub
Code for Learning to Defer to Multiple Experts: Consistent Surrogate Losses, Confidence Calibration, and Conformal Ensembles [AISTATS'23]
☆13Jul 28, 2023Updated 3 years ago
CNDOTA / NeurIPS22-ATM
View on GitHub
☆15Oct 9, 2022Updated 3 years ago
BBVA / UMAL
View on GitHub
Modelling heterogeneous distributions with an Uncountable Mixture of Asymmetric Laplacians
☆18Oct 27, 2019Updated 6 years ago
sebjai / robust-risk-aware-rl
View on GitHub
Some implementations from the paper robust risk aware reinforcement learning
☆37Dec 15, 2021Updated 4 years ago