Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning
☆16 · Mar 11, 2020 · Updated 5 years ago
Alternatives and similar repositories for RFQ-RFAC
Users that are interested in RFQ-RFAC are comparing it to the libraries listed below
- This is the code repository for the paper "Zero-Sum Stochastic Stackelberg Games". ☆16 · Oct 12, 2022 · Updated 3 years ago
- The code for the AAMAS 2022 paper "GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning" ☆44 · Dec 31, 2021 · Updated 4 years ago
- ☆14 · Nov 26, 2022 · Updated 3 years ago
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning" ☆11 · Apr 8, 2025 · Updated 10 months ago
- Multi-task Multi-agent Soft Actor Critic for SMAC ☆15 · Jan 18, 2022 · Updated 4 years ago
- ☆16 · Oct 6, 2019 · Updated 6 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021 ☆15 · Mar 9, 2021 · Updated 4 years ago
- Implementation of CoDAIL in the ICLR 2020 paper "Multi-Agent Interactions Modeling with Correlated Policies" ☆19 · Jun 17, 2021 · Updated 4 years ago
- ☆20 · Sep 11, 2021 · Updated 4 years ago
- ☆20 · May 22, 2023 · Updated 2 years ago
- ☆27 · Dec 20, 2021 · Updated 4 years ago
- Code for "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning", ICML 2021 ☆67 · May 22, 2021 · Updated 4 years ago
- ☆33 · Dec 8, 2022 · Updated 3 years ago
- Multi Type Mean Field Reinforcement Learning ☆31 · Jun 13, 2022 · Updated 3 years ago
- Code for the paper "Hierarchical Reinforcement Learning With Timed Subgoals", published at NeurIPS 2021 ☆36 · Jul 6, 2022 · Updated 3 years ago
- Source code of a Master's thesis on "Decode and Forward Relay Assisting Active Jamming in NOMA system". ☆12 · Feb 26, 2024 · Updated 2 years ago
- A project for simulating rescue operations after a disaster ☆10 · Dec 4, 2020 · Updated 5 years ago
- Introduction to Gaussian Processes ☆11 · Jan 13, 2024 · Updated 2 years ago
- FEN Code ☆41 · Nov 4, 2019 · Updated 6 years ago
- Twitter-NFT sales bot that tweets individual and sweep sales with images from Opensea, Looksrare, X2Y2, and Blur using Opensea/Looksrare … ☆13 · Jul 27, 2023 · Updated 2 years ago
- Coresets ☆38 · Apr 24, 2022 · Updated 3 years ago
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning ☆40 · Aug 17, 2022 · Updated 3 years ago
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020) ☆88 · Dec 8, 2022 · Updated 3 years ago
- Optimal placement of edge servers using K-means clustering and power allocation using Particle Swarm Optimization ☆13 · Nov 22, 2021 · Updated 4 years ago
- Analysis of data from the videogame/eSport League of Legends ☆12 · Nov 4, 2019 · Updated 6 years ago
- Public package to compute translationally and rotationally invariant wavelet-based statistics on images. ☆10 · Aug 25, 2023 · Updated 2 years ago
- Modified datasets for remote sensing image captioning ☆11 · Apr 23, 2019 · Updated 6 years ago
- ☆11 · Jun 28, 2022 · Updated 3 years ago
- ☆10 · Feb 28, 2019 · Updated 7 years ago
- ☆10 · Apr 26, 2023 · Updated 2 years ago
- The source code for an online Human-Agent Interaction (HAI) system, controlling directions (left or right) of Unity games using online EE… ☆10 · Jan 17, 2021 · Updated 5 years ago
- Continual learning strategies (EWC, GEM) for the rotated MNIST dataset ☆12 · Apr 6, 2020 · Updated 5 years ago
- Salient Objects in Clutter, arXiv, 2021 (ECCV 2018 extension). ☆11 · Jun 17, 2021 · Updated 4 years ago
- ♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation ☆10 · Jun 20, 2021 · Updated 4 years ago
- OpenAI Gym interfaces for multi-robot flocking problems ☆40 · May 1, 2021 · Updated 4 years ago
- Implementation of the X-armed Bandits algorithm, as detailed in the paper "X-armed Bandits", Bubeck et al., 2011. ☆11 · Jul 12, 2018 · Updated 7 years ago
- ☆10 · Aug 13, 2022 · Updated 3 years ago
- ☆12 · Jun 2, 2021 · Updated 4 years ago
- Official code repository for "Sim-to-Real Deep Reinforcement Learning for UAV Obstacle Avoidance Under Measurement Uncertainty" ☆16 · Jul 12, 2023 · Updated 2 years ago