☆26Apr 12, 2018Updated 8 years ago
Alternatives and similar repositories for qmix
Users that are interested in qmix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- qmix☆23May 28, 2020Updated 5 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13May 16, 2019Updated 6 years ago
- The project to learn the QMIX.☆13Dec 19, 2019Updated 6 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆16May 24, 2020Updated 5 years ago
- ☆10Feb 28, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the papers "Modeling the Second Player in Distributionally Robust Optimization" and "Distributionally Robust Models with Paramet…☆29Apr 14, 2022Updated 4 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- There will be updates later☆89May 13, 2019Updated 6 years ago
- ☆44Oct 27, 2018Updated 7 years ago
- ☆43Feb 12, 2020Updated 6 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- Level-Based Foraging (LBF): A multi-agent reinforcement learning environment☆58Sep 15, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Knowledge Distillation Algorithms implemented with PyTorch☆17Jul 23, 2019Updated 6 years ago
- ☆13Jul 2, 2025Updated 9 months ago
- Setup for Octo and some experiments with the model☆12Apr 11, 2024Updated 2 years ago
- 🔬 Absolutely comfort lab for me to work around with my own AIs and to empirically observe how powerful and impactful these technologies …☆28Jan 12, 2025Updated last year
- Implementation of DeDOL algorithm - Deep Reinforcement Learning based algorithm for Green Security Games with Real Time Information☆16Nov 7, 2019Updated 6 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆132Aug 14, 2023Updated 2 years ago
- Multi-Modal Imitation Learning in Partially Observable Environments☆14Sep 5, 2020Updated 5 years ago
- Resilient Multi-Agent Reinforcement Learning☆10Nov 4, 2022Updated 3 years ago
- ☆26Jun 4, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- BILIBILI.☆15Jan 6, 2019Updated 7 years ago
- ☆12Jun 17, 2022Updated 3 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆54Dec 8, 2022Updated 3 years ago
- ☆15Dec 31, 2020Updated 5 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- ☆19Oct 30, 2025Updated 5 months ago
- 浙江大学Beamer模板☆15May 19, 2022Updated 3 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆96Jul 27, 2022Updated 3 years ago
- Python neighbor-joining library. Goal: Efficient O(n^2) neighbor-joining algorithm.☆12May 5, 2014Updated 11 years ago
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 3 years ago
- SMAC: The StarCraft Multi-Agent Challenge☆1,340Feb 18, 2024Updated 2 years ago
- An implementation of Counterfactual Regret Minimization (CFR) via Temporal Difference (TD) learning☆22May 11, 2013Updated 12 years ago
- Partial implementation of ODE-GAN technique from the paper Training Generative Adversarial Networks by Solving Ordinary Differential Equa…☆16Nov 12, 2020Updated 5 years ago
- ☆35Apr 2, 2026Updated last week