☆26Apr 12, 2018Updated 7 years ago
Alternatives and similar repositories for qmix
Users who are interested in qmix are comparing it to the libraries listed below
- QMIX implemented in TensorFlow 2☆17Jun 12, 2021Updated 4 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13May 16, 2019Updated 6 years ago
- qmix☆23May 28, 2020Updated 5 years ago
- ☆17Dec 4, 2019Updated 6 years ago
- paper on dexpilot☆15Oct 14, 2019Updated 6 years ago
- The project to learn the QMIX.☆13Dec 19, 2019Updated 6 years ago
- a collection of DRL-repo in Github☆16Oct 21, 2020Updated 5 years ago
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- Multi-agent Reinforcement Learning Algorithms (COMA, VDN, QMIX)☆16May 24, 2020Updated 5 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- There will be updates later☆88May 13, 2019Updated 6 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆54Dec 8, 2022Updated 3 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"☆33Nov 30, 2023Updated 2 years ago
- Code for the papers "Modeling the Second Player in Distributionally Robust Optimization" and "Distributionally Robust Models with Paramet…☆29Apr 14, 2022Updated 3 years ago
- Implementation of the Discrete Soft Actor-Critic algorithm with RNN policy in PyTorch☆26Jan 7, 2023Updated 3 years ago
- ☆34Updated this week
- ☆17Oct 30, 2025Updated 4 months ago
- Implementation of [Learning Continuous Control Policies by Stochastic Value Gradients](https://arxiv.org/abs/1510.09142)☆25Jan 15, 2022Updated 4 years ago
- ☆35May 16, 2025Updated 9 months ago
- Jupyter notebooks of the simulations ran as part of a semester project on "Quantum Reinforcement Learning and Projective Simulation" at T…☆33Feb 22, 2019Updated 7 years ago
- Face Recognition on NVIDIA TX2☆10Sep 5, 2018Updated 7 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- Code for Weighted QMIX☆145Nov 12, 2020Updated 5 years ago
- ☆16Feb 22, 2025Updated last year
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆29Feb 23, 2026Updated last week
- ☆14Mar 21, 2024Updated last year
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- LLM Skirmish☆44Feb 3, 2026Updated last month
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- ☆10Jul 13, 2024Updated last year
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- ☆10May 23, 2023Updated 2 years ago
- ☆13May 3, 2024Updated last year