jiechuanjiang/I2Q

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jiechuanjiang/I2Q)

jiechuanjiang / I2Q

I2Q: A Fully Decentralized Q-Learning Algorithm

☆19

Alternatives and similar repositories for I2Q

Users that are interested in I2Q are comparing it to the libraries listed below

Sorting:

uber-research / D3G
View on GitHub
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
PKU-RL / RoadnetSZ
View on GitHub
☆17Feb 17, 2023Updated 3 years ago
uoe-agents / seps
View on GitHub
Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)
☆25Oct 26, 2021Updated 4 years ago
garrett4wade / revisiting_marl
View on GitHub
Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)
☆23Jul 16, 2022Updated 3 years ago
011235813 / cm3
View on GitHub
Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning
☆58Jun 13, 2022Updated 3 years ago
jiechuanjiang / GENE
View on GitHub
Generative Exploration and Exploitation
☆24Nov 27, 2021Updated 4 years ago
yeshenpy / RACE
View on GitHub
(ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…
☆42Oct 14, 2023Updated 2 years ago
nicknochnack / LongSpeechTranscription
View on GitHub
Transcribing long blocks of speech using Watson Speech To Text.
☆11Sep 24, 2020Updated 5 years ago
jjgonde / Alicante-Murcia-SUMO-Scenario
View on GitHub
Calibrated Alicante-Murcia Freeway SUMO Scenario
☆11Nov 28, 2019Updated 6 years ago
wangsssky / Refined-training-set-of-URPC2019
View on GitHub
underwater dataset, open-data
☆11Aug 22, 2021Updated 4 years ago
PKU-RL / CORRO
View on GitHub
[ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
☆40Aug 17, 2022Updated 3 years ago
vaibkumr / RL_from_scratch
View on GitHub
This is where I write RL related stuff from scratch
☆10Dec 15, 2019Updated 6 years ago
schwartenbeckph / Mechanisms_Exploration_Paper
View on GitHub
Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"
☆10May 22, 2020Updated 5 years ago
vickipedia6 / Tennis-Deep-Reinforcement-Learning
View on GitHub
Training Multiple agents in the same environment to collaborate and compete with each other
☆12Dec 1, 2019Updated 6 years ago
kuleshov / neural-variational-inference
View on GitHub
Neural variational inference and learning in undirected graphical models http://www.stanford.edu/~kuleshov/papers/nips2017.pdf
☆17Apr 25, 2018Updated 7 years ago
TihanyiD / multi_alloc
View on GitHub
☆12Apr 12, 2022Updated 3 years ago
byyx666 / ArchCraft
View on GitHub
This is official code implementation of the <Revisiting Neural Networks for Continual Learning: An Architectural Perspective> in IJCAI 20…
☆13Nov 25, 2024Updated last year
zhaoxuhui / A-Car-with-Stereo-and-IMU-for-Isaac-Sim
View on GitHub
☆12Feb 19, 2023Updated 3 years ago
ckling / promoss
View on GitHub
Promoss Topic Modelling Toolbox
☆11Jan 21, 2019Updated 7 years ago
LAMDA-RL / ODIS
View on GitHub
The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".
☆48Oct 31, 2024Updated last year
jonasrothfuss / model_ensemble_meta_learning
View on GitHub
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Nov 15, 2018Updated 7 years ago
prasoongoyal / bnp-vae
View on GitHub
☆11Mar 9, 2018Updated 8 years ago
xiaomi-research / dar
View on GitHub
DAR introduces the diagonal scanning order for next-token prediction and proposes a direction-aware autoregressive transformer framework.
☆18Apr 16, 2025Updated 10 months ago
AdityaRadya / Optimal-Routing-With-Genetic-Algorithm
View on GitHub
This is a program that calculates and finds the most optimal cost in a routing problem. The routing problem is described as having a Prod…
☆11Sep 10, 2018Updated 7 years ago
librahu / HACUD
View on GitHub
Source code for AAAI2019 paper "Cash-out User Detection based on Attributed Heterogeneous Information Network with a Hierarchical Attenti…
☆15Nov 12, 2018Updated 7 years ago
gsavarela / networked_agents
View on GitHub
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
☆12Jan 14, 2022Updated 4 years ago
zhuliwen / RoadnetSZ
View on GitHub
☆44May 8, 2024Updated last year
tomazas / itc2017
View on GitHub
Code for paper "Application of Convolutional Neural Networks to Four-Class Motor Imagery Classification Problem"
☆12Aug 31, 2017Updated 8 years ago
astirn / MV-Kumaraswamy
View on GitHub
☆13Dec 8, 2022Updated 3 years ago
mgerstgrasser / super
View on GitHub
suPER is a collaborative multi-agent RL algorithm
☆14Jun 11, 2024Updated last year
distributed-information-bottleneck / distributed-information-bottleneck.github.io
View on GitHub
A repository for using the distributed information bottleneck to locate information in data
☆17Aug 26, 2024Updated last year
tgangwani / GuidanceRewards
View on GitHub
Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)
☆12Jul 7, 2021Updated 4 years ago
andrekuros / Multi-UAV-TA-gym-env
View on GitHub
An Gym based enviroment to evaluate Multi Uav Task Alocation Algorithm
☆13Feb 9, 2024Updated 2 years ago
ActiveInferenceInstitute / GeneralizedNotationNotation
View on GitHub
☆22Updated this week
angelobanse / sumoScenarioGenerator
View on GitHub
SUMO Scenario Generator is a web application that generates and downloads the necessary files to start a basic road traffic simulation in…
☆12Jun 25, 2020Updated 5 years ago
PKU-RL / I2C
View on GitHub
☆46Jun 29, 2021Updated 4 years ago
HyunghoNa / EMU
View on GitHub
(Official) PyTorch implementation for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning (EMU) (ICLR…
☆53May 23, 2024Updated last year
gunnarfloetteroed / java
View on GitHub
☆13May 20, 2022Updated 3 years ago
grasp-lyrl / scalableMARL
View on GitHub
Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking (IROS22).
☆11Jul 22, 2022Updated 3 years ago