david-simoes-93 / Multi-agent-Double-Deep-Q-NetworksView external linksLinks
A multi-agent version of the Double DQN algorithm, with Foraging Task and Pursuit Game test scenarios
☆13Apr 24, 2017Updated 8 years ago
Alternatives and similar repositories for Multi-agent-Double-Deep-Q-Networks
Users that are interested in Multi-agent-Double-Deep-Q-Networks are comparing it to the libraries listed below
Sorting:
- Deep Q Network for Multi-agent RL☆15Oct 18, 2020Updated 5 years ago
- Project on multi agent reinforcement learning applied on patrolling agents☆40Dec 10, 2019Updated 6 years ago
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment☆11Oct 20, 2018Updated 7 years ago
- Adaptation of DQN, DDQN and COMA for multi-agent Gym environments☆11Oct 3, 2023Updated 2 years ago
- Continual Multi-agent Reinforcement Learning in Dynamic Environments☆11Jul 1, 2021Updated 4 years ago
- Research project - real-time multi-agent pursuit a moving target☆17Mar 13, 2021Updated 4 years ago
- Multiagent deep reinforcement learning research project☆28Jan 25, 2026Updated 2 weeks ago
- 第二届“泰迪杯”数据分析职业技能大赛A题☆10Sep 15, 2020Updated 5 years ago
- kaggle: IEEE-CIS Fraud Detection☆30Oct 10, 2019Updated 6 years ago
- Implement Google Deep Minds DQN for multiple agents for a grid world environment where vehicles must pick up customers.☆29Mar 7, 2018Updated 7 years ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 7 months ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆35May 14, 2019Updated 6 years ago
- ☆77Jan 5, 2018Updated 8 years ago
- Code for "Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning"☆36May 22, 2021Updated 4 years ago
- Deep Q-learning (DQN) for Multi-agent Reinforcement Learning (RL)☆355May 12, 2020Updated 5 years ago
- ☆12Jan 24, 2022Updated 4 years ago
- Tensorflow implementation of the paper "Fast Compressive Sensing Using Generative Model with Structed Latent Variables"☆10Apr 7, 2020Updated 5 years ago
- Information geometry and its extension information topology☆11Dec 2, 2017Updated 8 years ago
- CDbw Index For Cluster Validation☆10Mar 26, 2019Updated 6 years ago
- Classification of human emotion using multi-modal models☆12Jun 27, 2020Updated 5 years ago
- Implementation of the G2RL approach in the POGEMA environment☆13Jun 5, 2024Updated last year
- Fast Fourier Transform (FFT) implementation for Unity☆10Aug 15, 2017Updated 8 years ago
- The SPA Studios Sequencer Addon Re-Kindled☆12Feb 3, 2026Updated last week
- Evaluation Pipeline for medical tasks.☆12Updated this week
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 3 years ago
- 第八届“泰迪杯”数据挖掘挑战赛的一点心得☆10Nov 26, 2020Updated 5 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- ☆11Oct 2, 2020Updated 5 years ago
- ☆12Feb 19, 2025Updated 11 months ago
- Deep Learning based technique for Unsupervised Anomaly Detection using DeepAnT and LSTM Autoencoder☆11Jul 25, 2020Updated 5 years ago
- Sound2Mesh is an attempt at making a fully functionnal 3D spectrogram in blender.☆26Feb 4, 2026Updated last week
- Research code and scripts used in the Silburt et al. (2021) EMNLP 2021 paper 'FANATIC: FAst Noise-Aware TopIc Clustering'☆11Jul 6, 2023Updated 2 years ago
- Deep Counterfactual Prediction with Categorical Backward Variables☆12Feb 8, 2023Updated 3 years ago
- 📐 Results for the Ranking Metrics submission @ GLB 2022☆10Apr 6, 2022Updated 3 years ago
- character recognition, textline recognition☆10Aug 31, 2019Updated 6 years ago
- SDN for Intra-Vehicular Networks☆10May 25, 2021Updated 4 years ago
- A small memory game written in Unity with an MVC architecture using Zenject and UniRx.☆10Feb 21, 2022Updated 3 years ago
- Code for "Efficient Relation-aware Scoring Function Search for Knowledge Graph Embedding" ICDE 2021☆11Apr 26, 2021Updated 4 years ago
- ☆10Apr 11, 2022Updated 3 years ago