wwxFromTju / maddpg-tfView external linksLinks
use tensorflow to implement the MADDPG(simple_tag)
☆18Jan 7, 2018Updated 8 years ago
Alternatives and similar repositories for maddpg-tf
Users that are interested in maddpg-tf are comparing it to the libraries listed below
Sorting:
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13May 16, 2019Updated 6 years ago
- an implementation of CommNet☆35Nov 14, 2017Updated 8 years ago
- scalable multi agents reinforcement learning☆63Apr 20, 2018Updated 7 years ago
- Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.☆107Dec 6, 2020Updated 5 years ago
- ☆22Sep 28, 2018Updated 7 years ago
- CommNet and BiCnet implementation in tensorflow☆56Jul 27, 2018Updated 7 years ago
- Code for "SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning" AAAI 2020☆26Dec 8, 2022Updated 3 years ago
- Implementation of Relational Deep Reinforcement Learning☆25Jan 31, 2020Updated 6 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Oct 3, 2023Updated 2 years ago
- ☆32Jun 25, 2018Updated 7 years ago
- Learning Individual Intrinsic Reward in MARL☆63Dec 8, 2022Updated 3 years ago
- An RPG Maker MZ plugin☆12Nov 2, 2023Updated 2 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆32Oct 9, 2018Updated 7 years ago
- ☆91Oct 23, 2021Updated 4 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- Pytorch implementation of NASA: NEURAL ARTICULATED SHAPE APPROXIMATION☆12May 4, 2021Updated 4 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆15Feb 23, 2025Updated 11 months ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- Simple implementation of an AABB Tree (Axis Aligned Bounding Box Tree) to optimize 3d collision detection☆10Oct 22, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- [JBHI 2024] HierAttn: Deeply Supervised Skin Lesions Diagnosis with Stage and Branch Attention☆11Nov 16, 2024Updated last year
- ☆10Nov 27, 2019Updated 6 years ago
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Feb 18, 2025Updated 11 months ago
- Implimenting DDPG Algorithm in Tensorflow-2.0☆10Mar 25, 2023Updated 2 years ago
- 强化学习面试(未完待续)☆34Dec 20, 2019Updated 6 years ago
- ☆16Mar 14, 2025Updated 11 months ago
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…☆11Jun 18, 2018Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- [Review] Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environment☆10Dec 22, 2018Updated 7 years ago
- Implementation of Soft Actor-Critic (SAC) algorithm using TensorFlow 2.1.0☆12May 13, 2020Updated 5 years ago
- ☆11Mar 5, 2024Updated last year
- Implementations of different reinforcement learning algorithms☆10Aug 23, 2018Updated 7 years ago
- Attend - to what matters.☆17Feb 22, 2025Updated 11 months ago
- We provide a very simple implementation of the typical value decomposition methods for solving single state Matrix Games.☆16Jul 18, 2022Updated 3 years ago
- A short guide and example on how to fine-tune OpenAI's gpt-3.5-turbo for better roleplay☆14Aug 26, 2023Updated 2 years ago
- [CVPR 2025 - HuMoGen] "MDMP: Multi-modal Diffusion for supervised Motion Predictions with uncertainty"☆16Mar 12, 2025Updated 11 months ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago