LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmind's PySC2 Learning Environment API as a Python LLM Environment.
☆156Apr 24, 2025Updated last year
Alternatives and similar repositories for LLM-PySC2
Users that are interested in LLM-PySC2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TextStarCraft2,a pure language env which support llms play starcraft2☆344Apr 25, 2025Updated last year
- Hierarchical Expert Prompt for Large-Language-Models: An Approch Defeat Elite AI in TextStarCraft-II for the First Time☆60Oct 24, 2024Updated last year
- ☆37Jan 4, 2026Updated 5 months ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆45May 8, 2024Updated 2 years ago
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models☆53Apr 1, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆43Updated this week
- This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate…☆22May 29, 2024Updated 2 years ago
- This repo supports integrating LLMs and communication algorithms with MARL using SMAC as the platform. It provides an end-to-end workflow…☆20Mar 8, 2025Updated last year
- An environment based on JSBSIM aimed at one-to-one close air combat.☆481May 19, 2025Updated last year
- Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)☆712May 18, 2024Updated 2 years ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆50Mar 13, 2025Updated last year
- SMAC: The StarCraft Multi-Agent Challenge☆1,357Feb 18, 2024Updated 2 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆23Jan 22, 2024Updated 2 years ago
- A simple and efficient llama3 local service deployment solution that supports real-time streaming response and is optimized for common Ch…☆13Jul 31, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- (JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play …☆367Nov 9, 2022Updated 3 years ago
- A plotter for reinforcement learning (RL) using Weights & Biases☆14Dec 20, 2023Updated 2 years ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆61Nov 22, 2025Updated 6 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆276Oct 27, 2025Updated 7 months ago
- ☆18Jul 14, 2023Updated 2 years ago
- Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)☆171Dec 8, 2022Updated 3 years ago
- MATE: the Multi-Agent Tracking Environment.☆46Mar 31, 2023Updated 3 years ago
- pytorch implementation of SAC, TD3 and TD7 with Mujoco Benchmark results from 4 seeds.☆15Jul 4, 2024Updated last year
- An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks☆55Jun 12, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆25Feb 10, 2024Updated 2 years ago
- A distributed GPU-centric experience replay system for large AI models.☆19Aug 1, 2023Updated 2 years ago
- Formation Flight in Airsim Simulator☆33Aug 27, 2024Updated last year
- Python framework for the reverse engineering of Hamiltonian models of quantum systems through machine learning.☆22Mar 31, 2023Updated 3 years ago
- Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning☆25Jun 25, 2025Updated 11 months ago
- ☆19Oct 27, 2025Updated 7 months ago
- ☆13Aug 15, 2020Updated 5 years ago
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Python Multi-Agent Reinforcement Learning framework☆2,198Dec 8, 2022Updated 3 years ago
- The implementation of "An Imitative Reinforcement Learning Framework for Pursuit-Lock-Launch Missions"☆35Oct 29, 2025Updated 7 months ago
- ☆48Nov 29, 2021Updated 4 years ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- An extension of the PyMARL codebase that includes additional algorithms and environment support☆722Sep 24, 2024Updated last year
- 🐝 SwarmBench: Benchmarking LLMs' Swarm Intelligence☆34May 21, 2025Updated last year
- The implement of the policy gradient RL algorithm with pytorch☆41Dec 7, 2020Updated 5 years ago