A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models
☆53Apr 1, 2025Updated last year
Alternatives and similar repositories for LLM-SMAC
Users that are interested in LLM-SMAC are comparing it to the libraries listed below.
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆20Jun 11, 2024Updated 2 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆43Jun 23, 2026Updated last week
- TextStarCraft2, a pure language environment that supports LLMs playing StarCraft II☆344Apr 25, 2025Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆57Aug 30, 2024Updated last year
- model based reinforcement learning algorithms for unstable baselines☆15May 9, 2023Updated 3 years ago
- ☆50Jul 23, 2021Updated 4 years ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆32Nov 12, 2024Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Nov 24, 2025Updated 7 months ago
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆41Jun 14, 2024Updated 2 years ago
- GPU-based Massively Parallel Environments for Large-Scale Combinatorial Optimization (CO) Problems Using Reinforcement Learning☆32Mar 16, 2026Updated 3 months ago
- ☆19Aug 22, 2025Updated 10 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated last year
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆15Nov 4, 2025Updated 7 months ago
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated 2 years ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆156Apr 24, 2025Updated last year
- Simulators and baselines for ATEC 2025 software algorithm track (online competition)☆11Apr 13, 2025Updated last year
- ☆16Jul 16, 2024Updated last year
- A ROS 2 package integrating the Soar cognitive architecture into the ROS ecosystem.☆19Jun 24, 2026Updated last week
- ☆15May 11, 2023Updated 3 years ago
- ☆12Apr 17, 2023Updated 3 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆23Jan 22, 2024Updated 2 years ago
- ☆18Oct 9, 2024Updated last year
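- An implementation of a vision-language multimodal model based on the lightweight Qwen2.5-0.5B and SigLIP, including training and SFT code. Shares the training and SFT code and documents the exploration and learning process. Discussion is welcome.☆20Aug 31, 2025Updated 10 months ago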
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆20Feb 21, 2025Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆15Aug 12, 2024Updated last year
- ☆14May 13, 2025Updated last year
- An implementation of ARMCI using MPI one-sided communication (RMA)☆17Oct 28, 2025Updated 8 months ago
- ☆39Feb 29, 2024Updated 2 years ago
- Given one example of an annotated part, this model finds its semantic correspondences in a target image. Thus you get - one-shot semantic…☆29Sep 15, 2022Updated 3 years ago
- ☆27May 19, 2025Updated last year
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆59Dec 27, 2023Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆76Jun 13, 2024Updated 2 years ago
- Robust Multi-Agent Reinforcement Learning with State Uncertainty☆12May 30, 2023Updated 3 years ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆42Jul 13, 2024Updated last year
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Nov 22, 2025Updated 7 months ago
- Overcooked human-AI experiment platform☆40Dec 21, 2023Updated 2 years ago
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆33Sep 25, 2022Updated 3 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆45Dec 31, 2021Updated 4 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆88Dec 17, 2024Updated last year