A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models
☆53Apr 1, 2025Updated last year
Alternatives and similar repositories for LLM-SMAC
Users that are interested in LLM-SMAC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆19Jun 11, 2024Updated last year
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆43May 29, 2026Updated last week
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆57Aug 30, 2024Updated last year
- model based reinforcement learning algorithms for unstable baselines☆15May 9, 2023Updated 3 years ago
- ☆50Jul 23, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆32Nov 12, 2024Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆17Nov 24, 2025Updated 6 months ago
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆39Jun 14, 2024Updated last year
- GPU-based Massively Parallel Environments for Large-Scale Combinatorial Optimization (CO) Problems Using Reinforcement Learning☆31Mar 16, 2026Updated 2 months ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆23Jul 6, 2023Updated 2 years ago
- This repo supports integrating LLMs and communication algorithms with MARL using SMAC as the platform. It provides an end-to-end workflow…☆20Mar 8, 2025Updated last year
- ☆19Aug 22, 2025Updated 9 months ago
- [EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models☆18Oct 21, 2023Updated 2 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 7 months ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Nov 29, 2022Updated 3 years ago
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated 2 years ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆156Apr 24, 2025Updated last year
- Simulators and baselines for ATEC 2025 software algorithm track (online competition)☆11Apr 13, 2025Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆21Dec 14, 2024Updated last year
- ☆16Jul 16, 2024Updated last year
- A ROS 2 package integrating the Soar cognitive architecture into the ROS ecosystem.☆19Jun 1, 2026Updated last week
- ☆15May 11, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆23Jan 22, 2024Updated 2 years ago
- ☆18Oct 9, 2024Updated last year
- 基于轻量级 Qwen2.5-0.5B 和 SigLIP 的视觉语言多模态模型实现,包含训练和 SFT 代码。分享训练和 SFT 相关代码,记录一下探索和学习的过程。欢迎一起交流讨论~☆20Aug 31, 2025Updated 9 months ago
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆18Feb 21, 2025Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆15Aug 12, 2024Updated last year
- ☆13May 13, 2025Updated last year
- ☆39Feb 29, 2024Updated 2 years ago
- Given one example of an annotated part, this model finds its semantic correspondences in a target image. Thus you get - one-shot semantic…☆29Sep 15, 2022Updated 3 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆59Dec 27, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆74Jun 13, 2024Updated last year
- Robust Multi-Agent Reinforcement Learning with State Uncertainty☆12May 30, 2023Updated 3 years ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆40Jul 13, 2024Updated last year
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆33Sep 25, 2022Updated 3 years ago
- The code for AAMAS2022 《GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning》☆45Dec 31, 2021Updated 4 years ago
- Codes accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆87Dec 17, 2024Updated last year
- Score and Distribution Matching Policy: Advanced accelerated Visuomotor Policies via matched distillation☆12May 9, 2025Updated last year