A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models
☆53Apr 1, 2025Updated last year
Alternatives and similar repositories for LLM-SMAC
Users that are interested in LLM-SMAC are comparing it to the libraries listed below.
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆43Jul 24, 2025Updated 9 months ago
- TextStarCraft2, a pure-language environment that supports LLMs playing StarCraft II☆337Apr 25, 2025Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆56Aug 30, 2024Updated last year
- Model-based reinforcement learning algorithms for unstable baselines☆15May 9, 2023Updated 3 years ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆32Nov 12, 2024Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆18Nov 24, 2025Updated 5 months ago
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆38Jun 14, 2024Updated last year
- GPU-based Massively Parallel Environments for Large-Scale Combinatorial Optimization (CO) Problems Using Reinforcement Learning☆31Mar 16, 2026Updated 2 months ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆23Jul 6, 2023Updated 2 years ago
- Update kmeans in linemod into tsne and other clustering efforts.☆14Apr 15, 2019Updated 7 years ago
- [EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models☆18Oct 21, 2023Updated 2 years ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 6 months ago
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated 2 years ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆155Apr 24, 2025Updated last year
- Simulators and baselines for ATEC 2025 software algorithm track (online competition)☆11Apr 13, 2025Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆21Dec 14, 2024Updated last year
- A ROS 2 package integrating the Soar cognitive architecture into the ROS ecosystem.☆19Apr 26, 2026Updated 3 weeks ago
- Implementation of a vision-language multimodal model based on the lightweight Qwen2.5-0.5B and SigLIP, including training and SFT code. Shares the training and SFT code and documents the exploration and learning process. Discussion is welcome.☆20Aug 31, 2025Updated 8 months ago
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆18Feb 21, 2025Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆15Aug 12, 2024Updated last year
- Given one example of an annotated part, this model finds its semantic correspondences in a target image. Thus you get - one-shot semantic…☆29Sep 15, 2022Updated 3 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆59Dec 27, 2023Updated 2 years ago
- Robust Multi-Agent Reinforcement Learning with State Uncertainty☆12May 30, 2023Updated 2 years ago
- Optical flow with convolutional neural networks for vision-based guidance of UAS☆11Aug 23, 2017Updated 8 years ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆40Jul 13, 2024Updated last year
- safety analysis for hard-to-specify failures☆30Apr 19, 2026Updated last month
- Overcooked human-AI experiment platform☆39Dec 21, 2023Updated 2 years ago
- The code for the AAMAS 2022 paper "GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning"☆45Dec 31, 2021Updated 4 years ago
- Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is …☆86Dec 17, 2024Updated last year
- Score and Distribution Matching Policy: Advanced accelerated Visuomotor Policies via matched distillation☆11May 9, 2025Updated last year
- MATE: the Multi-Agent Tracking Environment.☆43Mar 31, 2023Updated 3 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago