A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models
☆51Apr 1, 2025Updated last year
Alternatives and similar repositories for LLM-SMAC
Users that are interested in LLM-SMAC are comparing it to the libraries listed below.
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆42Jul 24, 2025Updated 9 months ago
- TextStarCraft2, a pure language environment that supports LLMs playing StarCraft II☆313Apr 25, 2025Updated last year
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games☆56Aug 30, 2024Updated last year
- model based reinforcement learning algorithms for unstable baselines☆14May 9, 2023Updated 2 years ago
- ☆50Jul 23, 2021Updated 4 years ago
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆30Nov 12, 2024Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆18Nov 24, 2025Updated 5 months ago
- LLM-Empowered State Representation for Reinforcement Learning (ICML2024 Accepted paper)☆38Jun 14, 2024Updated last year
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆23Jul 6, 2023Updated 2 years ago
- GPU-based Massively Parallel Environments for Large-Scale Combinatorial Optimization (CO) Problems Using Reinforcement Learning☆31Mar 16, 2026Updated last month
- Update kmeans in linemod into tsne and other clustering efforts.☆14Apr 15, 2019Updated 7 years ago
- [EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models☆18Oct 21, 2023Updated 2 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 10 months ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆14Nov 4, 2025Updated 5 months ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Nov 29, 2022Updated 3 years ago
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated 2 years ago
- Simulators and baselines for ATEC 2025 software algorithm track (online competition)☆11Apr 13, 2025Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆21Dec 14, 2024Updated last year
- ☆16Jul 16, 2024Updated last year
- A ROS 2 package integrating the Soar cognitive architecture into the ROS ecosystem.☆19Updated this week
- ☆15May 11, 2023Updated 2 years ago
- ☆12Apr 17, 2023Updated 3 years ago
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl…☆23Jan 22, 2024Updated 2 years ago
- ☆17Oct 9, 2024Updated last year
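- A vision-language multimodal model implementation based on the lightweight Qwen2.5-0.5B and SigLIP, including training and SFT code. Shares the training and SFT code and documents the exploration and learning process. Discussion is welcome.☆20Aug 31, 2025Updated 8 months ago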
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆18Feb 21, 2025Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆15Aug 12, 2024Updated last year
- ☆13May 13, 2025Updated 11 months ago
- ☆39Feb 29, 2024Updated 2 years ago
- Given one example of an annotated part, this model finds its semantic correspondences in a target image. Thus you get - one-shot semantic…☆29Sep 15, 2022Updated 3 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning"☆59Dec 27, 2023Updated 2 years ago
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆73Jun 13, 2024Updated last year
- Robust Multi-Agent Reinforcement Learning with State Uncertainty☆12May 30, 2023Updated 2 years ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆39Jul 13, 2024Updated last year
- safety analysis for hard-to-specify failures☆30Apr 19, 2026Updated last week
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Nov 22, 2025Updated 5 months ago
- Overcooked human-AI experiment platform☆39Dec 21, 2023Updated 2 years ago
- Code for "ALMA: Hierarchical Learning for Composite Multi-Agent Tasks" NeurIPS 2022☆33Sep 25, 2022Updated 3 years ago