Enabling Mixed Opponent Strategy Script and Self-play on SMAC
☆41 Jul 24, 2025 Updated 7 months ago
Alternatives and similar repositories for smac-hard
Users who are interested in smac-hard are comparing it to the libraries listed below.
- A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models ☆51 Apr 1, 2025 Updated 11 months ago
- GPU-based Massively Parallel Environments for Large-Scale Combinatorial Optimization (CO) Problems Using Reinforcement Learning ☆28 Feb 6, 2026 Updated 3 weeks ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024) ☆12 May 6, 2024 Updated last year
- ☆32 Jan 4, 2026 Updated last month
- A simple and efficient llama3 local service deployment solution that supports real-time streaming response and is optimized for common Ch… ☆13 Jul 31, 2024 Updated last year
- A simple 2D ball collision engine. ☆12 Jun 15, 2023 Updated 2 years ago
- The official implementation of "Transformer in Transformer as Backbone for Deep Reinforcement Learning" ☆56 Dec 27, 2023 Updated 2 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1) ☆32 Nov 22, 2025 Updated 3 months ago
- Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback” ☆30 Dec 7, 2025 Updated 2 months ago
- Official implementation of paper: LiNo: Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Serie… ☆18 Dec 19, 2025 Updated 2 months ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation ☆28 Feb 25, 2025 Updated last year
- M^3PC: Test-Time Model Predictive Control for Pretrained Masked Trajectory Model, ICLR 2025 ☆19 Mar 17, 2025 Updated 11 months ago
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024. ☆42 May 8, 2024 Updated last year
- ☆25 Jun 10, 2025 Updated 8 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers ☆23 Feb 9, 2025 Updated last year
- MuZero for Combinatorial Action Spaces: open-source codebase for MA-Gumbel-AlphaZero, MA-Sampled-AlphaZero, MA-Gumbel-MuZero and MA-Sampl… ☆22 Jan 22, 2024 Updated 2 years ago
- Cutting-edge platform for LLM agent tuning. Deliver RL tuning with flexibility, reliability, speed, multi-agent optimization and realtime… ☆53 Updated this week
- A Sim-to-Real Single-Stage Planner for Off-Road Terrain ☆40 Jul 1, 2025 Updated 8 months ago
- Official Implementation of "NeuralPlane: An Efficiently Parallelizable Platform for Fixed-wing Aircraft Control with Reinforcement Learni… ☆57 Dec 17, 2024 Updated last year
- Adversarial Reinforcement Learning papers (single-agent setting and multi-agent setting) ☆76 Sep 5, 2022 Updated 3 years ago
- A Massively Parallel Large Scale Self-Play Framework ☆361 Jan 9, 2023 Updated 3 years ago
- DELT: Data Efficacy for Language Model Training ☆43 Feb 12, 2026 Updated 2 weeks ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution ☆68 Dec 8, 2025 Updated 2 months ago
- ☆106 Jul 20, 2025 Updated 7 months ago
- Advanced control (iLQR, MPC, GNMS) examples with control toolbox in ROS ☆24 Apr 22, 2020 Updated 5 years ago
- ☆25 Sep 23, 2024 Updated last year
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025) ☆35 Jul 16, 2025 Updated 7 months ago
- Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL? ☆34 May 22, 2024 Updated last year
- An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks ☆44 Updated this week
- ☆26 Apr 21, 2023 Updated 2 years ago
- ☆50 Aug 27, 2025 Updated 6 months ago
- Scalable drone simulation using JAX. ☆62 Jan 28, 2026 Updated last month
- An environment based on JSBSIM aimed at one-to-one close air combat. ☆451 May 19, 2025 Updated 9 months ago
- ☆37 Apr 27, 2023 Updated 2 years ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su… ☆33 Sep 30, 2025 Updated 5 months ago
- [SIGKDD' 24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks ☆13 Jul 28, 2024 Updated last year
- ☆20 Oct 18, 2025 Updated 4 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024) ☆40 Jul 13, 2024 Updated last year
- This is a source repository for Multi-Agent Reinforcement Learning for Autonomous Driving research ☆40 Sep 11, 2024 Updated last year