[ICML 2025] Official Code of SMPE: "Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration"
☆34Feb 9, 2026Updated 4 months ago
Alternatives and similar repositories for smpe
Users that are interested in smpe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code implementation of "Tree-based Focused Web Crawling with Reinforcement Learning" and the TRES framework☆24Feb 16, 2026Updated 3 months ago
- An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks☆55Updated this week
- Export Jupyter Notebooks to (Xe)LaTeX with Greek Support☆13Nov 25, 2018Updated 7 years ago
- LEGO Mindstorms 3D Printing-Milling Machine☆15Mar 14, 2018Updated 8 years ago
- Pytorch implementation of CVPR'16 paper "Learning Deep Representations of Fine-Grained Visual Descriptions", by Reed et al.☆18Aug 16, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient☆21Nov 13, 2024Updated last year
- Benchmark study of quality and faithfulness of counterfactual image generation☆30Apr 30, 2025Updated last year
- (ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolut…☆45Oct 14, 2023Updated 2 years ago
- This is the code for the paper Improved DDPG Based Two-Timescale Multi- Dimensional Resource Allocation for Multi-Access Edge Computing N…☆28May 6, 2025Updated last year
- DrillSat 2018☆16Oct 7, 2018Updated 7 years ago
- [ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding☆10Apr 15, 2025Updated last year
- [NeurIPS' 24] The PyTorch implementation of our paper: "Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learnin…☆21Oct 10, 2024Updated last year
- [ICML 2022] Official implementation of "Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems".☆12Jul 19, 2022Updated 3 years ago
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆19Feb 1, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A simple sudoku solver☆17Sep 1, 2018Updated 7 years ago
- ☆14Jun 5, 2020Updated 6 years ago
- [Main EMNLP'25] LLMs do Multi-Label Classification Differently☆15Feb 28, 2026Updated 3 months ago
- Official reinforcement learning environment for demand response and grid services. This repository is based on, but distinct from the ori…☆33Oct 13, 2021Updated 4 years ago
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- Use tensorflow2 achieve PPO to play atari game☆13Oct 25, 2019Updated 6 years ago
- Momentum Contrast for Unsupervised Visual Representation Learning☆16Mar 24, 2023Updated 3 years ago
- Pytorch implementations of GMM - HMM☆10Dec 28, 2020Updated 5 years ago
- [ECMLPKDD 2020] "Topological Insights into Sparse Neural Networks"☆13May 2, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆15Oct 9, 2022Updated 3 years ago
- This repo contains the original implementation of VAuLT, the Vision-and-Augmented-Language Transformer. We provide instructions to downlo…☆18Sep 23, 2025Updated 8 months ago
- Ground Station for the CanSat in Greece competition☆30Apr 18, 2019Updated 7 years ago
- PRML Page-by-page配套资料,对PRML全书及各章节的review☆17Apr 16, 2024Updated 2 years ago
- ☆32May 15, 2022Updated 4 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆23Jul 6, 2023Updated 2 years ago
- 💫 Automated codification of Greek Legislation with NLP☆43Nov 22, 2022Updated 3 years ago
- The Emergence of Individuality☆13Oct 16, 2021Updated 4 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- (Official) PyTorch implementation for Trajectory-Class-Aware Multi-Agent Reinforcement Learning (ICLR 2025)☆28Nov 25, 2025Updated 6 months ago
- RLVR for LLMs in optimization modeling☆57Apr 15, 2026Updated last month
- ☆26Apr 16, 2024Updated 2 years ago
- [ICASSP'23] This repo contains code for the Demux & MEmo emotion recognition models (https://arxiv.org/abs/2210.15842), as well as code t…☆23Jan 18, 2024Updated 2 years ago
- Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics☆34Feb 10, 2025Updated last year
- Repo for the multi-agent PressurePlate environment☆19Feb 4, 2022Updated 4 years ago
- ☆18Jul 14, 2023Updated 2 years ago