This is a repository for Hidden-utility Self-Play.
☆26Jul 27, 2023Updated 2 years ago
Alternatives and similar repositories for HSP
Users that are interested in HSP are comparing it to the libraries listed below
Sorting:
- Overcooked human-AI experiment platform☆39Dec 21, 2023Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Nov 29, 2022Updated 3 years ago
- Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method☆47Sep 11, 2024Updated last year
- ☆12Jan 4, 2024Updated 2 years ago
- (AAAI24 oral) Implementation of RPPO(Risk-sensitive PPO) and RPBT(Population-based self-play with RPPO)☆12May 22, 2023Updated 2 years ago
- ☆14Jul 12, 2021Updated 4 years ago
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆158Nov 6, 2023Updated 2 years ago
- The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".☆47Oct 31, 2024Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Apr 17, 2023Updated 2 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆26Jan 27, 2026Updated last month
- Collection of RL Environments built using Madrona☆37Aug 11, 2023Updated 2 years ago
- [ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning☆32Jun 1, 2023Updated 2 years ago
- Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021.☆16May 1, 2024Updated last year
- Examples of some fast databinding in Angular2☆11Jun 26, 2015Updated 10 years ago
- An environment for table-carrying, a joint-action cooperative task.☆10Jan 8, 2024Updated 2 years ago
- ☆17Dec 23, 2025Updated 2 months ago
- ☆13Dec 14, 2024Updated last year
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks"☆24Jun 8, 2025Updated 8 months ago
- ☆12May 14, 2024Updated last year
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- Code for "Traffic Signal Cycle Control with Centralized Critic and Decentralized Actors under Varying Intervention Frequencies"☆11Jun 27, 2025Updated 8 months ago
- This is a Human-like Upper-limb Motion Planner (HUMP) for the generation of arm-hand movements in humanoid robots.☆11Mar 4, 2022Updated 3 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- ☆15Apr 6, 2022Updated 3 years ago
- Official implementation of Constrained Mean Shift Clustering☆12Feb 23, 2022Updated 4 years ago
- Companion code for ICML 2022 paper "Imitation Learning by Estimating Expertise of Demonstrators"☆11Jul 5, 2023Updated 2 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- 中国常用大地测量(投影)坐标系相互转换☆10Feb 14, 2020Updated 6 years ago
- Official implementation of "Learned Fourier Bases for Deep Set Feature Extractors in Automotive Reinforcement Learning"☆15Feb 23, 2024Updated 2 years ago
- suPER is a collaborative multi-agent RL algorithm☆14Jun 11, 2024Updated last year
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆19Sep 12, 2025Updated 5 months ago
- ☆485Dec 28, 2023Updated 2 years ago
- Test Env for Security Function Chaining☆11Apr 19, 2023Updated 2 years ago
- Accompanying code for the 2019 CNSM paper "Predicting VNF Deployment Decisions under Dynamically Changing Network Conditions".☆12Aug 22, 2019Updated 6 years ago
- Official implementation of VLMLight☆29Jul 31, 2025Updated 7 months ago
- Automatically emulate network service placements calculated by arbitrary placement algorithms☆12Nov 1, 2021Updated 4 years ago
- PyTorch implementations of Reinforcement Learning algorithms in less than 200 lines☆10Apr 3, 2020Updated 5 years ago
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces☆16Dec 18, 2024Updated last year
- This is a project for creating and using IL datasets based on HuggingFace weights with multithreads for performance, and benchmarking☆13Apr 26, 2025Updated 10 months ago