Stanford-ILIAD / Diverse-ConventionsView external linksLinks
Exploring techniques to generate diverse conventions in multi-agent settings
☆15Nov 14, 2023Updated 2 years ago
Alternatives and similar repositories for Diverse-Conventions
Users that are interested in Diverse-Conventions are comparing it to the libraries listed below
Sorting:
- ☆41Jan 9, 2024Updated 2 years ago
- CookingZoo: a gym-cooking derivative to simulate a complex cooking environment☆21Dec 6, 2024Updated last year
- Overcooked human-AI experiment platform☆39Dec 21, 2023Updated 2 years ago
- Code for "On the Utility of Learning about Humans for Human-AI Coordination"☆110Apr 17, 2023Updated 2 years ago
- Collection of RL Environments built using Madrona☆37Aug 11, 2023Updated 2 years ago
- This is the official implementation of paper "Leveraging Dual Process Theory in Language Agent Framework for Simultaneous Human-AI Collab…☆52Nov 22, 2025Updated 2 months ago
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆54Nov 22, 2025Updated 2 months ago
- A distributed GPU-centric experience replay system for large AI models.☆18Aug 1, 2023Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- ☆20Apr 5, 2023Updated 2 years ago
- Repo for the Greedy when Sure and Conservative when Uncertain about the Opponents (GSCU)☆25Aug 4, 2022Updated 3 years ago
- AAAI24(Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models☆97Mar 4, 2025Updated 11 months ago
- Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…☆45Oct 29, 2020Updated 5 years ago
- 🏆 gym-cooking: Code for "Too many cooks: Bayesian inference for coordinating multi-agent collaboration", Winner of the CogSci 2020 Compu…☆218Apr 25, 2021Updated 4 years ago
- ☆28Updated this week
- PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tu…☆157Nov 6, 2023Updated 2 years ago
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆67Aug 9, 2024Updated last year
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- Generalized Proximal Policy Optimization with Sample Reuse (GePPO)☆28Jul 24, 2023Updated 2 years ago
- Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)☆32Nov 22, 2025Updated 2 months ago
- ☆12Nov 21, 2025Updated 2 months ago
- Exploratory Data Analysis of Time Series Data and Forecasting using Naïve Approach, Moving Average Method, Simple Exponential Smoothenin…☆12Jul 2, 2018Updated 7 years ago
- Overcooked! 2 TAS Development Framework☆10Aug 18, 2023Updated 2 years ago
- ☆20Oct 18, 2025Updated 3 months ago
- Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)☆33Oct 6, 2022Updated 3 years ago
- World Model for Natural Gas Trade☆10Feb 8, 2018Updated 8 years ago
- # Supporting-Emergency-Room-Decision-Making-with-Relevant-Scientific-Literature #### Supervised by: Yassine Benajiba #### Course: Introdu…☆10Jan 19, 2018Updated 8 years ago
- Transcribing long blocks of speech using Watson Speech To Text.☆11Sep 24, 2020Updated 5 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Provides fully configure Visual Studio Solution for ORTools☆10Aug 30, 2019Updated 6 years ago
- Solutions to assignments in course- "Bitcoin and Cryptocurrency Technologies", offered by coursera, Princeton University☆11Jun 28, 2018Updated 7 years ago
- R script for visualising patient ward movements as timelines☆13May 13, 2022Updated 3 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆42Jan 13, 2024Updated 2 years ago
- This is the source code of RPG (Reward-Randomized Policy Gradient)☆42Sep 1, 2022Updated 3 years ago
- MeMAD multimodal content analysis and machine translation: collection of tools and libraries☆12May 17, 2021Updated 4 years ago
- EMNLP 2022: Biomedical NER for the Enterprise with Distillated BERN2 and the Kazu Framework☆11Aug 29, 2024Updated last year
- Improvement for Modular Camera based Tactile Sensor, with integrated circuit, optimized illumination, and biomimetic markers.☆15Feb 14, 2024Updated 2 years ago
- My solution code to parallel architecture and programming Spring 2016☆12Aug 15, 2016Updated 9 years ago
- Celery plugin to autoscale based on available CPU, memory, or other system attributes.☆11Dec 8, 2017Updated 8 years ago