sotopia-lab / sotopiaLinks
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
β247Updated last week
Alternatives and similar repositories for sotopia
Users that are interested in sotopia are comparing it to the libraries listed below
Sorting:
- An extensible benchmark for evaluating large language models on planningβ410Updated 3 weeks ago
- π Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Papβ¦β248Updated 2 months ago
- Sotopia-Ο: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)β78Updated last year
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology Viewβ118Updated 4 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)β151Updated 11 months ago
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'β127Updated 4 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]β145Updated 10 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Larβ¦β144Updated 7 months ago
- Reasoning with Language Model is Planning with World Modelβ175Updated 2 years ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Modelsβ55Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"β193Updated 5 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]β353Updated last year
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihaβ¦β130Updated last year
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environmentsβ88Updated 5 months ago
- [NeurIPS 2022] πWebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agentsβ405Updated last year
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debateβ470Updated 5 months ago
- β48Updated 7 months ago
- [TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"β83Updated this week
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.β295Updated 2 months ago
- β207Updated 6 months ago
- LLM Agora, debating between open-source LLMs to refine the answersβ76Updated 2 years ago
- augmented LLM with self reflectionβ133Updated last year
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.β99Updated last week
- β239Updated last year
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024β139Updated 7 months ago
- Code for the paper π³ Tree Search for Language Model Agentsβ217Updated last year
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimizationβ169Updated last year
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)β211Updated 2 years ago
- [NeurIPS 2024] Agent Planning with World Knowledge Modelβ149Updated 9 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"β63Updated 8 months ago