sotopia-lab / sotopiaLinks
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
β239Updated last week
Alternatives and similar repositories for sotopia
Users that are interested in sotopia are comparing it to the libraries listed below
Sorting:
- An extensible benchmark for evaluating large language models on planningβ400Updated 2 months ago
- π Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Papβ¦β242Updated 2 weeks ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Modelsβ52Updated last year
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology Viewβ118Updated 2 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Larβ¦β140Updated 6 months ago
- Sotopia-Ο: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)β75Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)β148Updated 9 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]β340Updated last year
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.β287Updated last month
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'β125Updated 2 months ago
- Reasoning with Language Model is Planning with World Modelβ169Updated 2 years ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debateβ460Updated 4 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]β142Updated 9 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"β188Updated 4 months ago
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)β209Updated 2 years ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Mergingβ108Updated last year
- augmented LLM with self reflectionβ130Updated last year
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimizationβ165Updated last year
- [NeurIPS 2022] πWebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agentsβ389Updated 11 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasksβ312Updated 10 months ago
- β46Updated 6 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Aβ¦β47Updated last year
- LLM Agora, debating between open-source LLMs to refine the answersβ75Updated last year
- A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)β181Updated 3 weeks ago
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024β137Updated 6 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihaβ¦β127Updated last year
- β99Updated last year
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environmentsβ87Updated 3 months ago
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Modelsβ107Updated last month
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.β321Updated last year