sotopia-lab / sotopiaLinks
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
β223Updated this week
Alternatives and similar repositories for sotopia
Users that are interested in sotopia are comparing it to the libraries listed below
Sorting:
- π Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Papβ¦β206Updated 3 weeks ago
- Sotopia-Ο: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)β65Updated last year
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology Viewβ118Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]β136Updated 6 months ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debateβ437Updated last month
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"β74Updated last month
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]β320Updated last year
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.β264Updated 2 weeks ago
- VisualWebArena is a benchmark for multimodal agents.β347Updated 6 months ago
- An extensible benchmark for evaluating large language models on planningβ375Updated last month
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Larβ¦β131Updated 3 months ago
- β173Updated 2 months ago
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. β¦β137Updated last year
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Modelsβ52Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)β141Updated 7 months ago
- Function Vectors in Large Language Models (ICLR 2024)β167Updated last month
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)β262Updated last year
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimizationβ146Updated last year
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedbackβ108Updated 2 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safetyβ187Updated 10 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Zihaβ¦β124Updated last year
- LLM Agora, debating between open-source LLMs to refine the answersβ68Updated last year
- β276Updated 4 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"β177Updated last month
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervisionβ120Updated 8 months ago
- Self-Alignment with Principle-Following Reward Modelsβ161Updated 3 weeks ago
- Code for the paper π³ Tree Search for Language Model Agentsβ199Updated 10 months ago
- β133Updated last year
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Mergingβ106Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"β76Updated last year