sotopia-lab / sotopia
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
☆176 · Updated this week
Alternatives and similar repositories for sotopia:
Users who are interested in sotopia are comparing it to the repositories listed below.
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆55 · Updated 8 months ago
- Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆134 · Updated last month
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL" ☆124 · Updated 9 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆106 · Updated last month
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum. ☆234 · Updated 3 months ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging ☆98 · Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆111 · Updated 4 months ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training ☆251 · Updated 7 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View ☆106 · Updated 8 months ago
- ☆265 · Updated last week
- Self-Alignment with Principle-Following Reward Models ☆150 · Updated 10 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization ☆125 · Updated 8 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆109 · Updated 2 months ago
- An extensible benchmark for evaluating large language models on planning ☆313 · Updated 7 months ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models ☆49 · Updated last year
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ☆110 · Updated 7 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct ☆138 · Updated last month
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral) ☆250 · Updated 9 months ago
- Reasoning with Language Model is Planning with World Model ☆154 · Updated last year
- [NeurIPS 2024] Agent Planning with World Knowledge Model ☆98 · Updated last month
- [NeurIPS 2022] WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆297 · Updated 4 months ago
- ☆75 · Updated 6 months ago
- augmented LLM with self reflection ☆109 · Updated last year
- Repo of paper "Free Process Rewards without Process Labels" ☆94 · Updated this week
- An Analytical Evaluation Board of Multi-turn LLM Agents ☆270 · Updated 7 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO. ☆272 · Updated 5 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks ☆294 · Updated 2 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety ☆174 · Updated 5 months ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen… ☆267 · Updated last year
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback ☆204 · Updated last year