sotopia-lab/sotopia-rl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sotopia-lab/sotopia-rl)

sotopia-lab / sotopia-rl

Sotopia-RL: Reward Design for Social Intelligence

☆52

Alternatives and similar repositories for sotopia-rl

Users that are interested in sotopia-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ulab-uiuc / social-world-model
View on GitHub
[ICML 2026]: Building Social World Models with Large Language Models
☆22Jun 9, 2026Updated last month
consciousness-lab / ctm-ai
View on GitHub
A platform to develop CTM-motivated AI architecture.
☆18May 12, 2026Updated 2 months ago
sotopia-lab / sotopia
View on GitHub
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
☆317Jun 5, 2026Updated last month
ulab-uiuc / research-town
View on GitHub
[ICML 2025] ResearchTown: Simulator of Human Research Community
☆207Updated this week
sotopia-lab / sotopia-pi
View on GitHub
Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)
☆85May 7, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xiusic / DecisionFlow
View on GitHub
☆34Aug 26, 2025Updated 10 months ago
ulab-uiuc / tiny-scientist
View on GitHub
[EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents
☆136Mar 4, 2026Updated 4 months ago
spiral-rl / spiral
View on GitHub
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
☆199Mar 27, 2026Updated 3 months ago
DualityRL / multi-attempt
View on GitHub
☆19Mar 10, 2025Updated last year
rhyang2021 / ARIA
View on GitHub
Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".
☆30Aug 9, 2025Updated 11 months ago
spinbench / spinbench
View on GitHub
☆28May 30, 2026Updated last month
G-JWLee / TAMP
View on GitHub
☆12May 15, 2025Updated last year
lwaekfjlk / mmoe
View on GitHub
MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)
☆16Nov 14, 2024Updated last year
SalesforceAIResearch / LoCoBench-Agent
View on GitHub
LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
☆22Jun 2, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
guoyaol / GptSolver
View on GitHub
GPTSolver: question solving with selecting/screenshot
☆15Mar 25, 2023Updated 3 years ago
sunblaze-ucb / omega
View on GitHub
☆47Jun 24, 2025Updated last year
MozerWang / AMPO
View on GitHub
[ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents
☆51Feb 2, 2026Updated 5 months ago
ulab-uiuc / MARBLE
View on GitHub
(ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…
☆280Oct 27, 2025Updated 8 months ago
ulab-uiuc / live-trade-bench
View on GitHub
Live evaluation of trading agents
☆161Feb 17, 2026Updated 5 months ago
RuntianZ / adversarial-robustness-unlabeled
View on GitHub
Adversarially Robust Generalization Just Requires More Unlabeled Data
☆11Aug 8, 2019Updated 6 years ago
Xt-cyh / CoDI-Eval
View on GitHub
☆22May 7, 2025Updated last year
lwaekfjlk / python-project-template
View on GitHub
Template for project development.
☆14Updated this week
mnoukhov / async_rlhf
View on GitHub
Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models
☆68Mar 5, 2026Updated 4 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ZhaolinGao / REFUEL
View on GitHub
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
☆25Oct 8, 2024Updated last year
Haotianz94 / smpl_visualizer
View on GitHub
☆13Sep 20, 2023Updated 2 years ago
ulab-uiuc / Multi-agent-evolve
View on GitHub
☆153Jan 21, 2026Updated 6 months ago
kaustpradalab / repeat-curse-llm
View on GitHub
[ACL 2025 Findings] Understanding the Repeat Curse in Large Language Models from a Feature Perspective
☆21Jun 13, 2025Updated last year
Infini-AI-Lab / Sparrow
View on GitHub
☆16Jun 15, 2026Updated last month
FeiSun / LaTeX-Drawing
View on GitHub
LaTeX Drawing
☆18Dec 22, 2025Updated 7 months ago
kaustpradalab / LLM-Persona-Steering
View on GitHub
Official code of "Exploring the Personality Traits of LLMs through Latent Features Steering"
☆18Jan 30, 2025Updated last year
TextArena / UnstableBaselines
View on GitHub
☆120Apr 7, 2026Updated 3 months ago
FudanDISC / SocialAgent
View on GitHub
A collection of resources that investigate social agents.
☆241Apr 22, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Yarayx / livelongbench
View on GitHub
The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…
☆12Jun 28, 2025Updated last year
zaydzuhri / token-order-prediction
View on GitHub
Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
☆48May 13, 2026Updated 2 months ago
ritzz-ai / PACS
View on GitHub
☆31Sep 12, 2025Updated 10 months ago
ulab-uiuc / coinjure
View on GitHub
Coinjure: A Trading Agent Harness for Prediction Markets
☆35May 5, 2026Updated 2 months ago
sotopia-lab / awesome-social-agents
View on GitHub
A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.
☆113Jun 14, 2026Updated last month
TIGER-AI-Lab / Hierarchical-Reasoner
View on GitHub
Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]
☆64Apr 11, 2026Updated 3 months ago
SALT-NLP / PersuationGames
View on GitHub
[ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…
☆16Feb 22, 2025Updated last year