Sotopia-RL: Reward Design for Social Intelligence
☆50Apr 1, 2026Updated last month
Alternatives and similar repositories for sotopia-rl
Users that are interested in sotopia-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆305Jan 23, 2026Updated 4 months ago
- ☆13May 15, 2025Updated last year
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆84May 7, 2024Updated 2 years ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆16Nov 14, 2024Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆65Mar 17, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- GPTSolver: question solving with selecting/screenshot☆15Mar 25, 2023Updated 3 years ago
- ☆47Jun 24, 2025Updated 11 months ago
- ☆27Feb 13, 2026Updated 3 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆85Sep 13, 2025Updated 8 months ago
- Adversarially Robust Generalization Just Requires More Unlabeled Data☆11Aug 8, 2019Updated 6 years ago
- ☆19Mar 10, 2025Updated last year
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆25Oct 8, 2024Updated last year
- ☆38Jan 30, 2026Updated 3 months ago
- [WWW 2025 Oral] Large Language Models Empowered Personalized Web Agents.☆21Nov 11, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆30Aug 9, 2025Updated 9 months ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆58Sep 29, 2025Updated 7 months ago
- Efficient Scaling laws and collaborative pretraining.☆22Sep 18, 2025Updated 8 months ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model☆27Nov 25, 2025Updated 5 months ago
- ☆20Oct 11, 2025Updated 7 months ago
- [ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models☆164Jun 26, 2025Updated 10 months ago
- ☆34Oct 31, 2024Updated last year
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆21Aug 21, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LEMMA: An effective and explainable way to detect multimodal misinformation with LVLM and external knowledge augmentation, incorporating …☆23Jun 4, 2025Updated 11 months ago
- restore redis db from .rdb backup into docker container☆13Nov 2, 2016Updated 9 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated 2 years ago
- Code for the robot-assisted feeding project at EmPRISE Lab☆29Apr 23, 2026Updated last month
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Mar 5, 2026Updated 2 months ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- Benchmarking Social Intelligence of Language Agents through Interactive Scenarios☆13Jan 4, 2025Updated last year
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)☆15Jun 14, 2025Updated 11 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 7 months ago
- [ICLR 2025] Pad: Personalized alignment of llms at decoding-time☆19Mar 19, 2025Updated last year
- ☆23Sep 19, 2024Updated last year
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆195Apr 2, 2026Updated last month
- A unified robotic manipulation learning framework☆22Sep 4, 2025Updated 8 months ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Apr 26, 2026Updated 3 weeks ago
- Official release for SplArt: Articulation Estimation and Part-level Reconstruction with 3D Gaussian Splatting.☆31Jun 5, 2025Updated 11 months ago