Sotopia-RL: Reward Design for Social Intelligence
☆50Apr 1, 2026Updated last month
Alternatives and similar repositories for sotopia-rl
Users that are interested in sotopia-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13May 15, 2025Updated 11 months ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆15Nov 14, 2024Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆63Mar 17, 2026Updated last month
- GPTSolver: question solving with selecting/screenshot☆15Mar 25, 2023Updated 3 years ago
- ☆31Sep 12, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆47Jun 24, 2025Updated 10 months ago
- ☆27Feb 13, 2026Updated 2 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆79Sep 13, 2025Updated 7 months ago
- ☆22May 7, 2025Updated 11 months ago
- Adversarially Robust Generalization Just Requires More Unlabeled Data☆11Aug 8, 2019Updated 6 years ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆202Updated this week
- ☆19Mar 10, 2025Updated last year
- ☆34Jan 30, 2026Updated 3 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [WWW 2025 Oral] Large Language Models Empowered Personalized Web Agents.☆21Nov 11, 2025Updated 5 months ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆56Sep 29, 2025Updated 7 months ago
- Self-Questioning Language Models☆56Mar 30, 2026Updated last month
- ☆19Oct 11, 2025Updated 6 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆22Mar 29, 2025Updated last year
- [ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models☆163Jun 26, 2025Updated 10 months ago
- ☆34Oct 31, 2024Updated last year
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆21Aug 21, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- restore redis db from .rdb backup into docker container☆13Nov 2, 2016Updated 9 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Mar 5, 2026Updated last month
- Code for the robot-assisted feeding project at EmPRISE Lab☆29Apr 23, 2026Updated last week
- A collection of resources that investigate social agents.☆232Apr 22, 2025Updated last year
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆53Dec 13, 2025Updated 4 months ago
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)☆15Jun 14, 2025Updated 10 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆40Oct 20, 2025Updated 6 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ICLR 2025] Pad: Personalized alignment of llms at decoding-time☆19Mar 19, 2025Updated last year
- ☆23Sep 19, 2024Updated last year
- A unified robotic manipulation learning framework☆22Sep 4, 2025Updated 8 months ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Apr 26, 2026Updated last week
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling☆48Apr 17, 2026Updated 2 weeks ago
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality☆21Oct 22, 2025Updated 6 months ago
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align…☆137Nov 8, 2025Updated 5 months ago