Sotopia-RL: Reward Design for Social Intelligence
☆52Apr 1, 2026Updated 3 months ago
Alternatives and similar repositories for sotopia-rl
Users that are interested in sotopia-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A platform to develop CTM-motivated AI architecture.☆18May 12, 2026Updated last month
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆314Jun 5, 2026Updated 3 weeks ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆85May 7, 2024Updated 2 years ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆16Nov 14, 2024Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆70Mar 17, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆31Sep 12, 2025Updated 9 months ago
- ☆47Jun 24, 2025Updated last year
- ☆28May 30, 2026Updated last month
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆86Sep 13, 2025Updated 9 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆208Updated this week
- ☆19Mar 10, 2025Updated last year
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆25Oct 8, 2024Updated last year
- [WWW 2025 Oral] Large Language Models Empowered Personalized Web Agents.☆22Nov 11, 2025Updated 7 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆30Aug 9, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- ☆41Jan 30, 2026Updated 5 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆57Sep 29, 2025Updated 9 months ago
- Efficient Scaling laws and collaborative pretraining.☆22Sep 18, 2025Updated 9 months ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model☆26Nov 25, 2025Updated 7 months ago
- ☆21Oct 11, 2025Updated 8 months ago
- Self-Questioning Language Models☆57Mar 30, 2026Updated 3 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆24Mar 29, 2025Updated last year
- [ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models☆165Jun 26, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆34Oct 31, 2024Updated last year
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆21Aug 21, 2025Updated 10 months ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆135Mar 4, 2026Updated 4 months ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated 2 years ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Mar 5, 2026Updated 3 months ago
- Benchmarking Social Intelligence of Language Agents through Interactive Scenarios☆13Jan 4, 2025Updated last year
- ☆180Nov 24, 2025Updated 7 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 6 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆23Sep 19, 2024Updated last year
- A unified robotic manipulation learning framework☆23Sep 4, 2025Updated 9 months ago
- Code for the robot-assisted feeding project at EmPRISE Lab☆30Jun 26, 2026Updated last week
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆26Apr 26, 2026Updated 2 months ago
- Official release for SplArt: Articulation Estimation and Part-level Reconstruction with 3D Gaussian Splatting.☆32Jun 5, 2025Updated last year
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling☆49Apr 17, 2026Updated 2 months ago