Sotopia-RL: Reward Design for Social Intelligence
☆50 · Apr 1, 2026 · Updated 2 months ago
Alternatives and similar repositories for sotopia-rl
Users that are interested in sotopia-rl are comparing it to the libraries listed below.
- A platform to develop CTM-motivated AI architecture. ☆18 · May 12, 2026 · Updated last month
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆84 · May 7, 2024 · Updated 2 years ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆69 · Mar 17, 2026 · Updated 2 months ago
- GPTSolver: question solving with selecting/screenshot ☆15 · Mar 25, 2023 · Updated 3 years ago
- ☆47 · Jun 24, 2025 · Updated 11 months ago
- ☆27 · May 30, 2026 · Updated 2 weeks ago
- Tutorial for BeeInvaders game on the Basys3 FPGA board ☆12 · Jul 17, 2023 · Updated 2 years ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆86 · Sep 13, 2025 · Updated 9 months ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community ☆206 · Updated this week
- ☆19 · Mar 10, 2025 · Updated last year
- FPGA Games Gathered from GitHub in one place ☆22 · Aug 19, 2022 · Updated 3 years ago
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF ☆25 · Oct 8, 2024 · Updated last year
- [WWW 2025 Oral] Large Language Models Empowered Personalized Web Agents. ☆22 · Nov 11, 2025 · Updated 7 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆30 · Aug 9, 2025 · Updated 10 months ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc… ☆16 · Feb 22, 2025 · Updated last year
- ☆40 · Jan 30, 2026 · Updated 4 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆58 · Sep 29, 2025 · Updated 8 months ago
- Efficient Scaling laws and collaborative pretraining. ☆22 · Sep 18, 2025 · Updated 8 months ago
- Self-Questioning Language Models ☆56 · Mar 30, 2026 · Updated 2 months ago
- ☆21 · Oct 11, 2025 · Updated 8 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models ☆23 · Mar 29, 2025 · Updated last year
- ☆34 · Oct 31, 2024 · Updated last year
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025] ☆21 · Aug 21, 2025 · Updated 9 months ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents ☆136 · Mar 4, 2026 · Updated 3 months ago
- restore redis db from .rdb backup into docker container ☆13 · Nov 2, 2016 · Updated 9 years ago
- A collection of resources that investigate social agents. ☆235 · Apr 22, 2025 · Updated last year
- Benchmarking Social Intelligence of Language Agents through Interactive Scenarios ☆13 · Jan 4, 2025 · Updated last year
- ☆179 · Nov 24, 2025 · Updated 6 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training ☆53 · Dec 13, 2025 · Updated 6 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex… ☆41 · Oct 20, 2025 · Updated 7 months ago
- [ICLR 2025] Pad: Personalized alignment of llms at decoding-time ☆20 · Mar 19, 2025 · Updated last year
- ☆23 · Sep 19, 2024 · Updated last year
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles" ☆199 · Apr 2, 2026 · Updated 2 months ago
- A unified robotic manipulation learning framework ☆22 · Sep 4, 2025 · Updated 9 months ago
- Resources for "Simple Speech Representation Learning from Perceptual Data". ☆11 · Sep 18, 2023 · Updated 2 years ago
- Official release for SplArt: Articulation Estimation and Part-level Reconstruction with 3D Gaussian Splatting. ☆31 · Jun 5, 2025 · Updated last year
- [SIGGRAPH Asia 2025] CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling ☆49 · Apr 17, 2026 · Updated last month
- (NIPS 2025) OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Align… ☆140 · May 9, 2026 · Updated last month
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda ☆18 · Apr 29, 2026 · Updated last month