Sotopia-RL: Reward Design for Social Intelligence
☆49Apr 1, 2026Updated last week
Alternatives and similar repositories for sotopia-rl
Users that are interested in sotopia-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A platform to develop CTM-motivated AI architecture.☆16Updated this week
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆295Jan 23, 2026Updated 2 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆83May 7, 2024Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆60Mar 17, 2026Updated 3 weeks ago
- MMoE: Multimodal Mixture-of-Experts (EMNLP 2024)☆14Nov 14, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GPTSolver: question solving with selecting/screenshot☆15Mar 25, 2023Updated 3 years ago
- ☆31Sep 12, 2025Updated 7 months ago
- ☆46Jun 24, 2025Updated 9 months ago
- ☆27Feb 13, 2026Updated 2 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆77Sep 13, 2025Updated 7 months ago
- ☆22May 7, 2025Updated 11 months ago
- Adversarially Robust Generalization Just Requires More Unlabeled Data☆11Aug 8, 2019Updated 6 years ago
- [ICML 2025] ResearchTown: Simulator of Human Research Community☆201Updated this week
- ☆19Mar 10, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆24Oct 8, 2024Updated last year
- ☆33Jan 30, 2026Updated 2 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 8 months ago
- [WWW 2025 Oral] Large Language Models Empowered Personalized Web Agents.☆21Nov 11, 2025Updated 5 months ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- ☆19Oct 11, 2025Updated 6 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆55Sep 29, 2025Updated 6 months ago
- Efficient Scaling laws and collaborative pretraining.☆22Sep 18, 2025Updated 6 months ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model☆27Nov 25, 2025Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Self-Questioning Language Models☆56Mar 30, 2026Updated 2 weeks ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆161Jun 26, 2025Updated 9 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆23Mar 29, 2025Updated last year
- ☆34Oct 31, 2024Updated last year
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆21Aug 21, 2025Updated 7 months ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆132Mar 4, 2026Updated last month
- restore redis db from .rdb backup into docker container☆13Nov 2, 2016Updated 9 years ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Mar 5, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the robot-assisted feeding project at EmPRISE Lab☆28Apr 8, 2026Updated last week
- ICLR 2023 paper "Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness" by Yuancheng Xu, Yanchao Sun, Micah Gold…☆26May 2, 2023Updated 2 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)☆15Jun 14, 2025Updated 10 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆40Oct 20, 2025Updated 5 months ago
- [ICLR 2025] Pad: Personalized alignment of llms at decoding-time☆19Mar 19, 2025Updated last year
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆186Apr 2, 2026Updated last week