l-mathur / social-aiLinks
Social-AI papers across computing communities, courses, and dissertations.
☆22Updated 5 months ago
Alternatives and similar repositories for social-ai
Users that are interested in social-ai are comparing it to the libraries listed below
Sorting:
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆52Updated last week
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆40Updated 2 months ago
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆35Updated last year
- ☆17Updated 5 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆37Updated last year
- HAZARD challenge☆36Updated 7 months ago
- ☆21Updated last year
- ☆52Updated 6 months ago
- maze datasets for investigating OOD behavior of ML systems☆64Updated last month
- MuMA-ToM: Multi-modal Multi-Agent Theory of Mind☆36Updated 10 months ago
- Lightweight Adapting for Black-Box Large Language Models☆24Updated last year
- ☆16Updated last month
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆37Updated last year
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024).☆13Updated 10 months ago
- ☆132Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆25Updated last year
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year
- ☆19Updated last year
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Updated last year
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆29Updated last year
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆42Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆32Updated 6 months ago
- ☆54Updated last year
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆78Updated last year
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆47Updated last year
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆44Updated 8 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆69Updated 7 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆29Updated last year