l-mathur / social-ai
Accompanies the EMNLP 2024 paper: "Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions". This repo features Social-AI papers across computing communities, courses, and dissertations.
☆19Updated 3 months ago
Alternatives and similar repositories for social-ai:
Users that are interested in social-ai are comparing it to the libraries listed below
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆32Updated 7 months ago
- ☆11Updated last month
- [NeurIPS 2024] Introspective Planning: Aligning Robots’ Uncertainty with Inherent Task Ambiguity☆21Updated 3 months ago
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆25Updated 2 weeks ago
- Self-Supervised Alignment with Mutual Information☆17Updated 11 months ago
- MuMA-ToM: Multi-modal Multi-Agent Theory of Mind☆25Updated 3 months ago
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆28Updated 3 weeks ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆63Updated 11 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆23Updated last month
- Code for experiments on transformers using Markovian data.☆11Updated 5 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆19Updated 9 months ago
- ☆18Updated 9 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆11Updated last week
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆26Updated 2 years ago
- ☆18Updated 5 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆43Updated last year
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆28Updated 8 months ago
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated last month
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆16Updated 8 months ago
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆13Updated 2 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated 9 months ago
- ☆15Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆25Updated 6 months ago
- Holistic evaluation of multimodal foundation models☆47Updated 8 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆43Updated last week
- ☆20Updated 5 months ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆28Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 7 months ago