l-mathur / social-ai
Accompanies the EMNLP 2024 paper: "Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions". This repo features Social-AI papers across computing communities, courses, and dissertations.
☆16Updated 2 months ago
Alternatives and similar repositories for social-ai:
Users that are interested in social-ai are comparing it to the libraries listed below
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆30Updated 3 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆15Updated 6 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆33Updated 9 months ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆20Updated 11 months ago
- ☆15Updated last month
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆19Updated 2 months ago
- ☆11Updated 2 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆25Updated last year
- ☆15Updated 2 months ago
- Introspective Planning: Guiding Language-Enabled Agents to Refine Their Own Uncertainty☆19Updated this week
- Self-Supervised Alignment with Mutual Information☆16Updated 7 months ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated last year
- ☆15Updated 2 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆39Updated last year
- HAZARD challenge☆27Updated 8 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 4 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆41Updated 5 months ago
- Official code for the paper: WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents☆27Updated last month
- ☆24Updated 6 months ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆35Updated 9 months ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆22Updated 5 months ago
- ☆13Updated 4 months ago
- ☆43Updated 2 weeks ago
- ☆14Updated 2 years ago
- ☆15Updated 2 months ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆29Updated 5 months ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"☆28Updated 9 months ago