Paitesanshi / CharacterBoxLinks
☆22Updated last year
Alternatives and similar repositories for CharacterBox
Users that are interested in CharacterBox are comparing it to the libraries listed below
Sorting:
- ☆110Updated last year
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆68Updated 5 months ago
- Personality Alignment of Language Models☆52Updated 7 months ago
- [EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“☆26Updated last year
- ☆137Updated last month
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024)☆24Updated 3 months ago
- Awesome papers for role-playing with language models☆216Updated last year
- [ICLR 2026] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆47Updated 7 months ago
- ☆59Updated last year
- [ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models☆108Updated 8 months ago
- [AAAI'25] SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models☆24Updated 4 months ago
- This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dial…☆25Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆101Updated 11 months ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆64Updated 8 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆83Updated last year
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆209Updated 9 months ago
- RoleInteract: Evaluating the Social Interaction of Role-Playing Agents☆67Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆151Updated last year
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)☆31Updated 3 months ago
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆60Updated last year
- Code and data for the paper: On the Reliability of Psychological Scales on Large Language Models☆30Updated last month
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆55Updated last year
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆73Updated 6 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆85Updated last year
- [ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO☆62Updated 9 months ago
- Proactive Dialogue Systems - Paper Reading List☆66Updated 2 years ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆64Updated last year
- [NAACL 2025] The implementation of paper "Hello Again! LLM-powered Personalized Agent for Long-term Dialogue".☆74Updated 8 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆168Updated last year
- ☆51Updated last year