Neph0s / InCharacter
Official code for the paper: InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews (previously: Do Role-Playing Chatbots Capture the Character Personalities? Assessing Personality Traits for Role-Playing Chatbots)
☆71Updated 6 months ago
Alternatives and similar repositories for InCharacter:
Users that are interested in InCharacter are comparing it to the libraries listed below
- repository for CharacterChat, a personalized social support system☆69Updated 9 months ago
- RoleInteract: Evaluating the Social Interaction of Role-Playing Agents☆55Updated 6 months ago
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"☆123Updated last week
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆52Updated last week
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆49Updated last year
- Awesome papers for role-playing with language models☆181Updated 5 months ago
- ☆234Updated 4 months ago
- A Bilingual Role Evaluation Benchmark for Large Language Models☆40Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 4 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 6 months ago
- The official repository of the Omni-MATH benchmark.☆80Updated 3 months ago
- This is the official repository for the paper "EmoBench: Evaluating the Emotional Intelligence of Large Language Models"☆70Updated last year
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 10 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆62Updated 11 months ago
- ☆36Updated 7 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- Reformatted Alignment☆115Updated 6 months ago
- ☆49Updated last year
- A self-ailgnment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…☆190Updated 10 months ago
- Unofficial implementation of AlpaGasus☆90Updated last year
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation☆27Updated last year
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆53Updated last year
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆35Updated 8 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆85Updated last year
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆18Updated 5 months ago
- Just for debug☆56Updated last year
- Code and Data for the paper "Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works".☆17Updated 8 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆46Updated 9 months ago
- Codebase for LLM story generation; updated version of https//github.com/yangkevin2/doc-story-generation☆80Updated last year
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024)☆23Updated 8 months ago