nuochenpku / Awesome-Role-Play-PapersLinks
Awesome papers for role-playing with language models
☆195Updated 9 months ago
Alternatives and similar repositories for Awesome-Role-Play-Papers
Users that are interested in Awesome-Role-Play-Papers are comparing it to the libraries listed below
Sorting:
- ☆257Updated 2 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆381Updated last month
- ☆98Updated 9 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆265Updated last year
- RoleInteract: Evaluating the Social Interaction of Role-Playing Agents☆57Updated 9 months ago
- repository for CharacterChat, a personalized social support system☆72Updated last year
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆115Updated last month
- Code and Data for EMNLP 2024 Paper "Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent"☆130Updated last week
- Generative Judge for Evaluating Alignment☆244Updated last year
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆455Updated 6 months ago
- ☆144Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆561Updated 7 months ago
- A self-ailgnment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…☆201Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆129Updated 10 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆89Updated 5 months ago
- [ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models☆83Updated 2 months ago
- The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>☆341Updated last year
- ☆298Updated last year
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆58Updated 2 months ago
- Collection of training data management explorations for large language models☆329Updated last year
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆415Updated 6 months ago
- A live reading list for LLM-synthetic-data.☆343Updated this week
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆156Updated 2 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆131Updated last year
- papers related to LLM-agent that published on top conferences☆315Updated 3 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆225Updated last year
- ☆323Updated last year
- ☆18Updated last year
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆58Updated last year
- ☆328Updated last year