nuochenpku / Awesome-Role-Play-Papers
Awesome papers for role-playing with language models
☆165Updated 3 months ago
Alternatives and similar repositories for Awesome-Role-Play-Papers:
Users who are interested in Awesome-Role-Play-Papers are comparing it to the libraries listed below
- ☆218Updated 3 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆337Updated 5 months ago
- repository for CharacterChat, a personalized social support system☆66Updated 7 months ago
- ☆72Updated 4 months ago
- RoleInteract: Evaluating the Social Interaction of Role-Playing Agents☆54Updated 4 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆64Updated this week
- ☆139Updated 7 months ago
- A self-alignment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…☆183Updated 8 months ago
- ☆209Updated 9 months ago
- ☆45Updated this week
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆188Updated 8 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆144Updated last year
- Generative Judge for Evaluating Alignment☆228Updated last year
- ☆257Updated 6 months ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆216Updated this week
- This is the repository for the Tool Learning survey.☆304Updated this week
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆103Updated 5 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 2 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆159Updated 6 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆117Updated 8 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆141Updated 5 months ago
- ☆174Updated 9 months ago
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning☆240Updated last year
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆129Updated 5 months ago
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆40Updated 9 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆241Updated 2 months ago
- A series of technical reports on Slow Thinking with LLM☆409Updated last week
- Collection of papers for scalable automated alignment.☆82Updated 3 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆233Updated 3 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆136Updated 7 months ago