TaoZhen1110 / CAT-LLM
☆21Updated 4 months ago
Alternatives and similar repositories for CAT-LLM:
Users that are interested in CAT-LLM are comparing it to the libraries listed below
- repository for CharacterChat, a personalized social support system☆67Updated 8 months ago
- ☆62Updated 2 years ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆75Updated 4 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆120Updated 9 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆70Updated 3 weeks ago
- Proactive Dialogue Systems - Paper Reading List☆52Updated last year
- OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models☆32Updated 7 months ago
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆13Updated 3 months ago
- Towards Quantifiable Dialogue Coherence Evaluation (ACL 2021)☆62Updated 3 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated last year
- A Bilingual Role Evaluation Benchmark for Large Language Models☆39Updated last year
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆34Updated last year
- ☆70Updated 2 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆112Updated 3 months ago
- 中文大语言模型评测第二期☆70Updated last year
- ☆25Updated last year
- 历届中文句法错误诊断技术评测数据集☆38Updated 2 years ago
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud☆22Updated last year
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).☆31Updated 2 months ago
- ☆59Updated last year
- This is the repo which record the evolution of LM-based dialogue system. More details can be found in our original survey paper: A Survey…☆57Updated last week
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆53Updated last year
- A Chinese corpus for gender bIas probing and mitigation, which contains 32.9k sentences with high-quality labels.☆19Updated 6 months ago
- ☆97Updated 11 months ago
- ☆15Updated last year
- ☆93Updated last year
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆97Updated 3 months ago
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆15Updated last year
- ☆17Updated 3 years ago
- ☆31Updated last year