yizhilll / CORGI-PMLinks
A Chinese corpus for gender bIas probing and mitigation, which contains 32.9k sentences with high-quality labels.
☆22Updated last year
Alternatives and similar repositories for CORGI-PM
Users that are interested in CORGI-PM are comparing it to the libraries listed below
Sorting:
- A new release of Chinese sexism dataset and lexicon☆12Updated 2 years ago
- repository for CharacterChat, a personalized social support system☆75Updated last year
- 多轮共情对话模型PICA☆97Updated 2 years ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆135Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆84Updated last year
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆105Updated 3 years ago
- ☆25Updated 11 months ago
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).☆53Updated 4 months ago
- ☆26Updated 2 years ago
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆60Updated last year
- ☆30Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆89Updated 11 months ago
- The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection☆289Updated 2 years ago
- 中文对话数据清洗☆30Updated 2 years ago
- 中文大语言模型评测第一期☆110Updated last year
- The repository of EMNLP 2023 "A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check"☆18Updated last year
- Code for the paper `Text Classification via Large Language Models`.☆83Updated 2 years ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆93Updated last year
- 💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.☆180Updated 2 years ago
- 中文 Instruction tuning datasets☆136Updated last year
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆102Updated 2 years ago
- ☆96Updated 2 years ago
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆84Updated last year
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆119Updated 10 months ago
- Source code for ACL 2023 paper Decoder Tuning: Efficient Language Understanding as Decoding☆51Updated 2 years ago
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]☆23Updated last year
- Rephrasing Language Model for CSC (AAAI 2024)☆41Updated last year
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆49Updated 2 years ago
- ☆127Updated 2 years ago
- CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | 中文个性情感对话数据集☆257Updated 2 years ago