yizhilll / CORGI-PMLinks
A Chinese corpus for gender bIas probing and mitigation, which contains 32.9k sentences with high-quality labels.
☆21Updated last year
Alternatives and similar repositories for CORGI-PM
Users that are interested in CORGI-PM are comparing it to the libraries listed below
Sorting:
- A new release of Chinese sexism dataset and lexicon☆12Updated 2 years ago
- repository for CharacterChat, a personalized social support system☆75Updated last year
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆132Updated last year
- LAiW: A Chinese Legal Large Language Models Benchmark☆82Updated last year
- ☆29Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆88Updated 9 months ago
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B).☆53Updated 3 months ago
- [LREC] MMChat: Multi-Modal Chat Dataset on Social Media☆105Updated 2 years ago
- ☆26Updated 2 years ago
- ☆24Updated 10 months ago
- Code for the paper `Text Classification via Large Language Models`.☆82Updated 2 years ago
- 中文大语言模型评测第一期☆110Updated last year
- Official github repo for ACLUE, an evaluation benchmark focused on ancient Chinese language comprehension☆29Updated last year
- Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"☆101Updated 2 years ago
- The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection☆282Updated 2 years ago
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆59Updated last year
- 多轮共情对话模型PICA☆97Updated last year
- ☆64Updated 3 years ago
- ☆128Updated 2 years ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆92Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆91Updated 6 months ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆102Updated 2 years ago
- Dataset for Findings of ACL 23 "VCSum: A Versatile Chinese Meeting Summarization Dataset"☆47Updated 2 years ago
- The source code of paper "CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking"☆78Updated 2 years ago
- 💡GENIUS – generating text using sketches! A strong text generation & data augmentation tool.☆177Updated 2 years ago
- ☆96Updated last year
- [COLING 2025] Official Repo for Paper "Beyond Boundaries: Learning Universal Entity Taxonomy across Datasets and Languages for Open Named…☆23Updated 2 months ago
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆53Updated 9 months ago
- ☆27Updated 2 years ago
- ☆145Updated last year