RXJ588 / CHSD
Hate speech corpus
☆20 Updated last year
Alternatives and similar repositories for CHSD
Users who are interested in CHSD are comparing it to the repositories listed below
- CCL 2020 Chinese metaphor recognition and sentiment analysis task description and dataset ☆39 Updated 4 years ago
- The first Chinese metaphor corpus for identification and generation (Chinese metaphor dataset). Presented at COLING 2022. ☆42 Updated 2 years ago
- The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark"… ☆72 Updated 4 months ago
- The code and resource of "Towards Comprehensive Detection of Chinese Harmful Memes" (NeurIPS2024 D&B). ☆37 Updated 4 months ago
- Evaluates the fluency of natural language ☆115 Updated 3 years ago
- A new release of Chinese sexism dataset and lexicon ☆10 Updated last year
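- Summarization and coreference resolution based on the Google T5 Chinese generative model, with support for batch generation and multiprocessing ☆222 Updated last year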
- The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection ☆266 Updated 2 years ago
- ☆30 Updated last year
- ☆27 Updated 9 months ago
- This repository stores the code of the data augmentation method from Chinese word and character levels, which adds noise to words and cha… ☆18 Updated 2 years ago
- A large, high-quality corpus of Chinese synonyms. ☆50 Updated 3 years ago
- Yet Another Chinese Learner Corpus ☆77 Updated 3 years ago
- Chinese machine reading comprehension dataset ☆103 Updated 4 years ago
- [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset ☆628 Updated last year
- Chinese coreference resolution based on SpanBERT, implemented in PyTorch ☆97 Updated 2 years ago
- Code for the paper `Text Classification via Large Language Models`. ☆81 Updated last year
- A Chinese corpus for gender bias probing and mitigation, which contains 32.9k sentences with high-quality labels. ☆19 Updated 8 months ago
- SikuBERT: a pre-trained language model for the Siku Quanshu (Siku BERT) ☆129 Updated last year
- Unsupervised Chinese keyphrase extraction: statistics-based, graph-based (LDA and PageRank: TextRank, TPR, Salience Rank, Single TPR, etc.), and embedding-based (SIFRank, etc.) methods, ready to use out of the box! ☆105 Updated 2 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model ☆115 Updated 5 months ago
- ☆17 Updated 2 years ago
- Chinese text readability grading dataset ☆13 Updated last year
- Experiments with several semantic matching models and a comparison of their results. ☆161 Updated last year
- A Corpus for Classical Chinese Language Event Extraction ☆21 Updated last year
- Source code and dataset for ACL2022 Findings Paper "LEVEN: A Large-Scale Chinese Legal Event Detection dataset" ☆111 Updated last year
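- Sample crawlers for scraping various data sources (Baidu Baike, Zhihu, Weibo, Jianshu, Sogou lexicon), useful for collecting natural language processing corpora ☆10 Updated 5 years ago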
- ☆40 Updated last year
- Text correction papers ☆306 Updated last year
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser" ☆83 Updated last year