quincyliang / nlp-data-augmentationView external linksLinks
Data Augmentation for NLP. NLP数据增强
☆294Dec 10, 2020Updated 5 years ago
Alternatives and similar repositories for nlp-data-augmentation
Users that are interested in nlp-data-augmentation are comparing it to the libraries listed below
Sorting:
- An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。☆1,386May 31, 2022Updated 3 years ago
- 自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,b…☆1,539Sep 23, 2021Updated 4 years ago
- NoiseMix - data generation for natural language☆40May 26, 2018Updated 7 years ago
- 复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!☆2,797Aug 30, 2025Updated 5 months ago
- Data augmentation for NLP, presented at EMNLP 2019☆1,650Mar 19, 2023Updated 2 years ago
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,879Mar 18, 2025Updated 10 months ago
- Contextual augmentation, a text data augmentation using a bidirectional language model.☆192Jan 3, 2020Updated 6 years ago
- A BERT-based Chinese Text Encoder Enhanced by N-gram Representations☆647Jul 24, 2022Updated 3 years ago
- Datasets, SOTA results of every fields of Chinese NLP☆1,815Apr 7, 2022Updated 3 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,986Nov 21, 2022Updated 3 years ago
- Bert finetune for CMRC2018, CJRC, DRCD, CHID, C3☆184Jun 4, 2020Updated 5 years ago
- Text-Similarity Method in Pytorch☆469Dec 9, 2018Updated 7 years ago
- 文本匹配的相关模型DSSM,ESIM,ABCNN,BIMPM等,数据集为LCQMC官方数据☆470May 8, 2022Updated 3 years ago
- 中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large☆229Sep 13, 2019Updated 6 years ago
- Open Language Pre-trained Model Zoo☆1,004Nov 18, 2021Updated 4 years ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,230Feb 6, 2026Updated last week
- Collections of Chinese NLP corpus☆917Dec 28, 2020Updated 5 years ago
- 面向金融领域的事件主体抽取(ccks2019),一个baseline☆119May 13, 2019Updated 6 years ago
- bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目☆1,848Mar 21, 2021Updated 4 years ago
- Modify Chinese text, modified on LaserTagger Model. 文本复述,基于lasertagger做中文文本数据增强。☆322Jan 3, 2024Updated 2 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,156Jan 22, 2024Updated 2 years ago
- Pre-Trained Chinese XLNet(中文XLNet预训练模型)☆1,649Jul 15, 2025Updated 6 months ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,772Jul 22, 2024Updated last year
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard☆1,786Feb 18, 2023Updated 2 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,173Jul 15, 2025Updated 6 months ago
- DeepIE: Deep Learning for Information Extraction☆1,944Dec 9, 2022Updated 3 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,257Mar 7, 2024Updated last year
- Open Chinese Language Pre-trained Model Zoo☆984Mar 18, 2020Updated 5 years ago
- 机器检索阅读联合学习,莱斯杯:全国第二届“军事智能机器阅读”挑战赛 rank6 方案☆128Oct 20, 2020Updated 5 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,439Jul 15, 2025Updated 6 months ago
- 基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口☆1,295Jun 13, 2021Updated 4 years ago
- The code for CCF-BDCI-Sentiment-Analysis-Baseline☆430Dec 8, 2022Updated 3 years ago
- question answering, reading comprehension toolkit☆165Oct 16, 2022Updated 3 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆9,858Feb 6, 2026Updated last week
- Data augmentation for NLP☆4,645Jun 24, 2024Updated last year
- 2019语言与智能技术竞赛-基于知识图谱的主动聊天☆115May 24, 2019Updated 6 years ago
- The 4th rank system of the SemEval 2021 Task4.☆10May 7, 2022Updated 3 years ago
- ai challenger 2018细粒度情感分类第一名解决方案, A training framework itegrating tensorflow and pytorch☆577Nov 27, 2022Updated 3 years ago