Chinese NLP Data Augmentation, BERT Contextual Augmentation
☆112Mar 29, 2022Updated 3 years ago
Alternatives and similar repositories for NLPDataAugmentation
Users that are interested in NLPDataAugmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,878Mar 18, 2025Updated last year
- An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。☆1,385May 31, 2022Updated 3 years ago
- NLP文本增强的两种方式:同义词替换(利用word2vec词表)和回译☆78Apr 6, 2021Updated 4 years ago
- ☆92Jun 3, 2020Updated 5 years ago
- 中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。☆4,581Nov 21, 2023Updated 2 years ago
- 抽取式NLP模型(阅读理解模型,MRC)实现词义消歧(WSD)☆14May 10, 2022Updated 3 years ago
- 基于回译增强数据,目前整合了百度、有道、谷歌(需翻墙)翻译。☆21Nov 5, 2020Updated 5 years ago
- sodic2021 法律咨询智能问答 Baseline 线上35+☆17Jun 1, 2021Updated 4 years ago
- python3 pytorch>=0.4☆11Dec 25, 2019Updated 6 years ago
- a bert for retrieval and generation☆859Feb 26, 2021Updated 5 years ago
- 利用哈工大同义词林替换问答文本内的同义词进行语料扩充☆37Jun 6, 2019Updated 6 years ago
- Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021☆306Oct 23, 2023Updated 2 years ago
- Contextual augmentation, a text data augmentation using a bidirectional language model.☆192Jan 3, 2020Updated 6 years ago
- Data augmentation for NLP☆4,652Jun 24, 2024Updated last year
- 历届中文句法错误诊断技术评测数据集☆44Jun 4, 2022Updated 3 years ago
- ☆13Oct 18, 2022Updated 3 years ago
- 1、基于gensim的word2vec模型的训练 2、基于skip_gram模型的词向量模型的训练 3、词向量模型的评估(近义词、反义词) 4、兼容基于gensim的word2vec模型的训练、基于skip_gram模型的词向量模型的训练的增量模型的训练 5、转换词向量☆10Jul 1, 2020Updated 5 years ago
- 全球人工智能技术创新大赛-赛道三:小布助手对话短文本语义匹配☆12Apr 5, 2021Updated 4 years ago
- Data augmentation for NLP, presented at EMNLP 2019☆1,651Mar 19, 2023Updated 3 years ago
- Open Language Pre-trained Model Zoo☆1,005Nov 18, 2021Updated 4 years ago
- ☆54Dec 22, 2020Updated 5 years ago
- 面向金融领域的小样本跨类迁移事件抽取 第三名 方案及代码☆17Dec 23, 2020Updated 5 years ago
- 转换 https://github.com/brightmart/albert_zh 到google格式☆61Sep 28, 2020Updated 5 years ago
- keras implement of transformers for humans☆5,424Nov 11, 2024Updated last year
- nlp标注平台 前端/后端☆16Oct 29, 2021Updated 4 years ago
- 基于pycorrector以及chatglm3-6b的文本纠错☆12Mar 10, 2024Updated 2 years ago
- t5-model-onnx,中文拼写纠错,Chinese spelling correction。☆15Sep 18, 2022Updated 3 years ago
- 中文机器阅读理解数据集☆65Jan 15, 2020Updated 6 years ago
- Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension☆171Apr 20, 2022Updated 3 years ago
- Code for GSN: A Graph-Structured Network for Multi-Party Dialogues☆30Aug 10, 2019Updated 6 years ago
- ☆65May 11, 2022Updated 3 years ago
- Official repository for "MMConv: An Environment for Multimodal Conversational Search across Multiple Domains"☆34Jul 15, 2021Updated 4 years ago
- FewCLUE 小样本学习测评基准,中文版☆519Sep 21, 2022Updated 3 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Mar 9, 2022Updated 4 years ago
- ☆80Dec 17, 2020Updated 5 years ago
- Code for the ACL 2022 (Long paper): "New Intent Discovery with Pre-training and Contrastive Learning".☆14Jul 18, 2022Updated 3 years ago
- 李傲龍的博客☆82Jul 17, 2024Updated last year
- ☆440Apr 25, 2025Updated 11 months ago
- Ladder Side-Tuning在CLUE上的简单尝试☆22Jun 20, 2022Updated 3 years ago