tongchangD / text_data_enhancement_with_LaserTaggerView external linksLinks
Modify Chinese text, modified on LaserTagger Model. 文本复 述,基于lasertagger做中文文本数据增强。
☆322Jan 3, 2024Updated 2 years ago
Alternatives and similar repositories for text_data_enhancement_with_LaserTagger
Users that are interested in text_data_enhancement_with_LaserTagger are comparing it to the libraries listed below
Sorting:
- Modify Chinese text, modified on LaserTagger Model. I name it "文本手术刀".目前,本项目实现了一个文本复述任务,用于NLP语料的数据增强。☆214Mar 24, 2023Updated 2 years ago
- lasertagger-chinese;lasertagger中文学习案例,案例数据,注释,shell运行☆76Mar 25, 2023Updated 2 years ago
- ☆604Feb 5, 2026Updated last week
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,879Mar 18, 2025Updated 10 months ago
- 基于bert进行中文文本纠错☆240Jun 12, 2023Updated 2 years ago
- Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La…☆433May 17, 2020Updated 5 years ago
- An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。☆1,386May 31, 2022Updated 3 years ago
- Research on the Construction and Application of Paraphrase Parallel Corpus☆11Oct 26, 2020Updated 5 years ago
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆816Jul 8, 2020Updated 5 years ago
- Open Language Pre-trained Model Zoo☆1,004Nov 18, 2021Updated 4 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,698May 8, 2023Updated 2 years ago
- 天池 疫情相似句对判定大赛 线上第一名方案☆434Oct 17, 2020Updated 5 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year
- a bert for retrieval and generation☆860Feb 26, 2021Updated 4 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,439Jul 15, 2025Updated 7 months ago
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard☆1,786Feb 18, 2023Updated 2 years ago
- ACL 2019论文复现:Improving Multi-turn Dialogue Modelling with Utterance ReWriter☆138Jan 23, 2020Updated 6 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,986Nov 21, 2022Updated 3 years ago
- 基于Pytorch实现的中文文本分类脚手架,以及常用模型对比。☆18Apr 23, 2021Updated 4 years ago
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,173Jul 15, 2025Updated 7 months ago
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆138May 25, 2020Updated 5 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,155Jan 22, 2024Updated 2 years ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- Pre-Trained Chinese XLNet(中文XLNet预训练模型)☆1,649Jul 15, 2025Updated 7 months ago
- 以词为基本单位的中文BERT☆474Nov 18, 2021Updated 4 years ago
- 复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!☆2,797Aug 30, 2025Updated 5 months ago
- fastHan是基于fastNLP与pytorch实现的中文自然语言处理工具,像spacy一样调用方便。☆760Dec 9, 2023Updated 2 years ago
- An off-the-shelf tool for Chinese Keyphrase Extraction 一个快速从中文里抽取关键短语的工具,仅占35M内存 www.jionlp.com☆556Nov 21, 2023Updated 2 years ago
- ChineseSemanticKB,chinese semantic knowledge base, 面向中文处理的12类、百万规模的语义常用词典,包括34万抽象语义库、34万反义语义库、43万同义语义库等,可支持句子扩展、转写、事件抽象与泛化等多种应用场景。☆780Mar 17, 2023Updated 2 years ago
- 基于Pytorch的,中文语义相似度匹配模型(ABCNN、Albert、Bert、BIMPM、DecomposableAttention、DistilBert、ESIM、RE2、Roberta、SiaGRU、XlNet)☆796Mar 22, 2020Updated 5 years ago
- Data Augmentation for NLP. NLP数据增强☆294Dec 10, 2020Updated 5 years ago
- 对ACL2020 FastBERT论文的复现,论文地址//arxiv.org/pdf/2004.02178.pdf☆193Dec 15, 2021Updated 4 years ago
- 中文生成式预训练模型☆99Aug 28, 2020Updated 5 years ago
- 速度更快、效果更好的中文新词发现☆513Mar 15, 2024Updated last year
- 中文文本摘要(text summarization)工具包, 抽取式中文文本摘要 Extractive text summary of Lead3、keyword、textrank、text teaser、word significance、LDA、LSI、NMF。(gra…☆419Jun 17, 2024Updated last year
- CCKS 2019 中文短文本实体链指比赛技术创新奖解决方案☆412Mar 24, 2023Updated 2 years ago
- pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。☆6,368Jan 12, 2026Updated last month
- 中文知识库问答代码,CCKS2019 CKBQA评测第四名解决方案☆478Apr 5, 2021Updated 4 years ago
- keras implement of transformers for humans☆5,420Nov 11, 2024Updated last year