tongchangD/text_data_enhancement_with_LaserTagger

Chinese text rewriting built on a modified LaserTagger model. Text paraphrasing: Chinese text data augmentation based on LaserTagger.

☆320

Alternatives and similar repositories for text_data_enhancement_with_LaserTagger

Users who are interested in text_data_enhancement_with_LaserTagger compare it with the repositories listed below.

Mleader2 / text_scalpel
Chinese text rewriting built on a modified LaserTagger model, named "文本手术刀" ("Text Scalpel"). The project currently implements a text paraphrasing task for augmenting NLP corpora.
☆215 · Mar 24, 2023 · Updated 3 years ago
425776024 / lasertagger-chinese
lasertagger-chinese: a LaserTagger learning example for Chinese, with sample data, annotations, and shell scripts for running it.
☆75 · Mar 25, 2023 · Updated 3 years ago
google-research / lasertagger
☆603 · Mar 12, 2026 · Updated 4 months ago
425776024 / nlpcda
One-step Chinese data augmentation package: NLP data augmentation, BERT data augmentation, EDA. Install with pip install nlpcda.
☆1,879 · Mar 18, 2025 · Updated last year
Wys997 / Chinese-Paraphrase-from-Quora
Research on the Construction and Application of Paraphrase Parallel Corpus
☆11 · Oct 26, 2020 · Updated 5 years ago
zhanlaoban / EDA_NLP_for_Chinese
An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese text, with NLP data augmentation and paper reading notes.
☆1,383 · May 31, 2022 · Updated 4 years ago
tongchangD / bert_for_corrector
Chinese text error correction based on BERT.
☆242 · Jun 12, 2023 · Updated 3 years ago
tongchangD / PMI
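PMI is a special case of mutual information (NMI). Mutual information is a concept from information theory, used mainly to measure how strongly two signals are associated. In text processing, PMI is used to compute the degree of association between two words. Compared with traditional similarity measures, the advantage of PMI is that it looks at word co-occurrence statistically to determine whether words are semantically related…
☆15 · Aug 24, 2020 · Updated 5 years ago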
sunyilgdx / SIFRank_zh
Keyphrase or keyword extraction: a Chinese keyword extraction method based on pre-trained models (paper: SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La…
☆431 · May 17, 2020 · Updated 6 years ago
CLUEbenchmark / CLUEPretrainedModels
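A collection of high-quality Chinese pre-trained models: state-of-the-art large models, the fastest small models, and models specialized for similarity.
☆810 · Jul 8, 2020 · Updated 6 years ago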
zzy99 / epidemic-sentence-pair
First-place solution on the online leaderboard of the Tianchi epidemic similar sentence pair identification competition.
☆434 · Oct 17, 2020 · Updated 5 years ago
ZhuiyiTechnology / pretrained-models
Open Language Pre-trained Model Zoo
☆1,003 · Nov 18, 2021 · Updated 4 years ago
dbiir / UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,111 · May 9, 2024 · Updated 2 years ago
airaria / TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
☆1,705 · May 8, 2023 · Updated 3 years ago
ymcui / Chinese-ELECTRA
Pre-trained Chinese ELECTRA (Chinese ELECTRA pre-trained models)
☆1,433 · Apr 19, 2026 · Updated 3 months ago
ChineseGLUE / ChineseGLUE
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard
☆1,782 · Feb 18, 2023 · Updated 3 years ago
ymcui / Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
☆10,224 · Apr 19, 2026 · Updated 3 months ago
ZhuiyiTechnology / simbert
A BERT for retrieval and generation
☆860 · Feb 26, 2021 · Updated 5 years ago
brightmart / albert_zh
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS; a large collection of Chinese pre-trained ALBERT models
☆3,981 · Nov 21, 2022 · Updated 3 years ago
ymcui / Chinese-XLNet
Pre-Trained Chinese XLNet (Chinese XLNet pre-trained models)
☆1,647 · Apr 19, 2026 · Updated 3 months ago
icip-cas / Chinese-PPDB
Chinese-PPDB
☆14 · Nov 23, 2020 · Updated 5 years ago
brightmart / roberta_zh
Chinese pre-trained RoBERTa models: RoBERTa for Chinese
☆2,793 · Jul 22, 2024 · Updated 2 years ago
huawei-noah / Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,162 · Jan 22, 2024 · Updated 2 years ago
tongchangD / Language_recognize
Speech-based language identification.
☆30 · Jul 23, 2023 · Updated 3 years ago
zhpmatrix / nlp-competitions-list-review
Reviews the top solutions from all NLP competitions. Focused solely on NLP competitions and continuously updated.
☆2,805 · Apr 4, 2026 · Updated 3 months ago
liuhuanyong / ChineseSemanticKB
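ChineseSemanticKB, a Chinese semantic knowledge base: 12 categories of commonly used semantic dictionaries at million-entry scale for Chinese language processing, including a 340,000-entry abstract semantics library, a 340,000-entry antonym library, and a 430,000-entry synonym library. It supports application scenarios such as sentence expansion, rewriting, and event abstraction and generalization.
☆783 · Mar 17, 2023 · Updated 3 years ago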
BitVoyage / FastBERT
A reproduction of the ACL 2020 FastBERT paper. Paper: //arxiv.org/pdf/2004.02178.pdf
☆191 · Dec 15, 2021 · Updated 4 years ago
hiyoung123 / Chinese-Text-Classification-Pytorch
A scaffold for Chinese text classification implemented in PyTorch, with a comparison of commonly used models.
☆18 · Apr 23, 2021 · Updated 5 years ago
quincyliang / nlp-data-augmentation
Data Augmentation for NLP.
☆294 · Dec 10, 2020 · Updated 5 years ago
FudanNLP / fastHan
fastHan is a Chinese natural language processing tool built on fastNLP and PyTorch, as convenient to call as spaCy.
☆761 · Dec 9, 2023 · Updated 2 years ago
jasonwei20 / eda_nlp
Data augmentation for NLP, presented at EMNLP 2019
☆1,651 · Mar 19, 2023 · Updated 3 years ago
zhaogaofeng611 / TextMatch
PyTorch-based Chinese semantic similarity matching models (ABCNN, Albert, Bert, BIMPM, DecomposableAttention, DistilBert, ESIM, RE2, Roberta, SiaGRU, XlNet)
☆797 · Mar 22, 2020 · Updated 6 years ago
loujie0822 / DeepIE
DeepIE: Deep Learning for Information Extraction
☆1,937 · Dec 9, 2022 · Updated 3 years ago
brightmart / nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
☆9,907 · Feb 6, 2026 · Updated 5 months ago
ZhuiyiTechnology / WoBERT
A Chinese BERT that uses words as the basic unit.
☆475 · Nov 18, 2021 · Updated 4 years ago
YunwenTechnology / QueryGeneration
☆90 · Jun 20, 2020 · Updated 6 years ago
shibing624 / pycorrector
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error correction and works out of the box.
☆6,495 · Updated this week
bojone / bert4keras
Keras implementation of transformers for humans
☆5,417 · Nov 11, 2024 · Updated last year
yongzhuo / nlp_xiaojiang
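Natural language processing (NLP), the Xiaojiang bot (a retrieval-based chitchat chatbot), BERT sentence embeddings and similarity (Sentence Similarity), XLNet sentence embeddings and similarity (text xlnet embedding), text classification (Text classification), entity extraction (ner, b…
☆1,535 · Sep 23, 2021 · Updated 4 years ago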