JackHCC / Chinese-TokenizationLinks
利用传统方法(N-gram,HMM等)、神经网络方法(CNN,LSTM等)和预训练方法(Bert等)的中文分词任务实现【The word segmentation task is realized by using traditional methods (n-gram, HMM, etc.), neural network methods (CNN, LSTM, etc.) and pre training methods (Bert, etc.)】
☆38Updated 3 years ago
Alternatives and similar repositories for Chinese-Tokenization
Users that are interested in Chinese-Tokenization are comparing it to the libraries listed below
Sorting:
- Chinese-Text-Classification Project including bert-classification, textCNN and so on.☆161Updated 3 years ago
- 基于pytorch + bert的多标签文本分类(multi label text classification)☆109Updated 2 years ago
- NLP文本增强的两种方式:同义词替换(利用word2vec词表)和回译☆78Updated 4 years ago
- SimCSE有监督与无监督实验复现☆152Updated last year
- SimCSE中文语义相似度对比学习模型☆91Updated 3 years ago
- 中文数据集下SimCSE+ESimCSE的实现☆192Updated 3 years ago
- ☆277Updated 3 years ago
- SimCSE在中文上的复现,有监督+无监督☆280Updated 11 months ago
- 文本相似度(匹配)计算,提供Baseline、训练、推理、指标分析...代码包含TensorFlow/Pytorch双版本☆179Updated 3 years ago
- 中文文本分类,Bert,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。☆34Updated 3 years ago
- THUCNews中文文本分类数据集,该数据集包含84万篇新闻文档,总计14类;在该模型的基础上测试多个版本bert分类效果。☆68Updated 4 years ago
- 基于pytorch_bert的中文多标签分类☆93Updated 4 years ago
- 超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题☆131Updated 4 years ago
- 基于pytorch+bert的中文文本分类☆91Updated 2 years ago
- 使用pytorch搭建textCNN实现中文文本分类☆131Updated 6 years ago
- 疫情期间网民情绪识别比赛分享+top1~3解决方案☆51Updated 5 years ago
- 阿里云天池零基础入门NLP比赛_rank4选手比赛总结: https://tianchi.aliyun.com/competition/entrance/531810/introduction☆193Updated 3 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆178Updated 4 years ago
- 利用huggingface实现文本分类☆58Updated 3 years ago
- 基于Pytorch的文本分类框架,支持TextCNN、Bert、Electra等。☆64Updated 2 years ago
- 文本聚类(Kmeans、DBSCAN、LDA、Single-pass)☆354Updated 4 years ago
- 基于prompt的中文文本分类。☆55Updated 2 years ago
- ASAP: A Chinese Review Dataset Towards Aspect Category Sentiment Analysis and Rating Prediction☆344Updated 4 years ago
- experiments of some semantic matching models and comparison of experimental results.☆163Updated 3 months ago
- 中文NER的那些事儿☆320Updated 2 years ago
- 多模型中文cnews新闻文本分类☆59Updated 5 years ago
- Pytorch进行长文本分类。这里用到的网络有:FastText、TextCNN、TextRNN、TextRCNN、Transformer☆48Updated 5 years ago
- Implemention of NER model on chinese dataset.☆74Updated 2 years ago
- A PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .☆218Updated 3 years ago
- 文 本分类baseline:BERT、半监督学习UDA、对抗学习、数据增强☆104Updated 4 years ago