JackHCC / Chinese-Tokenization
利用传统方法(N-gram,HMM等)、神经网络方法(CNN,LSTM等)和预训练方法(Bert等)的中文分词任务实现【The word segmentation task is realized by using traditional methods (n-gram, HMM, etc.), neural network methods (CNN, LSTM, etc.) and pre training methods (Bert, etc.)】
☆32Updated 2 years ago
Alternatives and similar repositories for Chinese-Tokenization:
Users that are interested in Chinese-Tokenization are comparing it to the libraries listed below
- 疫情期间网民情绪识别比赛分享+top1~3解决方案☆50Updated 4 years ago
- NLP文本增强的两种方式:同义词替换(利用word2vec词表)和回译☆74Updated 3 years ago
- 中文文本分类,Bert,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。☆32Updated 2 years ago
- NLP 相关的项目 如:词向量,文本分类,文本匹配,NER,信息抽取,文本生成以及NLP在电商中的应用☆34Updated last year
- 基于prompt的中文文本分类。☆54Updated last year
- ☆278Updated 2 years ago
- 中文无监督SimCSE Pytorch实现☆133Updated 3 years ago
- experiments of some semantic matching models and comparison of experimental results.☆160Updated last year
- 文本分类baseline:BERT、半监督学习UDA、对抗学习、数据增强☆101Updated 3 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆176Updated 3 years ago
- 利用bert预训练模型生成句向量或词向量☆27Updated 4 years ago
- bert pytorch模型微调用于的多标签文本分类☆131Updated 5 years ago
- 中文数据集下SimCSE+ESimCSE的实现☆191Updated 2 years ago
- 超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题☆129Updated 3 years ago
- SimCSE有监督与无监督实验复现☆149Updated last year
- ☆88Updated 3 years ago
- bert文本 多分类(情感分析)、bert-bilstm-crf序列标注任务(快递地址的序列标注任务)☆31Updated 3 years ago
- 利用huggingface实现文本分类☆58Updated 3 years ago
- 基于pytorch + bert的多标签文本分类(multi label text classification)☆102Updated last year
- Summary and comparison of Chinese classification models☆34Updated 2 years ago
- SimCSE中文语义相似度对比学习模型☆84Updated 3 years ago
- Pytorch进行长文本分类。这里用到的网络有:FastText、TextCNN、TextRNN、TextRCNN、Transformer☆47Updated 4 years ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆98Updated 2 years ago
- Reproducing the paper — Deep Short Text Classification with Knowledge Powered Attention☆102Updated 4 years ago
- Use deep models including BiLSTM, ABCNN, ESIM, RE2, BERT, etc. and evaluate on 5 Chinese NLP datasets: LCQMC, BQ Corpus, ChineseSTS, OCN…☆76Updated 2 years ago
- 第十届大学生服务外包大赛--A01商品短文本分类。基于CNN、Bi-LSTM、Attention、Adversarial等方法实现商品短文本分类任务,并基于Flask开发Web版本的交互演示界面。☆28Updated 2 years ago
- 基于CNN、RNN、GCN、BERT的中文文本分类☆43Updated last year
- 法研杯2021类案检索赛道三等奖方案☆52Updated 3 years ago
- 多模型中文cnews新闻文本分类☆54Updated 5 years ago
- 继续预训练中文bert☆30Updated 3 years ago