JackHCC / Chinese-Tokenization
利用传统方法(N-gram,HMM等)、神经网络方法(CNN,LSTM等)和预训练方法(Bert等)的中文分词任务实现【The word segmentation task is realized by using traditional methods (n-gram, HMM, etc.), neural network methods (CNN, LSTM, etc.) and pre training methods (Bert, etc.)】
☆32Updated 2 years ago
Alternatives and similar repositories for Chinese-Tokenization:
Users that are interested in Chinese-Tokenization are comparing it to the libraries listed below
- 疫情期间网民情绪识别比赛分享+top1~3解决方案☆51Updated 4 years ago
- 文本分类baseline:BERT、半监督学习UDA、对抗学习、数据增强☆100Updated 3 years ago
- NLP文本增强的两种方式:同义词替换(利用word2vec词表)和回译☆73Updated 3 years ago
- SimCSE中文语义相似度对比学习模型☆80Updated 2 years ago
- 基于pytorch_bert的中文多标签分类☆87Updated 3 years ago
- 中文数据集下SimCSE+ESimCSE的实现☆191Updated 2 years ago
- ☆277Updated 2 years ago
- bert pytorch模型微调用于的多标签文本分类☆128Updated 5 years ago
- 基于pytorch + bert的多标签文本分类(multi label text classification)☆101Updated last year
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆175Updated 3 years ago
- SimCSE有监督与无监督实验复现☆147Updated 11 months ago
- 基于Pytorch的文本分类框架,支持TextCNN、Bert、Electra等。☆61Updated last year
- Chinese-Text-Classification Project including bert-classification, textCNN and so on.☆151Updated 2 years ago
- 中文无监督SimCSE Pytorch实现☆133Updated 3 years ago
- CMeEE/CBLUE/NER实体识别☆125Updated 2 years ago
- ☆88Updated 3 years ago
- experiments of some semantic matching models and comparison of experimental results.☆161Updated last year
- 利用bert预训练模型生成句向量或词向量☆28Updated 4 years ago
- Pytorch进行长文本分类。这里用到的网络有:FastText、TextCNN、TextRNN、TextRCNN、Transformer☆46Updated 4 years ago
- 继续预训练中文bert☆30Updated 3 years ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆97Updated 2 years ago
- THUCNews中文文本分类数据集,该数据集包含84万篇新闻文档,总计14类;在该模型的基础上测试多个版本bert分类效果。☆58Updated 4 years ago
- 基于prompt的中文文本分类。☆54Updated last year
- 利用huggingface实现文本分类☆57Updated 2 years ago
- Summary and comparison of Chinese classification models☆34Updated 2 years ago
- SimCSE在中文上的复现,有监督+无监督☆272Updated 3 years ago
- 基于词汇信息融合的中文NER模型☆164Updated 2 years ago
- 无监督中文关键词抽取(Keyphrase Extraction),基于统计,基于图【LDA与PageRank(TextRank, TPR, Salience Rank, Single TPR等)】,基于嵌入【SIFRank等】,开箱即用!☆105Updated 2 years ago
- 利用bert和textcnn解决多标签文本分类的demo。☆31Updated 2 years ago
- Implementation of Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification.☆76Updated 3 years ago