xuzf-git / WordSegment-and-PosTagLinks

基于Hmm模型和Viterbi算法实现中文分词及词性标注，使用最大概率算法进行优化。人民日报语料：分词(F1:96.189%)；词性标注(F1:97.934%)

☆26

Alternatives and similar repositories for WordSegment-and-PosTag

Users that are interested in WordSegment-and-PosTag are comparing it to the libraries listed below

Sorting:

lijqhs / text-classification-cn
中文文本分类实践，基于搜狗新闻语料库，采用传统机器学习方法以及预训练模型等方法
☆190Updated 4 years ago
hemingkx / WordSeg
A PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .
☆211Updated 3 years ago
lyj157175 / nlp_projects
NLP实战项目
☆102Updated 2 years ago
xinyi-code / Chinese-Text-Classification
Chinese-Text-Classification Project including bert-classification, textCNN and so on.
☆160Updated 3 years ago
Ricardokevins / Kevinpro-NLP-demo
All NLP you Need Here. 目前包含15个NLP demo的pytorch实现（大量代码借鉴于其他开源项目，原先是自己玩的，后来干脆也开源出来）
☆287Updated this week
qingyujean / document-level-classification
超长文本分类（大于1000字）；文档级/篇章级文本分类；主要是解决长距离依赖问题
☆132Updated 3 years ago
BeHappyForMe / Multi_Model_Classification
多模型中文cnews新闻文本分类
☆59Updated 5 years ago
taishan1994 / pytorch_bert_chinese_text_classification
基于pytorch+bert的中文文本分类
☆86Updated 2 years ago
jasoncao11 / nlp-notebook
NLP 领域常见任务的实现，包括新词发现、以及基于pytorch的词向量、中文文本分类、实体识别、摘要文本生成、句子相似度判断、三元组抽取、预训练模型等。
☆533Updated 2 years ago
BeiCunNan / Sentiment_Analysis_Imdb
Using Bert/Roberta + LSTM/GRU/BiLSTM/TextCNN to do the sentiment analysis on the imdb datasets.
☆142Updated 2 years ago
wzzzd / text_classifier_pytorch
基于Pytorch的文本分类框架，支持TextCNN、Bert、Electra等。
☆62Updated 2 years ago
BrownSweater / BERT_SMP2020-EWECT
在SMP2020的微博情绪分类任务上，微调在中文预料上预训练的BERT模型，进行文本分类。
☆111Updated 3 years ago
murray-z / multi_label_classification
基于pytorch + bert的多标签文本分类（multi label text classification）
☆107Updated 2 years ago
illiterate / BertClassifier
基于PyTorch的BERT中文文本分类模型（BERT Chinese text classification model implemented by PyTorch）
☆190Updated last year
jjljkjljk / SimCSE-Chinese
SimCSE中文语义相似度对比学习模型
☆87Updated 3 years ago
shibing624 / nlp-tutorial
自然语言处理（NLP）教程，包括：词向量，词法分析，预训练语言模型，文本分类，文本语义匹配，信息抽取，翻译，对话。
☆460Updated 3 years ago
JackHCC / Chinese-Tokenization
利用传统方法（N-gram，HMM等）、神经网络方法（CNN，LSTM等）和预训练方法（Bert等）的中文分词任务实现【The word segmentation task is realized by using traditional methods (n-gram, …
☆35Updated 3 years ago
kangyishuai / NEWS-TEXT-CLASSIFICATION
零基础入门NLP - 新闻文本分类正式赛第一名方案
☆234Updated 4 years ago
wjn1996 / scrapy_for_zh_wiki
基于scrapy的层次优先队列方法爬取中文维基百科，并自动抽取结构和半结构数据
☆155Updated 2 years ago
guolipa / nlp-algorithm
☆39Updated 2 years ago
rsanshierli / EasyBert
基于Pytorch的Bert应用，包括命名实体识别、情感分析、文本分类以及文本相似度等
☆799Updated 4 years ago
SunnyGJing / t5-pegasus-chinese
基于GOOGLE T5中文生成式模型的摘要生成/指代消解，支持batch批量生成，多进程
☆226Updated last year
WhiteGive-Boy / CWS-Hmm_BiLSTM-CRF
CWS中文分词 HMM BiLSTM+CRF pytorch 细致实现
☆48Updated 3 years ago
mzc421 / Pytorch-NLP
使用Pytorch框架对NLP方向上的文本分类、实体识别、三元组抽取做代码实战
☆192Updated last year
xinyi-code / NER-Pytorch-Chinese
Implemention of NER model on chinese dataset.
☆74Updated 2 years ago
zhoujx4 / NLP-Data-Augmentation
NLP文本增强的两种方式：同义词替换（利用word2vec词表）和回译
☆77Updated 4 years ago
DengBoCong / text-similarity
文本相似度（匹配）计算，提供Baseline、训练、推理、指标分析...代码包含TensorFlow/Pytorch双版本
☆178Updated 3 years ago
vdogmcgee / SimCSE-Chinese-Pytorch
SimCSE在中文上的复现，有监督+无监督
☆278Updated 5 months ago
Lisennlp / chinese_word_disambiguation
中文词义消歧项目（Chinese WSD），基于LSTM + ATTENTION模型架构，Pytorch实现。代码简单，上手容易。
☆17Updated 3 years ago
qingyujean / Magic-NLPer
关于机器学习，深度学习，自然语言处理等各种算法的实现、示例，与博客文章配套，论文复现等
☆208Updated 2 years ago