dalinvip / cw2vec
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
☆274Updated last year
Related projects ⓘ
Alternatives and complementary repositories for cw2vec
- 序列化标注工具,基于PyTorch实现BLSTM-CNN-CRF模型,CoNLL 2003 English NER测试集F1值为91.10%(word and char feature)。☆362Updated 6 years ago
- 中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large☆230Updated 5 years ago
- chinese and english corpus process script, python, c++, java☆194Updated 5 years ago
- Neural word segmentation with rich pretraining, code for ACL 2017 paper☆166Updated 5 years ago
- Chinese "spelling" error correction☆257Updated 6 years ago
- Word Similarity and Word Analogy Task scripts☆72Updated 6 years ago
- 将百度ernie的paddlepaddle模型转成tensorflow模型☆177Updated 5 years ago
- 搜集、整理、发布 预训练 中文 词向量/字向量,与 有志之士 共同 促进 中文 自然语言处理 的 发展。☆146Updated 6 years ago
- 利用预训练的中文模型实现基于bert的语义匹配模型 数据集为LCQMC官方数据☆193Updated 4 years ago
- Simple Solution for Multi-Criteria Chinese Word Segmentation☆300Updated 4 years ago
- Sequence labeling base on universal transformer (Transformer encoder) and CRF; 基于Universal Transformer + CRF 的中文分词和词性标注☆154Updated 5 years ago
- Deep contextualized word representations for Chinese☆152Updated 5 years ago
- A curated list of resources of chinese corpora for NLP(Natural Language Processing)☆73Updated 5 years ago
- word2vec/glove/swivel binary file on chinese corpus☆398Updated 8 years ago
- Named Entity Recognition for Chinese social media (Weibo). From EMNLP 2015 paper.☆546Updated 4 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆145Updated 5 years ago
- 速度更快、效果更好的中文新词发现☆510Updated 8 months ago
- Convolutional neural network and word embeddings for Chinese word segmentation☆142Updated 2 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆167Updated 5 years ago
- 基于字符训练词向量☆89Updated 6 years ago
- ☆278Updated 3 years ago
- Text-Similarity Method in Pytorch☆469Updated 5 years ago
- Source codes and corpora of paper "Iterated Dilated Convolutions for Chinese Word Segmentation"☆136Updated 3 years ago
- An collection of Chinese nlp corpus including basic Chinese syntatic wordset, semantic wordset, historic corpus and evaluate corpus. 中文自然…☆437Updated 5 years ago
- 文本匹配的相关模型DSSM,ESIM,ABCNN,BIMPM等,数据集为LCQMC官方数据☆467Updated 2 years ago
- QANet+DuReader中文机器阅读理解☆223Updated 6 years ago
- ☆329Updated 5 years ago
- 基于siamese-lstm的中文句子相似度计算☆130Updated 6 years ago
- 2019年百度的实体链指比赛(ccks2019),一个baseline☆114Updated 5 years ago