dalinvip / cw2vecLinks
cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
☆273Updated 2 years ago
Alternatives and similar repositories for cw2vec
Users that are interested in cw2vec are comparing it to the libraries listed below
Sorting:
- 中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large☆230Updated 5 years ago
- chinese and english corpus process script, python, c++, java☆197Updated 6 years ago
- Chinese "spelling" error correction☆262Updated 7 years ago
- Word Similarity and Word Analogy Task scripts☆70Updated 7 years ago
- word2vec/glove/swivel binary file on chinese corpus☆403Updated 8 years ago
- 序列化标注工具,基于PyTorch实现BLSTM-CNN-CRF模型,CoNLL 2003 English NER测试集F1值为91.10%(word and char feature)。☆361Updated 6 years ago
- ☆328Updated 6 years ago
- A curated list of resources of chinese corpora for NLP(Natural Language Processing)☆75Updated 5 years ago
- Neural word segmentation with rich pretraining, code for ACL 2017 paper☆164Updated 6 years ago
- Named Entity Recognition for Chinese social media (Weibo). From EMNLP 2015 paper.☆550Updated 4 years ago
- 2019年百度的实体链指比赛(ccks2019),一个baseline☆113Updated 5 years ago
- 将百度ernie的paddlepaddle模型转成tensorflow模型☆177Updated 5 years ago
- Deep contextualized word representations for Chinese☆150Updated 5 years ago
- TensorFlow code and pre-trained models for BERT and ERNIE☆145Updated 5 years ago
- ☆121Updated 7 years ago
- 搜集、整理、发布 预训练 中文 词向量/字向量,与 有志之士 共同 促进 中文 自然语言处理 的 发展。☆146Updated 7 years ago
- ☆278Updated 4 years ago
- 新词发现 基于词频、凝聚系数和左右邻接信息熵☆122Updated 5 years ago
- Sequence labeling base on universal transformer (Transformer encoder) and CRF; 基于Universal Transformer + CRF 的中文分词和词性标注☆158Updated 6 years ago
- 基于siamese-lstm的中文句子相似度计算☆130Updated 6 years ago
- 基于BERT的中文序列标注☆141Updated 6 years ago
- 速度更快、效果更好的中文新词发现☆511Updated last year
- 基于字符训练词向量☆89Updated 6 years ago
- 利用预训练的中文模型实现基于bert的语义匹配模型 数据集为LCQMC官方数据☆197Updated 5 years ago
- An easy-to-use named entity recognition (NER) toolkit, implemented the Bi-LSTM+CRF model in tensorflow.☆346Updated 7 years ago
- details☆263Updated 7 years ago
- 中文文本语义相似度(Chinese Semantic Text Similarity)语料库建设☆480Updated 7 years ago
- Joint Embeddings of Chinese Words, Characters, and Fine-grained Subcharacter Components☆99Updated 5 years ago
- 简易的中文纠错和消歧☆289Updated 9 years ago
- transform multi-label classification as sentence pair task, with more training data and information☆178Updated 5 years ago