cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
☆274Mar 20, 2023Updated 2 years ago
Alternatives and similar repositories for cw2vec
Users that are interested in cw2vec are comparing it to the libraries listed below
Sorting:
- 基于字符训练词向量☆90Jun 6, 2018Updated 7 years ago
- chinese and english corpus process script, python, c++, java☆198Jan 22, 2019Updated 7 years ago
- This is a pytorch implement of cw2vec☆30Jan 18, 2019Updated 7 years ago
- Joint Embeddings of Chinese Words, Characters, and Fine-grained Subcharacter Components☆100Jun 21, 2019Updated 6 years ago
- Implementation of the cw2vec model☆29Jul 20, 2018Updated 7 years ago
- Word Similarity and Word Analogy Task scripts☆71May 12, 2018Updated 7 years ago
- cw2vec implementation in pytorch☆17Mar 6, 2019Updated 6 years ago
- Chinese NER using Lattice LSTM. Code for ACL 2018 paper.☆1,834Apr 25, 2019Updated 6 years ago
- ☆79Aug 19, 2016Updated 9 years ago
- Chinese Embedding collection incling token ,postag ,pinyin,dependency,word embedding.中文自然语言处理向量合集,包括字向量,拼音向量,词向量,词性向量,依存关系向量.共5种类型的向量☆454Dec 15, 2018Updated 7 years ago
- 100+ Chinese Word Vectors 上百种预训练中文词向量☆12,183Oct 30, 2023Updated 2 years ago
- 完全端到端的核心实体识别与情感预测☆35Jun 5, 2019Updated 6 years ago
- ☆301Aug 24, 2020Updated 5 years ago
- 新词发现☆66May 30, 2014Updated 11 years ago
- COS960: A Chinese Word Similarity Dataset of 960 Word Pairs☆36Jun 6, 2019Updated 6 years ago
- Facilitating the design, comparison and sharing of deep text matching models.☆3,855Aug 2, 2024Updated last year
- Neural word segmentation with rich pretraining, code for ACL 2017 paper☆164Jan 10, 2019Updated 7 years ago
- A BERT-based Chinese Text Encoder Enhanced by N-gram Representations☆647Jul 24, 2022Updated 3 years ago
- ☆31Jun 2, 2018Updated 7 years ago
- Pre-trained ELMo Representations for Many Languages☆1,460May 19, 2021Updated 4 years ago
- Keras implementation of Bilateral Multi-Perspective Matching.☆61Jun 22, 2017Updated 8 years ago
- Details of paper cw2vec☆82May 13, 2018Updated 7 years ago
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,897Jun 30, 2022Updated 3 years ago
- Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services☆4,900Feb 24, 2021Updated 5 years ago
- 自然语言基础模型☆563Apr 29, 2019Updated 6 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆9,862Feb 6, 2026Updated 3 weeks ago
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,175Jul 15, 2025Updated 7 months ago
- all kinds of text classification models and more with deep learning☆7,951Sep 28, 2023Updated 2 years ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"☆25Jul 8, 2022Updated 3 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,983Nov 21, 2022Updated 3 years ago
- Four word embedding models implemented in Python. Supporting arbitrary context features☆849Aug 22, 2019Updated 6 years ago
- Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT 中文实体识别与关系提取☆2,266Feb 1, 2024Updated 2 years ago
- ☆330May 10, 2019Updated 6 years ago
- 1st Place Solution for Zhihu Machine Learning Challenge . Implementation of various text-classification models.(知乎看山杯第一名解决方案)☆1,058Jul 16, 2018Updated 7 years ago
- pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。☆6,374Jan 12, 2026Updated last month
- ccks baidu entity link 实体链接 第一名☆843Dec 19, 2023Updated 2 years ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,613Mar 4, 2023Updated 2 years ago
- Language Technology Platform☆5,236Jun 2, 2025Updated 8 months ago