Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"
☆567Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for ChineseBert
Users that are interested in ChineseBert are comparing it to the libraries listed below
Sorting:
- Code for the ACL2021 paper "Lexicon Enhanced Chinese Sequence Labelling Using BERT Adapter"☆345Jan 15, 2022Updated 4 years ago
- Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021☆239Aug 16, 2022Updated 3 years ago
- A Multi-modal Model Chinese Spell Checker Released on ACL2021.☆161Sep 21, 2023Updated 2 years ago
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,175Jul 15, 2025Updated 7 months ago
- 带拼音、字形特征的文本纠错模型☆11Jan 1, 2023Updated 3 years ago
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".☆68May 31, 2021Updated 4 years ago
- code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer☆1,004May 10, 2022Updated 3 years ago
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合☆5,528Feb 16, 2026Updated 2 weeks ago
- Source code for AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification☆551Jul 14, 2022Updated 3 years ago
- Open Language Pre-trained Model Zoo☆1,005Nov 18, 2021Updated 4 years ago
- Dynamic Connected Networks for Chinese Spelling Check☆50Apr 2, 2024Updated last year
- huggingface ChineseBert Tokenizer☆16Apr 16, 2022Updated 3 years ago
- The source code of 《 FGN:Fusion Glyph Network for Chinese Named Entity Recognition 》. SOTA Chinese NER method fusing both glyph represne…☆50Mar 22, 2020Updated 5 years ago
- 一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda☆1,879Mar 18, 2025Updated 11 months ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,230Feb 6, 2026Updated 3 weeks ago
- DeepIE: Deep Learning for Information Extraction☆1,943Dec 9, 2022Updated 3 years ago
- SpellGCN☆251Feb 28, 2021Updated 5 years ago
- This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"☆294Oct 10, 2019Updated 6 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year
- ☆272Jul 26, 2024Updated last year
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,641Oct 16, 2024Updated last year
- 搜索所有中文NLP数据集,附常用英文NLP数据集☆4,419Nov 21, 2022Updated 3 years ago
- Code for ACL 2020 paper `A Unified MRC Framework for Named Entity Recognition`☆679Jun 12, 2023Updated 2 years ago
- 🙈 An unofficial implementation of SoftMaskedBert based on huggingface/transformers.☆97Apr 26, 2021Updated 4 years ago
- PyTorch impelementations of BERT-based Spelling Error Correction Models. 基于BERT的文本纠错模型,使用PyTorch实现。☆279Feb 17, 2025Updated last year
- [TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining☆63Feb 22, 2024Updated 2 years ago
- A PyTorch-based knowledge distillation toolkit for natural language processing☆1,696May 8, 2023Updated 2 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,440Jul 15, 2025Updated 7 months ago
- Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)☆2,234Mar 11, 2023Updated 2 years ago
- ChineseBert用于中文拼写纠错☆43Mar 14, 2023Updated 2 years ago
- ☆880May 24, 2024Updated last year
- Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)☆700Jul 15, 2025Updated 7 months ago
- ☆129Nov 3, 2022Updated 3 years ago
- Code for ACL 2021 paper. MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition.☆68Nov 4, 2021Updated 4 years ago
- ERNIE Pytorch Version☆929Jul 26, 2023Updated 2 years ago
- pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。☆6,374Jan 12, 2026Updated last month
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆565Jun 9, 2023Updated 2 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,156Jan 22, 2024Updated 2 years ago