ShannonAI/ChineseBert

Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"

☆568

Alternatives and similar repositories for ChineseBert

Users that are interested in ChineseBert are comparing it to the libraries listed below.

liuwei1206 / LEBERT
Code for the ACL2021 paper "Lexicon Enhanced Chinese Sequence Labelling Using BERT Adapter"
☆344 · Jan 15, 2022 · Updated 4 years ago
liushulinle / PLOME
Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021
☆242 · Aug 16, 2022 · Updated 3 years ago
DaDaMrX / ReaLiSe
A Multi-modal Model Chinese Spell Checker Released on ACL2021.
☆161 · Sep 21, 2023 · Updated 2 years ago
ymcui / Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
☆10,224 · Apr 19, 2026 · Updated 3 months ago
destwang / DCN
Dynamic Connected Networks for Chinese Spelling Check
☆50 · Apr 2, 2024 · Updated 2 years ago
FDChongLi / TwoWaysToImproveCSC
This is the official code for the paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".
☆68 · May 31, 2021 · Updated 5 years ago
JunnYu / ChineseBert_pytorch
huggingface ChineseBert Tokenizer
☆16 · Apr 16, 2022 · Updated 4 years ago
AidenHuen / FGN-NER
The source code of "FGN: Fusion Glyph Network for Chinese Named Entity Recognition". SOTA Chinese NER method fusing both glyph represne…
☆50 · Mar 22, 2020 · Updated 6 years ago
LeeSureman / Flat-Lattice-Transformer
Code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer
☆1,003 · May 10, 2022 · Updated 4 years ago
CoderMusou / MECT4CNER
Code for ACL 2021 paper. MECT: Multi-Metadata Embedding based Cross-Transformer for Chinese Named Entity Recognition.
☆67 · Nov 4, 2021 · Updated 4 years ago
daiyongya / markbert
☆16 · Jul 29, 2022 · Updated 3 years ago
gitabtion / SoftMaskedBert-PyTorch
🙈 An unofficial implementation of SoftMaskedBert based on huggingface/transformers.
☆95 · Apr 26, 2021 · Updated 5 years ago
ljynlp / W2NER
Source code for AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification
☆557 · Jul 14, 2022 · Updated 4 years ago
ACL2020SpellGCN / SpellGCN
SpellGCN
☆249 · Feb 28, 2021 · Updated 5 years ago
destwang / CTCResources
☆270 · Jul 26, 2024 · Updated last year
CLUEbenchmark / CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
☆4,273 · Feb 6, 2026 · Updated 5 months ago
lonePatient / awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
☆5,572 · Jun 19, 2026 · Updated last month
wdimmy / Automatic-Corpus-Generation
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
☆295 · Oct 10, 2019 · Updated 6 years ago
destwang / CTC2021
☆129 · Nov 3, 2022 · Updated 3 years ago
ZhuiyiTechnology / pretrained-models
Open Language Pre-trained Model Zoo
☆1,003 · Nov 18, 2021 · Updated 4 years ago
brightmart / roberta_zh
Chinese pretrained RoBERTa models: RoBERTa for Chinese
☆2,793 · Jul 22, 2024 · Updated 2 years ago
aopolin-lv / ECSpell
[TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining
☆65 · Feb 22, 2024 · Updated 2 years ago
425776024 / nlpcda
One-click Chinese data augmentation package; NLP data augmentation, BERT data augmentation, EDA: pip install nlpcda
☆1,880 · Mar 18, 2025 · Updated last year
gitabtion / BertBasedCorrectionModels
PyTorch implementations of BERT-based Spelling Error Correction Models.
☆277 · Feb 17, 2025 · Updated last year
CLUEbenchmark / CLUEDatasetSearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
☆4,458 · Nov 21, 2022 · Updated 3 years ago
ShannonAI / glyce
Code for NeurIPS 2019 - Glyce: Glyph-vectors for Chinese Character Representations
☆425 · Oct 3, 2023 · Updated 2 years ago
v-mipeng / LexiconAugmentedNER
Reject complicated operations for incorporating lexicon for Chinese NER.
☆437 · Jan 22, 2022 · Updated 4 years ago
wangwang110 / CSC
ChineseBert for Chinese spelling correction
☆43 · Mar 14, 2023 · Updated 3 years ago
loujie0822 / DeepIE
DeepIE: Deep Learning for Information Extraction
☆1,937 · Dec 9, 2022 · Updated 3 years ago
princeton-nlp / SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,655 · Oct 16, 2024 · Updated last year
dbiir / UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,110 · May 9, 2024 · Updated 2 years ago
ShannonAI / mrc-for-flat-nested-ner
Code for ACL 2020 paper `A Unified MRC Framework for Named Entity Recognition`
☆678 · Jun 12, 2023 · Updated 3 years ago
ymcui / MacBERT
Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)
☆717 · Apr 19, 2026 · Updated 3 months ago
dropreg / R-Drop
☆880 · May 24, 2024 · Updated 2 years ago
airaria / TextBrewer
A PyTorch-based knowledge distillation toolkit for natural language processing
☆1,705 · May 8, 2023 · Updated 3 years ago
ymcui / Chinese-ELECTRA
Pre-trained Chinese ELECTRA models
☆1,433 · Apr 19, 2026 · Updated 3 months ago
HillZhang1999 / MuCGEC
Open-source release of the MuCGEC Chinese error correction dataset and SOTA text correction models; Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…
☆570 · Jun 9, 2023 · Updated 3 years ago
lonePatient / BERT-NER-Pytorch
Chinese NER (Named Entity Recognition) using BERT (Softmax, CRF, Span)
☆2,240 · Mar 11, 2023 · Updated 3 years ago
shibing624 / pycorrector
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error correction and works out of the box.
☆6,495 · Jun 4, 2026 · Updated last month