houbb/nlp-hanzi-similar

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/houbb/nlp-hanzi-similar)

houbb / nlp-hanzi-similar

The hanzi similar tool.(汉字相似度计算工具，中文形近字算法。可用于手写汉字识别纠正，文本混淆等。)

☆298

Alternatives and similar repositories for nlp-hanzi-similar

Users that are interested in nlp-hanzi-similar are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

houbb / segment
View on GitHub
The jieba-analysis tool for java.（基于结巴分词词库实现的更加灵活优雅易用，高性能的 java 分词实现。支持词性标注。）
☆157Feb 28, 2024Updated 2 years ago
kfcd / chaizi
View on GitHub
漢語拆字字典
☆815Jan 8, 2023Updated 3 years ago
Xuanfang1121 / CRASpell_pytorch
View on GitHub
☆16Jun 18, 2022Updated 4 years ago
houbb / word-checker
View on GitHub
🇨🇳🇬🇧Chinese and English word spelling corrector.(中文易错别字检测，中文拼写检测纠正。英文单词拼写校验工具)
☆264Dec 8, 2024Updated last year
qingyujean / ssc
View on GitHub
基于“音形码”的中文字符串相似度计算方法
☆225Jul 24, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
destwang / CTCResources
View on GitHub
☆270Jul 26, 2024Updated last year
wdimmy / Automatic-Corpus-Generation
View on GitHub
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
☆295Oct 10, 2019Updated 6 years ago
howl-anderson / hanzi_char_featurizer
View on GitHub
汉字字符特征提取器 (featurizer)，提取汉字的特征（发音特征、字形特征）用做深度学习的特征｜ A Chinese character feature extractor, which extracts the features of Chinese charac…
☆301Dec 29, 2025Updated 6 months ago
wenyangchou / SimilarCharactor
View on GitHub
☆55Jun 7, 2021Updated 5 years ago
liuhuanyong / ChineseCixing
View on GitHub
WordForm,针对中文词语的笔画拆解，偏旁查询，拼音转换接口
☆67Aug 26, 2018Updated 7 years ago
gitabtion / SoftMaskedBert-PyTorch
View on GitHub
🙈 An unofficial implementation of SoftMaskedBert based on huggingface/transformers.
☆95Apr 26, 2021Updated 5 years ago
DaDaMrX / ReaLiSe
View on GitHub
A Multi-modal Model Chinese Spell Checker Released on ACL2021.
☆161Sep 21, 2023Updated 2 years ago
destwang / CTC2021
View on GitHub
☆129Nov 3, 2022Updated 3 years ago
nwmqpa / SSHWebSocket
View on GitHub
SSH Over websockets
☆12Jul 12, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HillZhang1999 / MuCGEC
View on GitHub
MuCGEC中文纠错数据集及文本纠错SOTA模型开源；Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…
☆570Jun 9, 2023Updated 3 years ago
argb / hanzi-data
View on GitHub
这个项目会收集、整理各种汉语字词相关的数据，比如常用汉字、词组的列表，常用汉字的词频统计数据、HSK大纲要求掌握的字词数据等。
☆18Nov 5, 2019Updated 6 years ago
thunlp / SubCharTokenization
View on GitHub
☆46Feb 5, 2023Updated 3 years ago
ShannonAI / ChineseBert
View on GitHub
Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"
☆568Jul 26, 2023Updated 2 years ago
Pay20Y / PIMNet
View on GitHub
☆16Jan 30, 2022Updated 4 years ago
ACL2020SpellGCN / SpellGCN
View on GitHub
SpellGCN
☆249Feb 28, 2021Updated 5 years ago
jiangnanboy / t5-onnx-corrector
View on GitHub
t5-model-onnx，中文拼写纠错，Chinese spelling correction。
☆15Sep 18, 2022Updated 3 years ago
shibing624 / pycorrector
View on GitHub
pycorrector is a toolkit for text error correction. 文本纠错，实现了Kenlm，T5，MacBERT，ChatGLM3，Qwen2.5等模型应用在纠错场景，开箱即用。
☆6,494Jun 4, 2026Updated last month
System-T / DimSim
View on GitHub
☆127Mar 12, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
nghuyong / Chinese-text-correction-papers
View on GitHub
text correction papers
☆316Jan 23, 2024Updated 2 years ago
houbb / opencc4j
View on GitHub
🇨🇳Open Chinese Convert is an opensource project for conversion between Traditional Chinese and Simplified Chinese.(java 中文繁简体转换，支持台湾、香港…
☆574Sep 5, 2025Updated 10 months ago
iqiyi / FASPell
View on GitHub
2019-SOTA简繁中文拼写检查工具：FASPell Chinese Spell Checker (Chinese Spell Check / 中文拼写检错 / 中文拼写纠错 / 中文拼写检查)
☆1,224Sep 3, 2022Updated 3 years ago
jiangnanboy / macbert-java-onnx
View on GitHub
MacBERT for Chinese Spelling Correction, macbert中文拼写纠错
☆16May 23, 2022Updated 4 years ago
liushulinle / PLOME
View on GitHub
Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021
☆242Aug 16, 2022Updated 3 years ago
charlesXu86 / char_featurizer
View on GitHub
汉字字符特征提取工具，可以提取出字符中的字音（声母、韵母、声调）、字形（偏旁、部首）、四角编码等特征，同时可作为tensor输入到模型
☆138May 25, 2020Updated 6 years ago
blcuicall / CCL2022-CLTC
View on GitHub
CCL 2022 汉语学习者文本纠错评测
☆142Dec 16, 2022Updated 3 years ago
changyi7231 / NFE
View on GitHub
A PyTorch implementation of Knowledge Graph Embedding by Normalizing Flows.
☆10Nov 22, 2022Updated 3 years ago
hiyoung123 / YoungCorrector
View on GitHub
基于规则的文本纠错系统。
☆121Jul 14, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
esun-ai / traditional-chinese-text-recogn-dataset
View on GitHub
繁體中文OCR文字識別數據集
☆89Jun 4, 2025Updated last year
0xqq / ETL-1
View on GitHub
数据基本清洗包括日期、时间、数值、字符串、字符、金钱、数据库（mysql、postgresql、mongodb、hbase、hdfsmemcached）、加解密（md5、sha、base64、aes、rsa）、文件、http服务、正则表达式等，后期会不断更新。
☆13Jul 25, 2018Updated 7 years ago
HillZhang1999 / NaSGEC
View on GitHub
Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)
☆96Feb 18, 2025Updated last year
tommyMessi / crnn_ctc-centerloss
View on GitHub
ctcloss + centerloss crnn text recognition
☆200Jan 28, 2021Updated 5 years ago
pyxploiter / deep-splerge
View on GitHub
Implementation of research paper "Deep Splitting and Merging for Table Structure Decomposition"
☆61Nov 9, 2022Updated 3 years ago
masr2000 / CLG-CGEC
View on GitHub
☆51Dec 1, 2023Updated 2 years ago
Actasidiot / EFIFSTR
View on GitHub
[ACM MM 2020] Exploring Font-independent Features for Scene Text Recognition
☆44Nov 30, 2020Updated 5 years ago