qingyujean/ssc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qingyujean/ssc)

qingyujean / ssc

基于“音形码”的中文字符串相似度计算方法

☆225

Alternatives and similar repositories for ssc

Users that are interested in ssc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wenyangchou / SimilarCharactor
View on GitHub
☆55Jun 7, 2021Updated 5 years ago
contr4l / SimilarCharacter
View on GitHub
对常用的6700个汉字进行音、形比较，输出音近字、形近字的列表。 # 相近字
☆482Mar 28, 2024Updated 2 years ago
charlesXu86 / char_featurizer
View on GitHub
汉字字符特征提取工具，可以提取出字符中的字音（声母、韵母、声调）、字形（偏旁、部首）、四角编码等特征，同时可作为tensor输入到模型
☆138May 25, 2020Updated 6 years ago
wdimmy / Automatic-Corpus-Generation
View on GitHub
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
☆295Oct 10, 2019Updated 6 years ago
shibing624 / pycorrector
View on GitHub
pycorrector is a toolkit for text error correction. 文本纠错，实现了Kenlm，T5，MacBERT，ChatGLM3，Qwen2.5等模型应用在纠错场景，开箱即用。
☆6,495Updated this week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
howl-anderson / hanzi_char_featurizer
View on GitHub
汉字字符特征提取器 (featurizer)，提取汉字的特征（发音特征、字形特征）用做深度学习的特征｜ A Chinese character feature extractor, which extracts the features of Chinese charac…
☆301Dec 29, 2025Updated 7 months ago
jiangnanboy / t5-onnx-corrector
View on GitHub
t5-model-onnx，中文拼写纠错，Chinese spelling correction。
☆15Sep 18, 2022Updated 3 years ago
hiyoung123 / YoungCorrector
View on GitHub
基于规则的文本纠错系统。
☆121Jul 14, 2021Updated 5 years ago
iqiyi / FASPell
View on GitHub
2019-SOTA简繁中文拼写检查工具：FASPell Chinese Spell Checker (Chinese Spell Check / 中文拼写检错 / 中文拼写纠错 / 中文拼写检查)
☆1,224Sep 3, 2022Updated 3 years ago
zhanzecheng / Time_NLP
View on GitHub
Time-NLP的python3版本中文时间表达词转换
☆520Dec 8, 2022Updated 3 years ago
gitabtion / SoftMaskedBert-PyTorch
View on GitHub
🙈 An unofficial implementation of SoftMaskedBert based on huggingface/transformers.
☆95Apr 26, 2021Updated 5 years ago
ACL2020SpellGCN / SpellGCN
View on GitHub
SpellGCN
☆249Feb 28, 2021Updated 5 years ago
houbb / nlp-hanzi-similar
View on GitHub
The hanzi similar tool.(汉字相似度计算工具，中文形近字算法。可用于手写汉字识别纠正，文本混淆等。)
☆298Feb 28, 2024Updated 2 years ago
Xuanfang1121 / CRASpell_pytorch
View on GitHub
☆16Jun 18, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
FDChongLi / TwoWaysToImproveCSC
View on GitHub
This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".
☆68May 31, 2021Updated 5 years ago
Ailln / cn2an
View on GitHub
📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）
☆765Apr 23, 2026Updated 3 months ago
taozhijiang / chinese_correct_wsd
View on GitHub
简易的中文纠错和消歧
☆290Aug 19, 2015Updated 10 years ago
CLUEbenchmark / LightLM
View on GitHub
高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task
☆60Jun 1, 2020Updated 6 years ago
destwang / CTC2021
View on GitHub
☆129Nov 3, 2022Updated 3 years ago
biendata-com / sohu2021-baseline
View on GitHub
☆11Mar 30, 2021Updated 5 years ago
kfcd / chaizi
View on GitHub
漢語拆字字典
☆817Jan 8, 2023Updated 3 years ago
ukiuki-satoshi / visedit
View on GitHub
python library for visualization string edit distance
☆10Oct 15, 2021Updated 4 years ago
howl-anderson / hanzi_chaizi
View on GitHub
汉字拆字库，可以将汉字拆解成偏旁部首，在机器学习中作为汉字的字形特征 | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components…
☆423Dec 29, 2025Updated 7 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
bojone / text_compare
View on GitHub
用python比较两个字符串差异，高亮差异部分
☆27Jul 20, 2020Updated 6 years ago
liuhuanyong / ChineseCixing
View on GitHub
WordForm,针对中文词语的笔画拆解，偏旁查询，拼音转换接口
☆67Aug 26, 2018Updated 7 years ago
Wall-ee / chinese2digits
View on GitHub
最好的汉字数字(中文数字)-阿拉伯数字转换工具。包含"点二八"，"负百分之四十"等众多汉语表达方法。NLP，机器人工程必备！ The Best Tool of Chinese Number to Digits
☆373Mar 26, 2023Updated 3 years ago
liushulinle / PLOME
View on GitHub
Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021
☆243Aug 16, 2022Updated 3 years ago
liuhuanyong / ChineseEmbedding
View on GitHub
Chinese Embedding collection incling token ,postag ,pinyin,dependency,word embedding.中文自然语言处理向量合集,包括字向量,拼音向量,词向量,词性向量,依存关系向量.共5种类型的向量
☆455Dec 15, 2018Updated 7 years ago
ccheng16 / correction
View on GitHub
Chinese "spelling" error correction
☆265Nov 28, 2017Updated 8 years ago
panchunguang / ccks_baidu_entity_link
View on GitHub
ccks baidu entity link 实体链接第一名
☆841Dec 19, 2023Updated 2 years ago
sunyilgdx / SIFRank_zh
View on GitHub
Keyphrase or Keyword Extraction 基于预训练模型的中文关键词抽取方法（论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained La…
☆431May 17, 2020Updated 6 years ago
ShannonAI / ChineseBert
View on GitHub
Code for ACL 2021 paper "ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information"
☆569Jul 26, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
System-T / DimSim
View on GitHub
☆127Mar 12, 2021Updated 5 years ago
brightmart / xlnet_zh
View on GitHub
中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large
☆228Sep 13, 2019Updated 6 years ago
mattzheng / py-kenlm-model
View on GitHub
python | 高效使用统计语言模型kenlm：新词发现、分词、智能纠错等
☆172Sep 27, 2019Updated 6 years ago
qiangsiwei / bert_distill
View on GitHub
BERT distillation（基于BERT的蒸馏实验）
☆316Jul 30, 2020Updated 5 years ago
benywon / ChineseBert
View on GitHub
This is a chinese Bert model specific for question answering
☆27Aug 8, 2019Updated 6 years ago
tongchangD / text_data_enhancement_with_LaserTagger
View on GitHub
Modify Chinese text, modified on LaserTagger Model. 文本复述，基于lasertagger做中文文本数据增强。
☆320Jan 3, 2024Updated 2 years ago
ymcui / Chinese-ELECTRA
View on GitHub
Pre-trained Chinese ELECTRA（中文ELECTRA预训练模型）
☆1,433Apr 19, 2026Updated 3 months ago