neumason / Font-ComponentLinks
西方学者普遍从汉字部件出发理解汉字,该库给出了中文部件分解的详细说明和数据库。
☆11Updated 2 years ago
Alternatives and similar repositories for Font-Component
Users that are interested in Font-Component are comparing it to the libraries listed below
Sorting:
- IDS data for CJK Unified Ideographs☆452Updated 2 years ago
- ☆14Updated 5 months ago
- zi2zi implement with pytorch☆212Updated last year
- 获取中文的笔画向量☆27Updated 3 years ago
- ☆23Updated 3 months ago
- 一种汉字字体生成算法☆13Updated last year
- 汉字拆字库,可以将汉字拆解成偏旁部首,在机器学习中作为汉字的字形特征 | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components…☆386Updated 9 months ago
- Instance Segmentation for Chinese Character Stroke Extraction, Datasets and Benchmarks.☆84Updated 2 years ago
- Yet another IDS (Ideographic Description Sequences) lists with MIT license☆127Updated 3 months ago
- Digitalization of the Table of General Standard Chinese Characters☆31Updated 8 months ago
- AI-assisted Deciphering Oracle Bone Script☆54Updated 7 months ago
- Ideographic Description Sequence Checker Tools☆24Updated 8 years ago
- Radical Analysis Network for Learning Hierarchies of Chinese Characters☆54Updated 5 years ago
- ☆65Updated 5 years ago
- This is a pre-trained LSTM model. This model can help you to segment unpunctuated historical Chinese texts. 這是基於 LSTM 的預訓練模型。此模型可幫助您為漢語古文…☆27Updated 3 years ago
- 研究所有汉字的结构,为NLP中汉字结构问题提供完备的解。☆16Updated last year
- ☆23Updated last year
- ☆93Updated 3 years ago
- GuwenModels: 古文自然语言处理模型合集, 收录互联网上的古文相关模型及资源. A collection of Classical Chinese natural language processing models, including Classical Ch…☆185Updated last year
- 汉字自动拆分系统开发☆102Updated last year
- 基于ChineseAlpaca微调的,专精与古汉语翻译、古汉语断句的大语言模型☆20Updated last year
- An enhanced zi2zi project with word-oriented data augmentation, feature combination, and transfer learning.☆39Updated 6 years ago
- Han character library for CJKV languages☆159Updated 4 years ago
- 古籍影文: 中文古籍开放数据集仓库☆21Updated last year
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆18Updated last year
- 古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆53Updated last year
- 汉字组件笔画数据☆15Updated 7 years ago
- W-Net-MSMC Initial commit☆34Updated 5 years ago
- CDLA: A Chinese document layout analysis (CDLA) dataset☆273Updated 3 years ago
- Decomposition data for 75,000 CJK ideographs; fork (with fixes) of☆70Updated 7 years ago