CoinLQ / TripitakaCharacterDatasetLinks

从大藏经经文图片中切分出的单个字的图片数据集

☆9

Alternatives and similar repositories for TripitakaCharacterDataset

Users that are interested in TripitakaCharacterDataset are comparing it to the libraries listed below

Sorting:

tiantian91091317 / OCR-Corrector
利用语言模型，纠正OCR识别错误
☆469Updated 2 years ago
bitdata / ocrtable
Recognize tables and text from scanned images that contain tables. 从包含表格的扫描图片中识别表格和文字
☆256Updated 2 years ago
rockyzhengwu / document-ocr
一个相对完整的文档分析和识别项目
☆144Updated 5 years ago
Rid7 / OCR_DataSet
收集并整理有关OCR的数据集并统一标注格式，以便实验需要
☆12Updated 2 years ago
diaomin / crnn-mxnet-chinese-text-recognition
An implementation of CRNN (CNN+LSTM+warpCTC) on MxNet for chinese text recognition
☆215Updated 2 years ago
pavlo-melnyk / offline-HCCR
The official implementation of the "A high-performance CNN method for offline handwritten Chinese character recognition and visualization…
☆41Updated 3 years ago
howl-anderson / hanzi_char_featurizer
汉字字符特征提取器 (featurizer)，提取汉字的特征（发音特征、字形特征）用做深度学习的特征｜ A Chinese character feature extractor, which extracts the features of Chinese charac…
☆297Updated 4 years ago
wangchuan2008888 / cn-radical
提取中文的偏旁部首和拼音(一些生僻字的拼音没有补全,待优化)
☆43Updated 7 years ago
xiaofengShi / Image2Katex
公式图片ocr，输入图片输出对应的latex表达式
☆292Updated 5 years ago
HCIILAB / TKH_MTH_Datasets_Release
The Tripitaka Koreana in Han (TKH) Dataset and the Multiple Tripitaka in Han (MTH) Dataset for the research of Chinese character detectio…
☆64Updated 4 years ago
howl-anderson / hanzi_chaizi
汉字拆字库，可以将汉字拆解成偏旁部首，在机器学习中作为汉字的字形特征 | Hanzi Decomposition Library allows Chinese characters to be broken down into radicals and components…
☆384Updated 9 months ago
clumsycat / OpenCV-practice
some small functions Implemented in C ++ version
☆29Updated 9 years ago
shibing624 / judger
自动作文评分工具，支持中文、英文作文智能评分，支持评分模型自训练，支持WEKA处理模型数据，支持自定义评分算法。java开发。
☆53Updated 8 years ago
hwwu / ctpn-crnn
竖排书法汉字识别
☆64Updated 5 years ago
JaMesLiMers / Document_layout_analysis
Implementation code for document layout analysis (Hackathon 2020 in Suzhou)
☆81Updated 5 years ago
BiDAlab / DeepSignDB
☆25Updated last year
AlsoSprachZarathushtra / PDF_Form_OCR
Table Recognition and Content Extraction in PDF Files
☆23Updated 6 years ago
intel / handwritten-chinese-ocr-samples
End-to-end model training and deployment reference for handwritten Chinese text recognition, and can also be extended to other languages.
☆167Updated 2 years ago
yflv-yanxia / handwritten_text
☆81Updated last year
charlesXu86 / char_featurizer
汉字字符特征提取工具，可以提取出字符中的字音（声母、韵母、声调）、字形（偏旁、部首）、四角编码等特征，同时可作为tensor输入到模型
☆136Updated 5 years ago
JiangKui007 / jx_ocr
基于OPENCV和tesseract的中文扫描票据OCR识别。
☆94Updated 7 years ago
Walleclipse / ChineseAddress_OCR
Photographing Chinese-Address OCR implemented using CTPN+CTC+Address Correction. 拍照文档中文地址文字识别。
☆348Updated 5 years ago
beyondacm / Autochecker4Chinese
中文文本错别字检测以及自动纠错 / Autochecker & autocorrecter for chinese
☆296Updated 7 years ago
HCIILAB / MTHv2_Datasets_Release
☆64Updated 5 years ago
chineseocr / table-ocr
☆603Updated 11 months ago
HCIILAB / SCUT-EPT_Dataset_Release
The SCUT-EPT Dataset for the research of offline handwritten Chinese text recognition (HCTR) in educational documents has been released.
☆122Updated 4 years ago
JiangYanting / Pre-modern_Chinese_corpus_dataset
近代汉语语料库数据集自然语言处理语料库古代汉语古汉语文言文数字人文计算语言
☆165Updated 5 months ago
contr4l / SimilarCharacter
对常用的6700个汉字进行音、形比较，输出音近字、形近字的列表。 # 相近字
☆463Updated last year
tommyMessi / PST-table
表格结构解析新思路（表格识别新思路）
☆126Updated 4 years ago
Irene323 / GFTE
A GCN-based table structure recognition method
☆225Updated 5 years ago