UranusSeven / qing_bureau_of_construction
This project employs Optical Character Recognition (OCR) to digitize historical records from the Qing manufacturing office.
☆25Updated 2 years ago
Alternatives and similar repositories for qing_bureau_of_construction
Users that are interested in qing_bureau_of_construction are comparing it to the libraries listed below
Sorting:
- ☆297Updated last month
- A cute toolkit for OCR with GUI, including image preprocessing and text recognition. Works out of the box. 一只小小的OCR工具箱,包括图像预处理和文字识别等功能,…☆13Updated last year
- 基于汇文明朝体,并以旧字形为标准的对中日韩越统一表意文字扩展区进行字形补充的项目。☆56Updated last week
- ancient-chat-llm: A LLM which is proficient in Chinese culture 古语说: 一个精通中国文化的大模型☆43Updated last year
- Catalog files for Kanseki Repository 各種漢籍目録☆35Updated 6 years ago
- Formula recognition based on LaTeX-OCR and ONNXRuntime.☆348Updated 6 months ago
- 繁體中文OCR文字識別數據集☆77Updated 3 years ago
- 文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSha…☆50Updated 8 months ago
- Videos Transcription and Translation with Faster Whisper and ChatGPT☆243Updated last year
- 古籍影文: 中文古籍开放数据集仓库☆14Updated last year
- doc2x docs☆57Updated 5 months ago
- The latest SQLite version of the China Biographical Database☆126Updated 8 months ago
- Library classification systems such as Library of Congress Classification, Chinese Library Classification (《中国图书馆分类法》).☆69Updated 5 years ago
- Based on RapidOCR, extract the PDF content☆166Updated last week
- 《现代汉语词典》(第7版)全文TXT☆267Updated 10 months ago
- Using GPT to parse PDF☆97Updated 8 months ago
- zi2zi implement with pytorch☆206Updated last year
- Cantonese Video Transcribe Service☆15Updated 4 months ago
- 通过paddle ocr实现pdf转markdown☆69Updated 7 months ago
- Table Structure Recognition☆19Updated 9 months ago
- ☆182Updated 6 months ago
- 开发一条从字体生成到字体制作的工具链,快速实现个性化订制字体制作。☆17Updated 4 months ago
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆193Updated 6 months ago
- chinese NLP corpus of chinese science fiction, chinese science fiction corpus: Archive of the Ark Plan of Ula Science Fiction Website 乌拉科…☆112Updated 2 years ago
- OCR pre-processing Toolbox☆16Updated 2 years ago
- Codebase for fine-tuning / evaluating nougat-based image2latex generation models☆152Updated 7 months ago
- End-to-end model training and deployment reference for handwritten Chinese text recognition, and can also be extended to other languages.☆165Updated 2 years ago
- ❤️中华民族二十四史:史记,汉书,后汉书,三国志等。☆22Updated last year
- 研究GOT-OCR-项目落地加速,不限语言☆60Updated 6 months ago
- Retrained Tesseract OCR model for Chinese☆109Updated 2 years ago