A Large Dataset of Historical Japanese Documents with Complex Layouts
☆37Apr 8, 2026Updated last month
Alternatives and similar repositories for HJDataset
Users that are interested in HJDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆30May 8, 2025Updated last year
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 5 months ago
- ☆32Dec 18, 2025Updated 5 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- ☆10Nov 21, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Hadwritten Text Recognition in Few-shot Scenario☆22Mar 25, 2023Updated 3 years ago
- ☆16Feb 16, 2023Updated 3 years ago
- SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition☆10Apr 8, 2024Updated 2 years ago
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- This repository contains digitized data from Pakistan Bureau of Statistics's 2017 Census results. We converted them to csv format to help…☆10Nov 11, 2021Updated 4 years ago
- 粤港澳大湾区(黄埔)国际算法算例大赛-古籍文档图像识别与分析算法比赛 Alphx队源码☆46Mar 16, 2023Updated 3 years ago
- ☆12Jun 24, 2022Updated 3 years ago
- ☆10Jan 22, 2023Updated 3 years ago
- ☆13Nov 8, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Save Stata dataset in LaTeX format☆13Feb 4, 2026Updated 3 months ago
- A module for Omeka S that provides an API for the Neatline 3 single page application☆18Mar 26, 2023Updated 3 years ago
- The Tripitaka Koreana in Han (TKH) Dataset and the Multiple Tripitaka in Han (MTH) Dataset for the research of Chinese character detectio…☆72Sep 23, 2020Updated 5 years ago
- ☆14Jan 15, 2026Updated 4 months ago
- ☆30Dec 20, 2021Updated 4 years ago
- ☆13May 6, 2026Updated 2 weeks ago
- Template code for exporting Stata regression output to beautiful LaTeX tables☆18May 27, 2016Updated 9 years ago
- Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team.☆14Aug 1, 2019Updated 6 years ago
- A lightweight framework for evaluating visual-language models.☆41Apr 20, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 古籍识别☆15May 19, 2021Updated 5 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Jun 27, 2023Updated 2 years ago
- Enforce exact/minimum versions of community-contributed packages.☆19May 10, 2024Updated 2 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- ☆71Jul 6, 2020Updated 5 years ago
- デジタル 化資料から作成したOCRテキストデータのngram頻度統計情報のデータセット☆15Jan 10, 2023Updated 3 years ago
- Package that compiles the microsoft dxgkrnl driver from WSL Kernel for using partitioned GPUs from hyperV☆19Jun 29, 2024Updated last year
- version 4.x of the Princeton Geniza Project☆12May 11, 2026Updated last week
- ☆25Nov 21, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Miqra According to the Masorah in two JSON formats☆12Updated this week
- Digital texts in Prakrit☆10Sep 14, 2025Updated 8 months ago
- A tool for improving the output of generic Arabic OCR systems using an n-gram based post-correction approach.☆10Sep 22, 2021Updated 4 years ago
- 100DaysOfCode☆11Aug 18, 2020Updated 5 years ago
- Notes and information for building the WSL-Kernel module and setting up GPU-PV in Linux guests.☆17Mar 22, 2026Updated last month
- Use any vision LLMs to perform OCR using LangChain☆22Jul 29, 2025Updated 9 months ago
- Marine Obstacle Detection Benchmark - Evaluation and Visualization Scripts☆35Mar 25, 2026Updated last month