A Large Dataset of Historical Japanese Documents with Complex Layouts
☆37Jun 23, 2026Updated last week
Alternatives and similar repositories for HJDataset
Users that are interested in HJDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆30May 8, 2025Updated last year
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 6 months ago
- ☆33Dec 18, 2025Updated 6 months ago
- ☆10Nov 21, 2023Updated 2 years ago
- Hadwritten Text Recognition in Few-shot Scenario☆22Mar 25, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆17Feb 16, 2023Updated 3 years ago
- SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition☆10Apr 8, 2024Updated 2 years ago
- Official repository accompaying the ICDAR 2023 paper☆14Oct 3, 2023Updated 2 years ago
- FIWARE 401: IDM - Managing Users and Organizations☆10May 15, 2026Updated last month
- 粤港澳大湾区(黄埔)国际算法算例大赛-古籍文档图像识别与分析算法比赛 Alphx队源码☆46Mar 16, 2023Updated 3 years ago
- This repository has code to scrape FINRA Trade data☆10Oct 15, 2019Updated 6 years ago
- ☆10Jan 22, 2023Updated 3 years ago
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)☆105Dec 9, 2021Updated 4 years ago
- The Tripitaka Koreana in Han (TKH) Dataset and the Multiple Tripitaka in Han (MTH) Dataset for the research of Chinese character detectio…☆72Sep 23, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Jan 15, 2026Updated 5 months ago
- ☆15Sep 27, 2022Updated 3 years ago
- These are a set of data definitions for harmonising the data from IoT and related context data sources. They have been developed through …☆16Oct 3, 2019Updated 6 years ago
- Pyramid Mask Text Detector designed by SenseTime Video Intelligence Research team.☆14Aug 1, 2019Updated 6 years ago
- A lightweight framework for evaluating visual-language models.☆42Apr 20, 2026Updated 2 months ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 5 years ago
- ☆17Nov 6, 2025Updated 7 months ago
- ☆71Jul 6, 2020Updated 5 years ago
- Package that compiles the microsoft dxgkrnl driver from WSL Kernel for using partitioned GPUs from hyperV☆19Jun 29, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- version 4.x of the Princeton Geniza Project☆13Jun 12, 2026Updated 2 weeks ago
- A simple wrapper library for binding timm models as detectron2 backbones☆45May 31, 2023Updated 3 years ago
- ☆27Jul 3, 2025Updated 11 months ago
- ☆101Dec 8, 2022Updated 3 years ago
- ☆14Jan 11, 2013Updated 13 years ago
- ☆25Nov 21, 2023Updated 2 years ago
- Miqra According to the Masorah in two JSON formats☆12Jun 4, 2026Updated 3 weeks ago
- ☆14Sep 6, 2023Updated 2 years ago
- ☆27May 22, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A tool for improving the output of generic Arabic OCR systems using an n-gram based post-correction approach.☆10Sep 22, 2021Updated 4 years ago
- Notes and information for building the WSL-Kernel module and setting up GPU-PV in Linux guests.☆16Mar 22, 2026Updated 3 months ago
- Use any vision LLMs to perform OCR using LangChain☆23Jul 29, 2025Updated 11 months ago
- Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks☆13Jun 16, 2020Updated 6 years ago
- Local media center for those who want to control what they watch☆13Nov 6, 2025Updated 7 months ago
- Obsolete repo, merged into eynollah☆12Sep 29, 2025Updated 9 months ago
- Attention-based sequence-to-sequence model for handwritten word recognition☆65Sep 22, 2024Updated last year