[ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration"
☆60Apr 13, 2026Updated last month
Alternatives and similar repositories for AutoHDR
Users that are interested in AutoHDR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Ca…☆81May 18, 2026Updated 3 weeks ago
- [AAAI 2026 Oral] The official GitHub page of "PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Bas…☆70Apr 19, 2026Updated last month
- Reproducing the Past: A Dataset for Benchmarking Inscription Restoration (ACM MM'24)☆14Oct 15, 2025Updated 7 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆68Jun 6, 2024Updated 2 years ago
- ☆18Jul 24, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper im…☆21Dec 5, 2023Updated 2 years ago
- [EMNLP 2024] TongGu, a classical Chinese language model.☆69Sep 28, 2024Updated last year
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆21Dec 4, 2024Updated last year
- ☆18Dec 10, 2023Updated 2 years ago
- [IJCV 2025] Smaller But Better: Unifying Layout Generation with Smaller Large Language Models☆158Aug 3, 2025Updated 10 months ago
- ☆22May 30, 2023Updated 3 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆126Nov 13, 2023Updated 2 years ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆53Aug 5, 2024Updated last year
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.☆41Apr 7, 2025Updated last year
- [arXiv 25] OCRGenBench: A Comprehensive Benchmark for Evaluating OCR Generative Capabilities☆272Apr 13, 2026Updated last month
- ☆88May 31, 2026Updated last week
- [arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?☆35Dec 1, 2025Updated 6 months ago
- [PR 2024] HTQ: Exploring the High-Dimensional Trade-Off of Mixed-Precision Quantization☆12Jul 16, 2024Updated last year
- [𝗜𝗖𝗔𝗦𝗦𝗣 𝟮𝟬𝟮𝟱 𝗢𝗿𝗮𝗹] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sa…☆15May 2, 2026Updated last month
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆31Jun 3, 2025Updated last year
- ☆14Jan 15, 2026Updated 4 months ago
- Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks☆12Aug 12, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated last year
- ☆16Sep 22, 2023Updated 2 years ago
- This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text remova…☆274Dec 19, 2024Updated last year
- 古籍识别☆15May 19, 2021Updated 5 years ago
- [ICASSP 2025 Oral] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitud…☆29Feb 14, 2026Updated 3 months ago
- The official implement of CTRNet++.☆15Dec 30, 2024Updated last year
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- ☆17Nov 6, 2025Updated 7 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [TAI 2023] Appearance Enhancement for Camera-captured Document Images in the Wild☆58Aug 28, 2025Updated 9 months ago
- [EACL 2023] Transfer Knowledge from Natural Language to Electrocardiography: Can We Detect Cardiovascular Disease Through Language Models…☆17May 7, 2024Updated 2 years ago
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆25Mar 29, 2024Updated 2 years ago
- Cookbook for Crafting Good Code☆57Mar 19, 2024Updated 2 years ago
- [BIBM 2025] ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in ECG☆37Jun 2, 2026Updated last week
- ☆48Feb 7, 2025Updated last year
- This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diff…☆55May 21, 2025Updated last year