wanghaisheng/ocr-arxiv-daily

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wanghaisheng/ocr-arxiv-daily)

wanghaisheng / ocr-arxiv-daily

☆19

Alternatives and similar repositories for ocr-arxiv-daily

Users that are interested in ocr-arxiv-daily are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jfkuang / CFAM
View on GitHub
Contrast-guided Feature Adjustment Module for Visual Information Extraction
☆30May 23, 2023Updated 3 years ago
doc-analysis / DocBankLoader
View on GitHub
DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.
☆24Mar 17, 2021Updated 5 years ago
AprilYapingZhang / awesome-ocr
View on GitHub
☆18Apr 11, 2023Updated 3 years ago
H-Ambrose / NTable
View on GitHub
a dataset for camera-based table detection
☆16Jul 30, 2021Updated 4 years ago
LivingSkyTechnologies / Document_Layout_Segmentation
View on GitHub
Repository to use/train segmentation models for document layout analysis
☆19Jan 13, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
AILab-UniFI / cte-dataset
View on GitHub
CTE: Contextualized Table Extraction Dataset
☆17Feb 23, 2023Updated 3 years ago
zhaominyiz / STIRER
View on GitHub
STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023
☆14Dec 2, 2024Updated last year
namtuanly / WikiTableSet
View on GitHub
WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia
☆32Jun 12, 2025Updated last year
Pay20Y / PIMNet
View on GitHub
☆16Jan 30, 2022Updated 4 years ago
jfma-USTC / HRDoc
View on GitHub
Dataset and scripts for HRDoc
☆42Jun 21, 2023Updated 3 years ago
Xiaomeng-Yang / STR_benchmark_cleansed
View on GitHub
☆14May 26, 2023Updated 3 years ago
MAEHCM / AET
View on GitHub
Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”
☆18Dec 6, 2022Updated 3 years ago
uakarsh / TiLT-Implementation
View on GitHub
Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.
☆18Apr 23, 2023Updated 3 years ago
herobd / dessurt
View on GitHub
Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer
☆62Jan 11, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
DCGM / SoftCTC
View on GitHub
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
☆19Mar 7, 2023Updated 3 years ago
MaxKinny / TabRecSet
View on GitHub
A large scale camera-taken table detection and recognition dataset.
☆151Apr 9, 2026Updated 3 months ago
SakuraRiven / LANMS
View on GitHub
Locality-Aware Non-Maximum Suppression (C++ version)
☆23Aug 31, 2021Updated 4 years ago
sparkfish / augraphy
View on GitHub
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
☆560Jul 20, 2025Updated last year
allanj / LayoutLMv3-DocVQA
View on GitHub
Example codebase for fine-tuning layoutLMv3 on DocVQA
☆53Sep 19, 2022Updated 3 years ago
SCUT-DLVCLab / Document-AI-Recommendations
View on GitHub
Algorithms, papers, datasets, performance comparisons for Document AI.
☆209Mar 1, 2025Updated last year
VDIGPKU / STR_TPSearch
View on GitHub
☆21Mar 15, 2022Updated 4 years ago
facebookresearch / MultiplexedOCR
View on GitHub
Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR
☆80Dec 2, 2022Updated 3 years ago
zhaominyiz / EPiDA
View on GitHub
Official Code for 'EPiDA: An Easy Plug-in Data Augmentation Framework for High Performance Text Classification' - NAACL 2022
☆23May 9, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ymy-k / Hi-SAM
View on GitHub
[IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
☆368May 30, 2025Updated last year
jpWang / LiLT
View on GitHub
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…
☆366Oct 31, 2022Updated 3 years ago
segmind / cral
View on GitHub
Open Source Deep Learning Computer Vision (DLCV) Library
☆16Nov 26, 2020Updated 5 years ago
adriangrepo / segmentl
View on GitHub
Loss functions for Image Segmentation
☆12Mar 26, 2020Updated 6 years ago
LivingSkyTechnologies / Dense_Article_Dataset_DAD
View on GitHub
Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis
☆16Jan 13, 2022Updated 4 years ago
Pavansomisetty21 / Text-to-Images-Leveraging-Flux-AI-for-Text-to-Image-Generation
View on GitHub
we explores the fascinating domain of text-to-image generation using the powerful capabilities of the Flux API. The objective is to trans…
☆12Aug 14, 2024Updated last year
xudonmao / Multi-class_GAN
View on GitHub
☆14Nov 15, 2016Updated 9 years ago
IvanKuchin / pancreas_segmentation
View on GitHub
Tensorflow implementation of a 3D-CNN U-net with Grid Attention and DSV for pancreas segmentation trained on CT-82.
☆11Dec 31, 2024Updated last year
DandanGuo1993 / reweight-imbalance-classification-with-OT
View on GitHub
☆13Nov 8, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TencentARC / BTS
View on GitHub
BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
☆33Apr 16, 2024Updated 2 years ago
MAILAB-Yonsei / capsule_endoscopy_detection
View on GitHub
capsule_endoscopy_detection DACON challenge
☆15Dec 9, 2021Updated 4 years ago
weidafeng / TableCell
View on GitHub
Image-based table cell detection: a new dataset and an improved detection method.
☆55Jul 2, 2020Updated 6 years ago
ihaeyong / Maximum-Margin-LDAM
View on GitHub
Learning Imbalanced Datasets With Maximum Margin Losss
☆11Jun 17, 2023Updated 3 years ago
jacobkoenig / clDice-Loss
View on GitHub
Implementation of clDice - a Novel Connectivity-Preserving Loss Function for Vessel Segmentation (2019) in Keras/Tensorflow
☆13Apr 22, 2020Updated 6 years ago
ysig / learnable-typewriter
View on GitHub
The Learnable Typewriter: A Generative Approach to Text Line Analysis
☆34Oct 31, 2024Updated last year
phucty / wtabhtml
View on GitHub
Tool to parse wiki tables from the HTML dump of Wikipedia
☆11Jun 12, 2022Updated 4 years ago