SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network
☆24 · Dec 9, 2023 · Updated 2 years ago
Alternatives and similar repositories for T2T-BinFormer
Users who are interested in T2T-BinFormer are comparing it to the libraries listed below
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement ☆51 · Aug 5, 2024 · Updated last year
- Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks ☆12 · Aug 12, 2025 · Updated 6 months ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023 ☆16 · Nov 28, 2024 · Updated last year
- The largest VQA dataset for Vietnamese. Related to the text content in the image. ☆19 · Apr 9, 2025 · Updated 11 months ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta ☆16 · Nov 11, 2024 · Updated last year
- ☆17 · Nov 21, 2019 · Updated 6 years ago
- ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along w… ☆338 · Aug 22, 2024 · Updated last year
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024) ☆67 · Jun 6, 2024 · Updated last year
- ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for us… ☆63 · Mar 12, 2025 · Updated 11 months ago
- Code for the paper "UVDoc: Neural Grid-based Document Unwarping" - Dataset capture and creation ☆31 · May 27, 2024 · Updated last year
- Document Image Enhancement with GANs - TPAMI journal ☆217 · Mar 24, 2023 · Updated 2 years ago
- ☆44 · Jul 9, 2024 · Updated last year
- A repository of the latest work related to underwater image enhancement (awaiting continuous updates). It provides relevant underwater im… ☆18 · Jun 10, 2025 · Updated 8 months ago
- This is the dataset for the competition "Clinical Brain Computer Interfaces Challenge" to be held at WCCI 2020 at Glasgow. There are the … ☆10 · Jan 20, 2022 · Updated 4 years ago
- [CVPR2024] Dataset and Code of "CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement". ☆14 · Dec 14, 2024 · Updated last year
- Crawler based on a modified browser to detect online tracking. ☆11 · Jul 19, 2023 · Updated 2 years ago
- ☆11 · Aug 17, 2014 · Updated 11 years ago
- The official code for “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser”, TMM, 2024. ☆48 · Aug 26, 2024 · Updated last year
- A collection of papers and resources on scene text image super-resolution. ☆107 · May 4, 2023 · Updated 2 years ago
- My personal solutions to the CS231n assignments (Spring 2019). CS231n: "CNN" is a Computer Vision class taught at Stanford. ☆10 · Dec 8, 2022 · Updated 3 years ago
- A Novel Linear Array Pushbroom (LAP) Image Restoration Method. (Accepted by AAAI 2024) ☆12 · Jan 17, 2024 · Updated 2 years ago
- ☆11 · Jan 1, 2024 · Updated 2 years ago
- Chat app for django built with django-channels ☆10 · Dec 26, 2022 · Updated 3 years ago
- Awesome GAN-based Image Restoration ☆12 · Mar 11, 2024 · Updated last year
- ☆10 · Feb 19, 2021 · Updated 5 years ago
- ☆10 · Oct 20, 2025 · Updated 4 months ago
- ☆14 · Nov 2, 2022 · Updated 3 years ago
- Official repository accompanying the ICDAR 2023 paper ☆13 · Oct 3, 2023 · Updated 2 years ago
- An offline evaluation framework for sequence-based recommender systems ☆13 · May 17, 2019 · Updated 6 years ago
- ☆13 · Oct 25, 2024 · Updated last year
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition ☆16 · Jan 21, 2025 · Updated last year
- Official PyTorch implementation of DeepLIR. ☆14 · Jun 5, 2025 · Updated 9 months ago
- Dedicated to the interdisciplinary integration of AI for science. ☆10 · Aug 18, 2024 · Updated last year
- A novel Vietnamese dataset for evaluating handwritten text image recognition methods ☆16 · Sep 9, 2023 · Updated 2 years ago
- Inference, training and evaluation code for our models from the paper "Inv3D: a high-resolution 3D invoice dataset for template-guided si… ☆59 · Feb 7, 2024 · Updated 2 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf). ☆10 · Dec 8, 2022 · Updated 3 years ago
- Official code for our SIBGRAPI 2020 paper: "IDA: Improved Data Augmentation Applied to Salient Object Detection" ☆14 · Oct 12, 2021 · Updated 4 years ago
- CoGS: Controllable Generation and Search from Sketch and Style ☆13 · Mar 18, 2023 · Updated 2 years ago
- ☆12 · Mar 24, 2024 · Updated last year