Synthesize image datasets of documents in natural scenes with Python+Blender3D
☆60Aug 7, 2022Updated 3 years ago
Alternatives and similar repositories for sim2real-docs
Users that are interested in sim2real-docs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- This repository is a concise collection of well known deep learning based document binarization models.☆29Dec 24, 2022Updated 3 years ago
- NoiseGrad (and its extension NoiseGrad++) is a method to enhance explanations of artificial neural networks by adding noise to model weig…☆22May 11, 2023Updated 2 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆12Apr 28, 2024Updated 2 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆21Dec 4, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆29Dec 18, 2025Updated 4 months ago
- A Visualizer for prosodically annotated speech corpora☆12Oct 27, 2021Updated 4 years ago
- DIAR software for synthetic document image and groundtruth generation, with various degradation models for data augmentation☆137Nov 29, 2023Updated 2 years ago
- ☆15Dec 11, 2021Updated 4 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Low level control interface.☆15Jun 5, 2025Updated 10 months ago
- Contrast-guided Feature Adjustment Module for Visual Information Extraction☆30May 23, 2023Updated 2 years ago
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆17Jul 16, 2020Updated 5 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- Code for the ICDAR2021 paper "Visual FUDGE: Form Understanding via Dynamic Graph Editing"☆33Mar 4, 2022Updated 4 years ago
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- ☆18Jul 9, 2024Updated last year
- Intuitive interface for fine-tuning and retraining a Tesseract OCR language model☆10Jul 4, 2025Updated 10 months ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Sep 22, 2023Updated 2 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Jul 1, 2019Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆16Feb 16, 2023Updated 3 years ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated 2 years ago
- [TIM 2025] Towards Accurate Readings of Water Meters by Eliminating Transition Error: New Dataset and Effective Solution☆16Mar 5, 2025Updated last year
- Simple reflection of expressions☆34Jun 18, 2021Updated 4 years ago
- The source codes of TDv2 in paper: TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition.☆12Jul 28, 2022Updated 3 years ago
- Scripted fact system for SuperScript☆11Sep 15, 2017Updated 8 years ago
- The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…☆311Dec 2, 2024Updated last year
- Code for the paper 'A Biologically Inspired Visual Working Memory for Deep Networks'☆21Aug 6, 2019Updated 6 years ago
- Similarity Encoder (SimEc) Neural Network Framework for learning low dimensional similarity preserving representations☆17Jun 28, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 🤖 Autonomous Drone DJI Tello with OpenCv and ImageAI☆11Jun 18, 2019Updated 6 years ago
- ☆44Jul 9, 2024Updated last year
- PyTorch CTC Decoder bindings☆42Jan 31, 2018Updated 8 years ago
- The official implement of CTRNet++.☆15Dec 30, 2024Updated last year
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters☆16May 30, 2024Updated last year
- [CVPR'25] Conformal prediction for vision-language models. Enhancing VLMs deployment with reliability gurarantees.☆21Jun 7, 2025Updated 10 months ago
- Code for the Paper 'On the Connection Between Adversarial Robustness and Saliency Map Interpretability' by C. Etmann, S. Lunz, P. Maass, …☆16May 9, 2019Updated 6 years ago