Bhashini-IITJ / BharatSceneTextDatasetLinks
Large-Scale Scene Text Dataset for Indic Languages
☆14Updated last month
Alternatives and similar repositories for BharatSceneTextDataset
Users that are interested in BharatSceneTextDataset are comparing it to the libraries listed below
Sorting:
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆13Updated 2 months ago
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆11Updated 2 years ago
- Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…☆53Updated 7 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆25Updated last year
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 3 years ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆142Updated 2 months ago
- Pytorch implementation of image captioning using transformer-based model.☆66Updated 2 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆102Updated 2 years ago
- Official Implementation of Few-shot Visual Relationship Co-localization☆25Updated 3 years ago
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆80Updated 11 months ago
- [ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)☆41Updated last year
- Towards Video Text Visual Question Answering: Benchmark and Baseline☆38Updated last year
- ☆81Updated 3 months ago
- ☆38Updated last year
- Implementation of the "Learn No to Say Yes Better" paper.☆31Updated last week
- ☆24Updated last year
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆73Updated 8 months ago
- Hadwritten Text Recognition in Few-shot Scenario☆22Updated 2 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- ☆64Updated last year
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆66Updated 9 months ago
- ☆21Updated 2 years ago
- Let there be clock in the beach - WACV 2022☆15Updated 3 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Updated last year
- (ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning☆35Updated 9 months ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informat…☆53Updated last year
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆53Updated 11 months ago
- Official implementation for Dessurt: Document end-to-end self-supervised understanding and recognition transformer☆60Updated 2 years ago
- ☆43Updated 2 years ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆53Updated last year