Bhashini-IITJ / BharatSceneTextDataset
Large-Scale Scene Text Dataset for Indic Languages
☆11Updated this week
Alternatives and similar repositories for BharatSceneTextDataset:
Users who are interested in BharatSceneTextDataset are comparing it to the repositories listed below
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆11Updated 2 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆82Updated last year
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Updated 2 years ago
- Handwritten Text Recognition in Few-shot Scenario☆20Updated 2 years ago
- PyTorch implementation of image captioning using a transformer-based model.☆65Updated last year
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆77Updated 9 months ago
- Implementation of LaTr: Layout-aware transformer for scene-text VQA, a novel multimodal architecture for Scene Text Visual Question Answer…☆53Updated 4 months ago
- Official PyTorch Implementation of DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis - ICDAR 2021☆77Updated 3 years ago
- Towards Video Text Visual Question Answering: Benchmark and Baseline☆38Updated last year
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …☆69Updated last year
- Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps[AAAI2021]☆57Updated 2 years ago
- Official Implementation of Few-shot Visual Relationship Co-localization☆25Updated 3 years ago
- OCR Annotations from Amazon Textract for Industry Documents Library☆102Updated 2 years ago
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Updated 3 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆14Updated last year
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆59Updated 9 months ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆134Updated 2 weeks ago
- Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval☆13Updated 3 years ago
- A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research w…☆85Updated 4 months ago
- CapDec: SOTA Zero Shot Image Captioning Using CLIP and GPT2, EMNLP 2022 (findings)☆191Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting☆32Updated 2 weeks ago
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.☆28Updated 2 years ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆12Updated last month
- TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)☆72Updated last year
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching☆16Updated 3 years ago