furkanbiten/stvqa_amazon_ocr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/furkanbiten/stvqa_amazon_ocr)

furkanbiten / stvqa_amazon_ocr

STVQA and TextVQA OCR results from Amazon Text in Image pipeline

☆12

Alternatives and similar repositories for stvqa_amazon_ocr

Users that are interested in stvqa_amazon_ocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uakarsh / latr
View on GitHub
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal architecture for Scene Text Visual Question Answer…
☆56Updated this week
furkanbiten / idl_data
View on GitHub
OCR Annotations from Amazon Textract for Industry Documents Library
☆103Aug 20, 2022Updated 3 years ago
AndresPMD / StacMR
View on GitHub
Scene Text Aware Cross Modal Retrieval (StacMR)
☆24Sep 3, 2021Updated 4 years ago
amazon-science / textadain-robust-recognition
View on GitHub
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21Jul 26, 2022Updated 3 years ago
aaberdam / Holistic_Pursuit
View on GitHub
An implementation of the Holistic Pursuit for the Multi-Layer Sparse Coding model. Contains a comparison to the projection pursuit algori…
☆19Dec 19, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
amazon-science / visfocus
View on GitHub
☆24Apr 29, 2025Updated last year
AndresPMD / Clip_CMR
View on GitHub
CLIP-based simple image-text matching baseline for COCO and F30K
☆15Sep 16, 2021Updated 4 years ago
AndresPMD / semantic_adaptive_margin
View on GitHub
WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
☆16Dec 10, 2021Updated 4 years ago
AndresPMD / Fine_Grained_Clf
View on GitHub
Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
☆25Nov 15, 2021Updated 4 years ago
ChenyuGAO-CS / SMA
View on GitHub
The imdb files with SBD-Trans OCR for TextVQA dataset.
☆11Nov 30, 2021Updated 4 years ago
AndresPMD / Pytorch-yolo-phoc
View on GitHub
Implementation on pytorch of the code from the ECCV 2018 paper - Single Shot Scene Text Retrieval
☆13Dec 15, 2021Updated 4 years ago
furkanbiten / object-bias
View on GitHub
Let there be clock in the beach - WACV 2022
☆15Nov 15, 2021Updated 4 years ago
amazon-science / semimtr-text-recognition
View on GitHub
Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)
☆83Sep 12, 2023Updated 2 years ago
microsoft / TAP
View on GitHub
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)
☆72May 22, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
aaberdam / AdaLISTA
View on GitHub
Ada-LISTA: Learned Solvers Adaptive to Varying Models
☆11Feb 18, 2020Updated 6 years ago
amazon-science / glass-text-spotting
View on GitHub
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102Jun 28, 2024Updated 2 years ago
MCLAB-OCR / KnowledgeMiningWithSceneText
View on GitHub
☆38Feb 4, 2023Updated 3 years ago
AndresPMD / GCN_classification
View on GitHub
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
☆65Dec 1, 2022Updated 3 years ago
furkanbiten / SelectiveTextStyleTransfer
View on GitHub
ICDAR 2019
☆25Aug 2, 2019Updated 6 years ago
phantrdat / cvpr20-scatter-text-recognizer
View on GitHub
Unofficial implementation of CVPR 2020 paper "SCATTER: Selective Context Attentional Scene Text Recognizer"
☆66Mar 3, 2022Updated 4 years ago
evanmiltenburg / MeasureDiversity
View on GitHub
Measure the diversity of image descriptions, repository for our COLING 2018 paper.
☆13Dec 29, 2019Updated 6 years ago
xiaojino / RUArt
View on GitHub
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
furkanbiten / GoodNews
View on GitHub
Good News Everyone! - CVPR 2019
☆130Apr 14, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
wangkai930418 / HCV_IIRC
View on GitHub
code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"
☆15Oct 28, 2022Updated 3 years ago
phiyodr / vqaloader
View on GitHub
PyTorch DataLoader for many VQA datasets
☆15Jan 10, 2023Updated 3 years ago
silvery107 / nmt-multi30k-pytorch
View on GitHub
Neural Machine Translation with Transformer on Multi30K
☆11Aug 27, 2021Updated 4 years ago
ronghanghu / vqa-maskrcnn-benchmark-m4c
View on GitHub
Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…
☆13Jan 30, 2020Updated 6 years ago
biswassanket / DocSegTr
View on GitHub
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
☆59Sep 9, 2024Updated last year
mailcorahul / auto_labeler
View on GitHub
auto_labeler - An all-in-one library to automatically label vision data
☆22Jan 17, 2025Updated last year
xinke-wang / Awesome-Text-VQA
View on GitHub
☆188May 8, 2024Updated 2 years ago
ashishjamarkattel / reinforment-learning-with-human-feedback
View on GitHub
☆17Dec 31, 2023Updated 2 years ago
facebookresearch / TextVQA
View on GitHub
Website for TextVQA dataset.
☆30Apr 30, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
lluisgomez / single-shot-str
View on GitHub
Single Shot Scene Text Retrieval, ECCV 2018. L. Gomez*, A. Mafla*, M. Rusiñol, D. Karatzas.
☆68May 13, 2019Updated 7 years ago
yuranusduke / CMT-Convolutional-NN-Meets-ViT
View on GitHub
Pytorch unofficial implementation of CMT
☆13Jul 16, 2021Updated 5 years ago
RhapsodyAILab / Awesome-MiniCPMV-Projects
View on GitHub
☆11Aug 19, 2024Updated last year
jsulam / ml-ista
View on GitHub
Demo for Multi-Layer ISTA and Multi-Layer FISTA algorithms for convolutional neural networks, as described in J. Sulam, A. Aberdam, A. Be…
☆29Nov 20, 2018Updated 7 years ago
marcopede / AreasOfAttention
View on GitHub
☆10Apr 20, 2018Updated 8 years ago
yilunzhao / RobuT
View on GitHub
Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"
☆15Feb 8, 2024Updated 2 years ago
google / mcic-coco
View on GitHub
☆24Dec 22, 2016Updated 9 years ago