[IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering
☆16Feb 16, 2026Updated 3 weeks ago
Alternatives and similar repositories for ViTXT-GQA
Users that are interested in ViTXT-GQA are comparing it to the libraries listed below
Sorting:
- [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering☆46Jun 19, 2025Updated 8 months ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆28May 29, 2025Updated 9 months ago
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting☆68Oct 9, 2023Updated 2 years ago
- HeadlessPivot☆29Feb 27, 2026Updated last week
- Data Programming for Text Detection in Documents using SPEAR☆12Mar 26, 2025Updated 11 months ago
- TensorRT In Docker☆11Dec 7, 2024Updated last year
- ☆16Updated this week
- Towards Video Text Visual Question Answering: Benchmark and Baseline☆40Feb 26, 2024Updated 2 years ago
- ⚡ Lightning-fast local log search - grep for logs, but actually good☆26Updated this week
- Numpy like ndarray and dataframe library for nim-lang.☆13Aug 6, 2020Updated 5 years ago
- Awesome-GenAITech: a curated list of Generative AI Techniques☆11Jul 11, 2023Updated 2 years ago
- [ICPR-2024] S-MultiMAE - A Multi-Ground Truth approach for RGB-D Saliency Detection☆12Dec 13, 2024Updated last year
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 7 months ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year
- ☆12Oct 5, 2024Updated last year
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Feb 24, 2023Updated 3 years ago
- STRExp is a framework that provides Explainability (XAI) to Scene Text Recognition (STR) models.☆11Nov 27, 2023Updated 2 years ago
- NLP Workshops☆12Apr 24, 2025Updated 10 months ago
- SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition☆10Apr 8, 2024Updated last year
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated 11 months ago
- FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients☆13Jan 22, 2025Updated last year
- Quantization of LLMs and benchmarking.☆10Apr 3, 2024Updated last year
- ☆10Mar 31, 2025Updated 11 months ago
- ☆12Dec 15, 2022Updated 3 years ago
- Ever wondered how popular your GitHub repo is compared to others?☆16Feb 14, 2026Updated 3 weeks ago
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition☆16Jan 21, 2025Updated last year
- Implementation of the DocLLM paper for Llama models.☆13Apr 6, 2025Updated 11 months ago
- Easy interactive prompts to create and validate data using JSON schema.☆10Feb 27, 2026Updated last week
- Json patch serializer☆13Dec 12, 2020Updated 5 years ago
- 【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting☆16Jul 1, 2025Updated 8 months ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- ☆10May 4, 2018Updated 7 years ago
- A curated list of resources on Document Layout Analysis☆11Aug 7, 2025Updated 7 months ago
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).☆13Jun 22, 2022Updated 3 years ago
- Dataset of scientific abstracts for the purpose of sentence classification☆10Sep 17, 2019Updated 6 years ago
- [arXiv 2024] PyTorch implementation of RRD: https://arxiv.org/abs/2407.12073☆15Dec 2, 2025Updated 3 months ago
- Code of the Grounded MUIE model, REAMO☆11Dec 3, 2024Updated last year
- Training PyTorch Faster-RCNN on custom dataset☆14Jun 2, 2021Updated 4 years ago