[IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering
β17Feb 16, 2026Updated last month
Alternatives and similar repositories for ViTXT-GQA
Users that are interested in ViTXT-GQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR'25] ππ EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answeringβ46Jun 19, 2025Updated 9 months ago
- [ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.β15Mar 12, 2024Updated 2 years ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matchingβ31May 29, 2025Updated 10 months ago
- γCVPR 2025γSemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spottingβ16Jul 1, 2025Updated 8 months ago
- β12Oct 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- GCL implementationβ14Mar 7, 2024Updated 2 years ago
- This project summarizes the CLIP-based cross-modal hashing methods. Including DCMHT, MITH, DSPH, DNPH, TwDH (Two-Step Discrete Hashing foβ¦β49Sep 15, 2025Updated 6 months ago
- [NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spottingβ70Oct 9, 2023Updated 2 years ago
- FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradientsβ14Jan 22, 2025Updated last year
- β27Jan 28, 2026Updated 2 months ago
- β16Apr 21, 2025Updated 11 months ago
- [CVPR 2022] Accelerating Video Object Segmentation with Compressed Videoβ42Jul 3, 2022Updated 3 years ago
- β14Sep 9, 2024Updated last year
- β10Mar 31, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Project page for "Morphology-Aware Interactive Keypoint Estimation" accepted in MICCAI 2022.β13Sep 14, 2024Updated last year
- β10May 4, 2018Updated 7 years ago
- Code of the Grounded MUIE model, REAMOβ11Dec 3, 2024Updated last year
- β30Oct 16, 2025Updated 5 months ago
- Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter [ECCV2024]β24Mar 10, 2026Updated 2 weeks ago
- This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.β17Sep 2, 2025Updated 6 months ago
- code for paper `MemCap: Memorizing Style Knowledge for Image Captioning`β11Mar 17, 2020Updated 6 years ago
- this repo contains some useful metadata for Fashion IQ challenge: https://sites.google.com/view/lingir/fashion-iqβ15Jun 28, 2019Updated 6 years ago
- β27Oct 25, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- β21Apr 10, 2024Updated last year
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"β46Apr 27, 2025Updated 11 months ago
- Comprehensive benchmark for video text understandingβ28Jun 4, 2025Updated 9 months ago
- [CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agentsβ32Jun 3, 2025Updated 9 months ago
- Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025β17Jul 14, 2025Updated 8 months ago
- The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"β15Sep 19, 2024Updated last year
- [ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentanglβ¦β23Jul 28, 2025Updated 8 months ago
- β20Jul 28, 2025Updated 8 months ago
- The open-source code for paper 'Learning to Detect Video Saliency With HEVC Features'β13Nov 9, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- β37Apr 18, 2024Updated last year
- Official PyTorch Implementation of Bucketed Ranking-based Losses for Efficient Training of Object Detectors [ECCV2024]β26Apr 27, 2025Updated 11 months ago
- Towards Video Text Visual Question Answering: Benchmark and Baselineβ40Feb 26, 2024Updated 2 years ago
- Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval"β12Jun 17, 2019Updated 6 years ago
- About Codes for ACL 2023 paper: Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling.β20Jun 25, 2024Updated last year
- β23Aug 1, 2025Updated 7 months ago
- Source code for ACL-IJCNLP 2021 findings paper: MRN: A Locally and Globally Mention-Based Reasoning Network for Document-Level Relation Eβ¦β14Feb 25, 2022Updated 4 years ago