Comprehensive benchmark for video text understanding
☆28Jun 4, 2025Updated 11 months ago
Alternatives and similar repositories for VidText
Users who are interested in VidText are comparing it to the libraries listed below.
- Optical Character Recognition (OCR / HTR) using Transformers☆11Aug 20, 2022Updated 3 years ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆56Mar 9, 2025Updated last year
- ☆33Jan 28, 2026Updated 3 months ago
- [ECAI 2024] First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending☆14Jun 16, 2025Updated 11 months ago
- This demo demonstrates the AI capabilities of the mcxn947. It displays the image captured by the camera on the LCD screen and performs fa…☆12May 18, 2026Updated last week
- 🔥🔥[NeurIPS2025]Exploring and mitigating semantic hallucinations in scene text perception and reasoning☆29Dec 11, 2025Updated 5 months ago
- Multiple-Person Multi-Camera Tracker☆13Feb 17, 2017Updated 9 years ago
- ☆13May 17, 2025Updated last year
- Harbin Institute of Technology (HIT) graduate course labs for Advanced Algorithm Design and Analysis (Spring 2020)☆23Jun 6, 2020Updated 5 years ago
- Add YOLOv3_tiny and data augmentation (clip, brighten, change saturation)☆14Jan 14, 2021Updated 5 years ago
- [IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering☆17Feb 16, 2026Updated 3 months ago
- [ACCV 2024 (Oral, Best Application Paper)] Official Implementation of NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tra…☆16Dec 30, 2025Updated 4 months ago
- [TPAMI2025] BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors☆16Apr 23, 2025Updated last year
- (CVPR 2025 Highlight) Official repository of paper "AODRaw: Towards RAW Object Detection in Diverse Conditions" (https://arxiv.org/pdf/24…☆24Apr 6, 2025Updated last year
- [ECCV2024] ModTr: Modality Translation for Object Detection Adaptation Without Forgetting Prior Knowledge☆19Nov 28, 2024Updated last year
- ☆25Jul 20, 2025Updated 10 months ago
- End-to-end Chinese scene text recognition.☆12Jun 27, 2022Updated 3 years ago
- Rui Qian, Xin Lai, Xirong Li: BADet: Boundary-Aware 3D Object Detection from Point Clouds (Pattern Recognition 2022: IF=8.518)☆13Feb 12, 2026Updated 3 months ago
- Mutual Information Predicts Hallucinations in Abstractive Summarization☆13Nov 14, 2022Updated 3 years ago
- ☆52Oct 20, 2025Updated 7 months ago
- ☆32Oct 17, 2025Updated 7 months ago
- Fast Recursive DNS server☆16Oct 13, 2015Updated 10 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆30Jul 12, 2023Updated 2 years ago
- Official implementation of CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images☆21Jun 24, 2024Updated last year
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆247Nov 6, 2025Updated 6 months ago
- SmallBigNet: Integrating Core and Contextual Views for Video Classification (CVPR2020)☆41Mar 10, 2022Updated 4 years ago
- ☆24Sep 12, 2024Updated last year
- [NeurIPS 2024 Oral] RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation☆19Dec 22, 2024Updated last year
- ☆33Sep 27, 2024Updated last year
- Multi-camera object tracking based on the TLD and GOTURN algorithms☆26Mar 22, 2020Updated 6 years ago
- Infrared and visible light fusion☆10Apr 17, 2019Updated 7 years ago
- ☆21Feb 29, 2024Updated 2 years ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆21Dec 10, 2024Updated last year
- 🔥🔥First-ever hour scale video understanding models☆622Jul 14, 2025Updated 10 months ago
- Find strongest response of convolutional layers on an image dataset. Automatically compute receptive field for any CNN layer.☆14Feb 19, 2021Updated 5 years ago
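- Age and gender recognition implemented in MXNet, with a model trained on a very large dataset.☆32Dec 24, 2024Updated last year
- Harbin Institute of Technology (main campus) computer science graduate course guide | HIT CS Postgraduate Guide☆407Dec 23, 2025Updated 5 months ago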
- DPS-Net: Deep Polarimetric Stereo Depth Estimation☆22Mar 22, 2024Updated 2 years ago
- 'Minimum Delay Object Detection From Video', ICCV 2019☆30Oct 3, 2019Updated 6 years ago