☆16Apr 1, 2022Updated 3 years ago
Alternatives and similar repositories for VideoTextSCM
Users that are interested in VideoTextSCM are comparing it to the libraries listed below
Sorting:
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformer☆106Mar 28, 2024Updated last year
- A new video text spotting framework with Transformer☆78May 23, 2022Updated 3 years ago
- Scene text recognition☆108Jul 7, 2022Updated 3 years ago
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"☆16May 3, 2023Updated 2 years ago
- PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition☆29Nov 11, 2022Updated 3 years ago
- TIoU metric in python3. Forked from https://github.com/Yuliang-Liu/TIoU-metric.☆26Nov 30, 2019Updated 6 years ago
- ☆32Nov 12, 2023Updated 2 years ago
- The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…☆282May 30, 2025Updated 9 months ago
- [AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer☆199Aug 31, 2023Updated 2 years ago
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer☆78Apr 9, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- Implementation of "GL-RG: Global-Local Representation Granularity for Video Captioning".☆27Dec 16, 2021Updated 4 years ago
- ☆69Oct 23, 2020Updated 5 years ago
- TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation (ECCV 2022)☆35Nov 12, 2024Updated last year
- This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…☆96Feb 21, 2023Updated 3 years ago
- 短信验证码模块☆10Jul 25, 2021Updated 4 years ago
- Reimplemention of "Mask-Guided Attention Network for Occluded Pedestrian Detection" based on mmdetection toolbox☆10Aug 20, 2020Updated 5 years ago
- Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining☆353Nov 29, 2023Updated 2 years ago
- Official Code for ICCV 2025 paper — Beyond Isolated Words: Diffusion Brush for Handwritten Text-Line Generation☆140Nov 24, 2025Updated 3 months ago
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)☆106Dec 9, 2021Updated 4 years ago
- ☆89Feb 9, 2025Updated last year
- A lecture summarization tool that uses AI and computer vision to summarize and index videos☆11Dec 8, 2022Updated 3 years ago
- ☆10Nov 21, 2023Updated 2 years ago
- ☆10Jan 9, 2025Updated last year
- [Computers & Graphics 2021] Pair-wise Relation Module for 3D Object Detection☆14Mar 6, 2022Updated 3 years ago
- ☆11Nov 10, 2023Updated 2 years ago
- A synthetic training data generator for a text recognition CNN☆10Jul 8, 2019Updated 6 years ago
- Implementation of Boundary Attributions for Normal (Vector) Explanations☆11Aug 13, 2021Updated 4 years ago
- ☆18Aug 7, 2025Updated 6 months ago
- 看IT英文原版书籍时遇到的专业词汇☆10Nov 8, 2016Updated 9 years ago
- [CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).☆95Updated this week
- A simply Python script to easily grab tags of an image on Danbooru☆10Mar 17, 2023Updated 2 years ago
- ☆10May 4, 2023Updated 2 years ago
- This code is for converting COCO json annotations to YOLO txt format (which both are common in object detection projects).☆10Feb 19, 2024Updated 2 years ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 3 months ago
- Integration of Clinical Embeddings with Neural ODEs☆12Jan 6, 2025Updated last year
- AWESOME of Tencent Cloud Base 😎☆10Feb 2, 2019Updated 7 years ago
- Code of the paper https://arxiv.org/abs/2009.11939. A defocus blur estimation method.☆10Jan 13, 2022Updated 4 years ago
- Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)☆102Jun 28, 2024Updated last year