Hxyz-123/GoMatching

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Hxyz-123/GoMatching)

Hxyz-123 / GoMatching

[NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching

☆34

Alternatives and similar repositories for GoMatching

Users that are interested in GoMatching are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MiliLab / LogicOCR
View on GitHub
[arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?
☆35Dec 1, 2025Updated 7 months ago
ViTAE-Transformer / SAMText
View on GitHub
The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"
☆16May 3, 2023Updated 3 years ago
Hxyz-123 / ReasoningOCR
View on GitHub
☆18Jul 24, 2025Updated last year
DREAMXFAR / FCL-Net
View on GitHub
This is the pytorch implementation of FCL-Net, accepted by NN'2022.
☆15May 25, 2022Updated 4 years ago
ViTAE-Transformer / ViTAE-Transformer-Scene-Text-Detection
View on GitHub
A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research w…
☆94Nov 12, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ViTAE-Transformer / DeepSolo
View on GitHub
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…
☆294May 30, 2025Updated last year
ymy-k / Hi-SAM
View on GitHub
[IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
☆366May 30, 2025Updated last year
ymy-k / DPText-DETR
View on GitHub
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
☆204Aug 31, 2023Updated 2 years ago
Yuliang-Liu / VimTS
View on GitHub
VimTS: A Unified Video and Image Text Spotter
☆79Nov 10, 2024Updated last year
zhousheng97 / ViTXT-GQA
View on GitHub
[IEEE TMM'25] Scene-Text Grounding for Text-Based Video Question Answering
☆17Feb 16, 2026Updated 5 months ago
adeline-cs / GTR
View on GitHub
Scene text recognition
☆108Jul 7, 2022Updated 4 years ago
weijiawu / TransVTSpotter
View on GitHub
A new video text spotting framework with Transformer
☆82May 23, 2022Updated 4 years ago
WHU-ZQH / UIKA
View on GitHub
Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis
☆17Mar 27, 2023Updated 3 years ago
weijiawu / BOVText-Benchmark
View on GitHub
[NeurIPS2021] BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting
☆71Oct 9, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mxin262 / Bridging-Text-Spotting
View on GitHub
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
☆75Jun 11, 2024Updated 2 years ago
gaozhitong / ATTA
View on GitHub
Code for "ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation" (NeurIPS 23)
☆16Apr 12, 2024Updated 2 years ago
WHU-ZQH / KGAN
View on GitHub
[TKDE] Knowledge Graph Augmented Network Towards Multiview Representation Learning for Aspect-based Sentiment Analysis
☆52Apr 4, 2024Updated 2 years ago
AronCao49 / Latte
View on GitHub
[ECCV 2024] Reliable Spatial-Temporal Voxels for Multi-Modal Test-Time Adaptation
☆18Jan 12, 2026Updated 6 months ago
mxin262 / ESTextSpotter
View on GitHub
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78Apr 9, 2024Updated 2 years ago
weijiawu / CisDQ
View on GitHub
☆13Nov 29, 2023Updated 2 years ago
DrLuo / SemiETS
View on GitHub
【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
☆17Jul 1, 2025Updated last year
csguoh / KD-LTR
View on GitHub
[MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"
☆16Nov 3, 2023Updated 2 years ago
Skyline-9 / Visionary-Vids
View on GitHub
Multi-modal transformer approach for natural language query based joint video summarization and highlight detection
☆17May 23, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cognitedata / Qwen-VL-finetune
View on GitHub
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
☆18Jun 5, 2024Updated 2 years ago
ZeningLin / PEneo
View on GitHub
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
☆41Apr 7, 2025Updated last year
wondervictor / AttendCRNN
View on GitHub
CRNN with Self-Attention
☆10Apr 8, 2018Updated 8 years ago
lcy0604 / CTRNet
View on GitHub
This repository is the implementation of "Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Contex…
☆97Feb 21, 2023Updated 3 years ago
LukeForeverYoung / UReader
View on GitHub
☆142Feb 13, 2024Updated 2 years ago
donavaly / SleukRith-Set
View on GitHub
☆14Jan 21, 2019Updated 7 years ago
WHU-ZQH / ChatGPT-vs.-BERT
View on GitHub
🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT
☆191Apr 17, 2023Updated 3 years ago
stoneMo / CIGN
View on GitHub
Official implementation for CIGN
☆17Sep 11, 2023Updated 2 years ago
MiliLab / AnesSuite
View on GitHub
Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"
☆25Feb 28, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
xdxie / WAS_WordArt-Segmentation
View on GitHub
The official codes and datasets for Artistic Text Segmentation (ECCV 2024).
☆30Sep 24, 2025Updated 10 months ago
kai422 / SCALE
View on GitHub
[ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.
☆15Mar 12, 2024Updated 2 years ago
Wei-ucas / TPSNet
View on GitHub
☆28Nov 29, 2023Updated 2 years ago
amazon-science / glass-text-spotting
View on GitHub
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102Jun 28, 2024Updated 2 years ago
janzd / awesome-scene-text
View on GitHub
A curated list of papers and resources for scene text detection and recognition
☆49Feb 9, 2026Updated 5 months ago
Xuchen-Li / Awesome-Vision-Language-Tracking
View on GitHub
A vision-language tracking paper list, articles related to visual language tracking have been documented.
☆46Dec 15, 2024Updated last year
lsabrinax / VideoTextSCM
View on GitHub
☆16Apr 1, 2022Updated 4 years ago