yyyyyxie/DNTextSpotter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yyyyyxie/DNTextSpotter)

yyyyyxie / DNTextSpotter

[ACMMM 2024]: Official implementation of the paper "DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training"

☆38

Alternatives and similar repositories for DNTextSpotter

Users that are interested in DNTextSpotter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BaofengZan / GOT-OCRv2-onnx
View on GitHub
用于学习GOT/Qwen/OnnxLLm
☆55Oct 8, 2024Updated last year
blisgard / BucketedRankingBasedLosses
View on GitHub
Official PyTorch Implementation of Bucketed Ranking-based Losses for Efficient Training of Object Detectors [ECCV2024]
☆26Apr 27, 2025Updated last year
Gyann-z / FDP
View on GitHub
☆16Apr 21, 2025Updated last year
MiliLab / LogicOCR
View on GitHub
[arXiv: 2505.12307] LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?
☆35Dec 1, 2025Updated 7 months ago
hxixixh / amo-release
View on GitHub
Official implementation for CVPR 2025 paper "AMO Sampler: Enhancing Text Rendering with Overshooting"
☆30May 3, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wzx99 / TMIM
View on GitHub
☆13Oct 17, 2024Updated last year
ming71 / GCL
View on GitHub
GCL implementation
☆14Mar 7, 2024Updated 2 years ago
nailwatts / FNIN
View on GitHub
FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients
☆13Jan 22, 2025Updated last year
hanqiu-hq / MAD
View on GitHub
☆14Sep 9, 2024Updated last year
Mountchicken / Union14M
View on GitHub
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
☆206Nov 1, 2023Updated 2 years ago
mittagessen / curt
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
ymy-k / Hi-SAM
View on GitHub
[IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
☆368May 30, 2025Updated last year
Wei-ucas / TPSNet
View on GitHub
☆28Nov 29, 2023Updated 2 years ago
ViTAE-Transformer / DeepSolo
View on GitHub
The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…
☆294May 30, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
PriNing / ODM
View on GitHub
ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting
☆45Apr 11, 2025Updated last year
dnjs3594 / Eigencontours
View on GitHub
☆27Oct 25, 2022Updated 3 years ago
Gumpest / MasKD
View on GitHub
Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.
☆10Mar 13, 2023Updated 3 years ago
Atten4Vis / MS-DETR
View on GitHub
[CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"
☆128Jul 10, 2024Updated 2 years ago
wenwenyu / TCM
View on GitHub
Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)
☆202Jun 17, 2024Updated 2 years ago
annosubmission / GRC-Cache
View on GitHub
☆16Mar 13, 2023Updated 3 years ago
bytedance / E2STR
View on GitHub
The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
☆55Jun 14, 2024Updated 2 years ago
amazon-science / glass-text-spotting
View on GitHub
Official implementation for "GLASS: Global to Local Attention for Scene-Text Spotting" (ECCV'22)
☆102Jun 28, 2024Updated 2 years ago
ymy-k / DPText-DETR
View on GitHub
[AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer
☆204Aug 31, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
iSEE-Laboratory / Frozen-DETR
View on GitHub
(NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"
☆34Mar 22, 2025Updated last year
zhang-chenxu / LSM-Adapter
View on GitHub
Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter [ECCV2024]
☆28Updated this week
FingerRec / awesome_video_self_supervised
View on GitHub
awesome video-based self-supervised learning methods in recently years
☆10Nov 26, 2020Updated 5 years ago
Tang1705 / SeeClear-NeurIPS24
View on GitHub
[NeurIPS 2024] SeeClear: This repo is the official implementation of "SeeClear: Semantic Distillation Enhances Pixel Condensation for Vid…
☆18Oct 8, 2024Updated last year
Ruiyang-061X / Awesome-MLLM-Reasoning
View on GitHub
📖Curated list about reasoning abilitiy of MLLM, including OpenAI o1, OpenAI o3-mini, and Slow-Thinking.
☆13Feb 7, 2025Updated last year
mrlooi / convert_to_coco
View on GitHub
Scripts for converting various datasets to MSCOCO annotation (json) files
☆12Jun 5, 2019Updated 7 years ago
csguoh / LEMMA
View on GitHub
[IJCAI2023] Your text images can be clearer!
☆59Nov 18, 2025Updated 8 months ago
yeezhu / UNIT
View on GitHub
PyTorch implementation of "UNIT: Unifying Image and Text Recognition in One Vision Encoder", NeurlPS 2024.
☆34Sep 26, 2024Updated last year
Hxyz-123 / GoMatching
View on GitHub
[NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching
☆34May 29, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hanabi7 / point_cloud_smplify
View on GitHub
smplify code for point cloud based HMR
☆10Jan 11, 2022Updated 4 years ago
DWCTOD / arXiv-CVPR2022-daily
View on GitHub
CVPR2022 update everyday!
☆11Apr 12, 2022Updated 4 years ago
ayumiymk / DiG
View on GitHub
Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`
☆74Feb 27, 2023Updated 3 years ago
ViTAE-Transformer / SAMText
View on GitHub
The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"
☆16May 3, 2023Updated 3 years ago
leaf1170124460 / Mask3D-SHIFT
View on GitHub
This repository provides a multi task benchmark for instance segmentation, depth estimation, and 3D object detection.
☆14Jul 29, 2023Updated 2 years ago
Eurus-Holmes / SynthText_CH
View on GitHub
[SynthText Chinese] Improved code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural I…
☆14Dec 8, 2022Updated 3 years ago
Hxyz-123 / ReasoningOCR
View on GitHub
☆18Jul 24, 2025Updated last year