Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)
☆12Mar 21, 2022Updated 4 years ago
Alternatives and similar repositories for VLPT-STD
Users that are interested in VLPT-STD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository of the ECCV 2024 paper "SelfGeo: Self-supervised and Geodesic-consistent Estimation of Keypoints on Deformable Shapes"☆15Sep 13, 2024Updated last year
- Thai font for amazfit bip☆12Jul 27, 2018Updated 7 years ago
- 这里将paddle中的ocr等模型转为onnx格式,并利用java版深度框架djl加载这些onnx模型进行推理预测尝试。☆14Nov 15, 2022Updated 3 years ago
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆20Jul 10, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆37Feb 21, 2026Updated 2 months ago
- A synthetic training data generator for a text recognition CNN☆10Jul 8, 2019Updated 6 years ago
- CV_JOB_interview_related_file☆10Jul 3, 2022Updated 3 years ago
- Code of paper "LP-Diff: Towards Improved Restoration of Real-World Degraded License Plate"☆19Jun 22, 2025Updated 10 months ago
- ☆29Aug 31, 2022Updated 3 years ago
- [ICCV 2023] Subclass-balancing contrastive learning for long-tailed recognition☆18Oct 30, 2023Updated 2 years ago
- ☆69Oct 23, 2020Updated 5 years ago
- TASR: Timestep-Aware Diffusion Model for Image Super-Resolution☆14Feb 21, 2025Updated last year
- Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining☆354Nov 29, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆15Aug 22, 2020Updated 5 years ago
- ☆13Mar 16, 2021Updated 5 years ago
- mkinitcpio scripts for loading LUKS secret from TPM 2.0☆11Jun 11, 2020Updated 5 years ago
- Dataset for Red Blood Cell Segmentation with Overlapping Cell Separation and Classification on Imbalanced Dataset☆13Dec 4, 2021Updated 4 years ago
- DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurrin…☆48Oct 22, 2024Updated last year
- 中译名著多译本翻译转述语料。语料仅限于用于科研教学活动。文本著作权归原著者。☆11Jul 26, 2018Updated 7 years ago
- The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…☆311Dec 2, 2024Updated last year
- Label smoothed Aggregation cross entropy loss for generalisation in sequence to sequence tasks.☆14Dec 17, 2019Updated 6 years ago
- Arbitrary Shape Text Detection via Segmentation with Probability Maps; accepted by TPAMI2022☆104Jun 30, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Another LaTex formula OCR tool☆15Feb 15, 2023Updated 3 years ago
- [AAAI2025] Revisiting Tampered Scene Text Detection in the Era of Generative AI☆68Apr 27, 2026Updated last week
- Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (…☆289Nov 29, 2024Updated last year
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- Code for SEEG: Semantic Energized Co-speech Gesture Generation☆33Dec 3, 2022Updated 3 years ago
- Classifying the Stanford Car dataset using ResNet 50☆25Aug 17, 2023Updated 2 years ago
- ☆13Sep 25, 2023Updated 2 years ago
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)☆105Dec 9, 2021Updated 4 years ago
- init☆11Sep 30, 2017Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 基于文本的垃圾短信分类_文本预处理☆13Jan 11, 2016Updated 10 years ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated last year
- [ECCV2022] The PyTorch implementation of paper "Equivariance and Invariance Inductive Bias for Learning from Insufficient Data"☆19Oct 12, 2022Updated 3 years ago
- ☆15Nov 26, 2023Updated 2 years ago
- Code for "Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation" (Findings of ACL 2024)☆16Jul 4, 2024Updated last year
- ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀☆25Sep 13, 2023Updated 2 years ago
- [ECCV '24] On the Utility of 3D Hand Poses for Action Recognition☆18Jun 8, 2025Updated 11 months ago