DCGM/SoftCTC

This repository contains the source code for SoftCTC. The original paper can be found here: https://arxiv.org/abs/2212.02135

☆19

Alternatives and similar repositories for SoftCTC

Users who are interested in SoftCTC are comparing it to the repositories listed below.

Pay20Y / PIMNet
☆16 · Jan 30, 2022 · Updated 4 years ago
onealwj / MVLT
PyTorch implementation of BMVC2022 paper Masked Vision-Language Transformers for Scene Text Recognition
☆28 · Nov 11, 2022 · Updated 3 years ago
kartikgill / taco-box
An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR
☆15 · Dec 4, 2021 · Updated 4 years ago
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10 · Dec 15, 2022 · Updated 3 years ago
amazon-science / textadain-robust-recognition
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
☆21 · Jul 26, 2022 · Updated 3 years ago
xuchennlp / S2T
The project for speech translation
☆12 · Sep 28, 2023 · Updated 2 years ago
HCIILAB / LAST
Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition
☆28 · Aug 29, 2023 · Updated 2 years ago
uthree / ddsp-vocoder
☆12 · Nov 7, 2024 · Updated last year
lancercat / OSOCR
☆10 · Nov 21, 2023 · Updated 2 years ago
Xiaomeng-Yang / STR_benchmark_cleansed
☆14 · May 26, 2023 · Updated 3 years ago
GoodNotes / GNHK-dataset
☆19 · Mar 28, 2022 · Updated 4 years ago
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13 · Dec 4, 2024 · Updated last year
Actasidiot / EFIFSTR
[ACM MM 2020] Exploring Font-independent Features for Scene Text Recognition
☆44 · Nov 30, 2020 · Updated 5 years ago
ThunderVVV / RCLSTR
Official PyTorch implementation of `[ACMMM 2023] Relational Contrastive Learning for Scene Text Recognition`
☆17 · Sep 22, 2023 · Updated 2 years ago
zhaominyiz / STIRER
STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition -- ACMMM 2023
☆14 · Dec 2, 2024 · Updated last year
Ashigarg123 / ShiftySpeech
☆15 · Jul 24, 2025 · Updated 11 months ago
amazon-science / semimtr-text-recognition
Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)
☆83 · Sep 12, 2023 · Updated 2 years ago
mushanshanshan / ESLTTS
ESLTTS dataset
☆16 · Feb 6, 2025 · Updated last year
ZehuaKcrissLi / GTR-Voice
☆16 · Nov 11, 2024 · Updated last year
reppy4620 / x-vits
☆14 · Aug 1, 2025 · Updated 11 months ago
christofw / multipitch_architectures
PyTorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", …
☆15 · Aug 26, 2022 · Updated 3 years ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18 · Oct 20, 2024 · Updated last year
csguoh / KD-LTR
[MM2023] An official implementation of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"
☆16 · Nov 3, 2023 · Updated 2 years ago
koninik / WordStylist
Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023
☆82 · Jun 25, 2024 · Updated 2 years ago
zhaominyiz / C3-STISR
Official Code for 'C3-STISR: Scene Text Image Super-resolution with Triple Clues' - IJCAI 2022
☆64 · Nov 20, 2022 · Updated 3 years ago
Caiyuan-Zheng / Consistency_Regularization_STR
It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.
☆28 · Jul 6, 2022 · Updated 4 years ago
iiclab / DecompST
☆15 · Nov 26, 2023 · Updated 2 years ago
Wei-ucas / TPSNet
☆28 · Nov 29, 2023 · Updated 2 years ago
Wang-Tianwei / Implicit-feature-alignment
PyTorch implementation for "Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter".
☆67 · Jun 15, 2021 · Updated 5 years ago
frotms / Curve-Text-Rectification-Using-Pairs-Of-Points
A way to rectify curve text images using spatial transformer by pairs of points.
☆40 · Dec 9, 2020 · Updated 5 years ago
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆18 · May 17, 2024 · Updated 2 years ago
ryanrudes / YTTTS
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆53 · Apr 1, 2021 · Updated 5 years ago
fgnt / speaker_reassignment
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
☆14 · Feb 5, 2025 · Updated last year
Chuhanxx / FontAdaptor
Data and implementation of ECCV2020 paper 'Adaptive Text Recognition through Visual Matching'
☆124 · Nov 22, 2022 · Updated 3 years ago
weijiawu / Polygon-free-Unconstrained-Scene-Text-Detection-with-Box-Annotations
Unconstrained Text Detection with Box Supervision and Dynamic Self-Training
☆34 · Nov 24, 2022 · Updated 3 years ago
ayumiymk / DiG
Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`
☆74 · Feb 27, 2023 · Updated 3 years ago
simplify23 / TPS_PP
Official PyTorch implementations of TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition (IJCAI 2023)
☆42 · Aug 13, 2023 · Updated 2 years ago
rohitsaluja22 / OCR-On-the-go
For ICDAR 2019 Paper on End-to-end License Plate and Scene Text Recognition with multi-head attention models
☆25 · Aug 14, 2021 · Updated 4 years ago
p1an-lin-jung / wv_tts
☆19 · Mar 22, 2024 · Updated 2 years ago