FaltingsA/SSM

[IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition

☆10

Alternatives and similar repositories for SSM

Users who are interested in SSM are comparing it to the repositories listed below.

wzx99 / CLIPOCR
☆38 · Oct 20, 2023 · Updated 2 years ago
CyrilSterling / LPV
The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)
☆26 · Sep 3, 2023 · Updated 2 years ago
Tianhao-Qi / BACL
Balanced Classification: A Unified Framework for Long-Tailed Object Detection (TMM 2023)
☆101 · Apr 18, 2025 · Updated last year
chenkenanalytic / handwritting_data_all
☆12 · Sep 25, 2022 · Updated 3 years ago
Gyann-z / FDP
☆16 · Apr 21, 2025 · Updated last year
ali-vilab / CAPability
What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness
☆28 · May 16, 2025 · Updated last year
lntzm / MESM
The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)
☆32 · Mar 29, 2024 · Updated 2 years ago
Eniac-Xie / FuseTeacher
☆12 · Nov 26, 2024 · Updated last year
LF-WEN / HGDM
Official implementation of the paper: "Hyperbolic Graph Diffusion Model"
☆14 · Jun 22, 2024 · Updated 2 years ago
IMCCretrieval / ProST
Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval (ICCV2023 Oral)
☆92 · Nov 2, 2023 · Updated 2 years ago
violet-liang / soundfield-reconstruction-np
Sound field reconstruction using neural processes with dynamic kernels
☆16 · Mar 25, 2025 · Updated last year
TongkunGuan / CCD
[ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition
☆153 · Jul 12, 2026 · Updated last week
irisXcoding / DocReal
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction
☆30 · Jun 28, 2023 · Updated 3 years ago
jingwangsg / MS-DETR
An official implementation for MS-DETR in ACL'23
☆17 · Jun 3, 2023 · Updated 3 years ago
ali-vilab / DreamVideo-Omni
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
☆16 · May 27, 2026 · Updated last month
qiwang067 / CoWorld
[NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…
☆28 · Oct 24, 2025 · Updated 8 months ago
Eniac-Xie / TEAM
☆22 · Jun 15, 2023 · Updated 3 years ago
iyyakuttiiyappan / CPLIP
☆21 · Jul 18, 2024 · Updated 2 years ago
ArieSeirack / DHVT
This is an official implementation of our NeurIPS 2022 paper "Bridging the Gap Between Vision Transformers and Convolutional Neural Netwo…
☆63 · Aug 20, 2025 · Updated 11 months ago
bytedance / DEADiff
[CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"
☆280 · Jul 5, 2025 · Updated last year
TongkunGuan / SIGA
[CVPR2023] Self-supervised Implicit Glyph Attention for Text Recognition
☆110 · Mar 9, 2025 · Updated last year
ayumiymk / DiG
Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`
☆74 · Feb 27, 2023 · Updated 3 years ago
cilabuniba / i-dream-my-painting
[WACV 2025] I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting
☆17 · Dec 29, 2025 · Updated 6 months ago
FangShancheng / ABINet-PP
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting
☆90 · Feb 11, 2023 · Updated 3 years ago
soon-yau / visconet
Official Repo of ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet
☆30 · Oct 17, 2024 · Updated last year
baopj / DenseEventsGrounding
☆17 · Dec 25, 2023 · Updated 2 years ago
thanhluantrinh / LDDGAN
☆29 · Jan 15, 2025 · Updated last year
MegEngine / MegDiffusion
MegEngine implementation of Diffusion Models.
☆19 · Aug 8, 2022 · Updated 3 years ago
lhaof / CGT
Cell Graph Transformer for Nuclei Classification, AAAI 2024
☆28 · Oct 8, 2024 · Updated last year
L-O-I / RRVF
☆18 · Aug 7, 2025 · Updated 11 months ago
PRIS-CV / On-the-fly-Category-Discovery
Code release for "On-the-fly Category Discovery" (CVPR 2023)
☆58 · Jul 15, 2023 · Updated 3 years ago
R-J96 / stainFuser
Controlling Diffusion for Faster Neural Style Transfer in Multi-Gigapixel Histology Images
☆24 · Jun 6, 2025 · Updated last year
wangyuxin87 / Tampered_sroie
The tampered text detection dataset
☆22 · Aug 23, 2023 · Updated 2 years ago
buttercutter / Mamba_SSM
A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)
☆23 · Jan 22, 2024 · Updated 2 years ago
TongkunGuan / RFN
[TCSVT2022] Industrial Scene Text Detection
☆84 · Mar 3, 2023 · Updated 3 years ago
ali-vilab / DreamRelation
[ICCV2025] The official code of "DreamRelation: Relation-Centric Video Customization"
☆27 · Feb 4, 2026 · Updated 5 months ago
alibaba-damo-academy / alice
☆58 · May 26, 2024 · Updated 2 years ago
4m4n5 / NASDM
PyTorch implementation of NASDM: Nuclei-Aware Semantic Histopathology Image Generation Using Diffusion Models
☆20 · May 9, 2024 · Updated 2 years ago
bcmi / TopNet-Object-Placement
An unofficial implementation of the paper "TopNet: Transformer-based Object Placement Network for Image Compositing", CVPR 2023.
☆32 · Feb 24, 2026 · Updated 4 months ago