PeterrrrLi / ResNet-Transformer-OCR-Pytorch
ResNet for License Plate Detection & Multi-Head Self Attention Transformer for OCR
☆13 · Updated last year
Related projects:
- ReViT - Residual Attention Vision Transformer ☆26 · Updated 6 months ago
- The official implementation repository of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with… ☆49 · Updated 2 months ago
- The official PyTorch implementation of our paper "MCA: Multidimensional collaborative attention in deep convolutional neural networks for… ☆46 · Updated last year
- A PyTorch implementation of FasterNet: Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks ☆24 · Updated last year
- Official repository of Slide-Transformer (CVPR2023) ☆157 · Updated 3 weeks ago
- Official code for 'Super-Resolution of License Plate Images Using Attention Modules and Sub-Pixel Convolution Layers' (Computers & Graphi… ☆26 · Updated last month
- InceptionNeXt: When Inception Meets ConvNeXt (CVPR 2024) ☆225 · Updated 10 months ago
- The official code of "Rethinking Local Perception in Lightweight Vision Transformer" ☆83 · Updated last year
- Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection" ☆14 · Updated 5 months ago
- ☆38 · Updated 5 months ago
- ☆46 · Updated last year
- ☆70 · Updated last month
- Code Implementation of EfficientVMamba ☆172 · Updated 5 months ago
- TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition ☆109 · Updated 9 months ago
- ☆62 · Updated 4 months ago
- SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network ☆15 · Updated 9 months ago
- The official project website of "KernelWarehouse: Rethinking the Design of Dynamic Convolution" (KW for short, accepted to ICML 2024) ☆89 · Updated 3 months ago
- (CVPR2024) RMT: Retentive Networks Meet Vision Transformer ☆272 · Updated last month
- Pytorch code for our CVPRw 2023 paper "Cascaded Zoom-in Detector for High Resolution Aerial Images" ☆50 · Updated last month
- ☆78 · Updated 5 months ago
- Cross-Layer Feature Pyramid Transformer for Small Object Detection in Aerial Images ☆38 · Updated 3 weeks ago
- Official ImageNet Model repository ☆212 · Updated last year
- [MICCAI'23] Official implementation of "RCS-YOLO: A Fast and High-Accuracy Object Detector for Brain Tumor Detection". ☆68 · Updated 3 weeks ago
- Implementation Code for the ICASSP 2023 paper "Efficient Multi-Scale Attention Module with Cross-Spatial Learning" and is available at:… ☆167 · Updated 4 months ago
- GroupMixAttention and GroupMixFormer ☆108 · Updated 9 months ago
- [IJCAI2023] An official implementation of the paper "Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement" ☆51 · Updated last year
- ☆79 · Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling ☆168 · Updated last month
- AFFNet-Unofficial Implementation ☆13 · Updated last year
- The official PyTorch implementation of “Mamba-YOLO: SSMs-based for Object Detection” ☆212 · Updated last month