Scene-Text-Detection-And-Recognition-Model_M504
☆25 · Aug 21, 2024 · Updated last year
Alternatives and similar repositories for Scene-Text-Detection-And-Recognition-Model_M504
Users that are interested in Scene-Text-Detection-And-Recognition-Model_M504 are comparing it to the libraries listed below.
- [ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model ☆16 · May 23, 2025 · Updated 10 months ago
- [ICASSP 2025] PDSeg: Patch-Wise Distillation and Controllable Image Generation for Weakly-Supervised Histopathology Tissue Segmentation ☆18 · May 23, 2025 · Updated 10 months ago
- ☆13 · Feb 27, 2024 · Updated 2 years ago
- ☆14 · Jan 27, 2026 · Updated last month
- [ICCV 2023] Class-incremental Continual Learning for Instance Segmentation with Image-level Weak Supervision ☆11 · Oct 3, 2023 · Updated 2 years ago
- Domain-Generalized Face Anti-Spoofing with Unknown Attacks. ICIP, 2023 ☆25 · Oct 17, 2023 · Updated 2 years ago
- Cheng-En Wu, Yi-Ming Chan and Chu-Song Chen "On Merging MobileNets for Efficient Multitask Inference", International Symposium on High-Pe… ☆10 · May 11, 2020 · Updated 5 years ago
- Sound Classification Dataset ☆11 · Oct 18, 2018 · Updated 7 years ago
- Optimal Planning for NTU YouBike Assignment with Operation Research and Machine Learning Techniques ☆11 · Aug 28, 2024 · Updated last year
- TrustAi website ☆13 · Sep 1, 2024 · Updated last year
- Apply an end-to-end model structure (ViT + GPT) to describe images in more detail, rather than traditional image captioning that only pro… ☆12 · Jan 15, 2025 · Updated last year
- Apply pre-trained models to help quickly grasp investment news, including three tasks: 1. summarization, 2. sentiment analysis, 3. domain … ☆14 · Sep 1, 2024 · Updated last year
- Yi-Min Chou, Chien-Hung Chen, Keng-Hao Liu, and Chu-Song Chen, "Changing Background to Foreground: An Augmentation Method Based on Condit… ☆12 · Oct 23, 2018 · Updated 7 years ago
- Continual Learning for Visual Search with Backward Consistent Feature Embedding, CVPR 2022 https://openaccess.thecvf.com/content/CVPR2022… ☆17 · Jun 14, 2023 · Updated 2 years ago
- Kuang-Yu Chang, Kung-Hung Lu, and Chu-Song Chen, "Aesthetic Critiques Generation for Photos," International Conference on Computer Vision… ☆18 · Oct 11, 2022 · Updated 3 years ago
- MCITlib: Multimodal Continual Instruction Tuning Library and Benchmark ☆67 · Feb 16, 2026 · Updated last month
- Codes and Datasets for the Paper: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extracti… ☆15 · Jun 5, 2024 · Updated last year
- Yi-Min Chou, Yi-Ming Chan, Jia-Hong Lee, Chih-Yi Chiu, Chu-Song Chen, "Unifying and Merging Well-trained Deep Neural Networks for Inferen… ☆22 · Jan 30, 2021 · Updated 5 years ago
- Deep Learning for Computer Vision 深度學習於電腦視覺 by Frank Wang 王鈺強 ☆25 · Jun 30, 2024 · Updated last year
- Unofficial Implementation of paper "On Entropy Approximation for Gaussian Mixture Random Vectors" in python ☆26 · Sep 1, 2024 · Updated last year
- Collection of Unsupervised Learning Methods for Vision-Language Models (VLMs) ☆90 · Feb 2, 2026 · Updated last month
- Steven C. Y. Hung, Jia-Hong Lee, Timmy S. T. Wan, Chein-Hung Chen, Yi-Ming Chan and Chu-Song Chen. "Increasingly Packing Multiple Facial-… ☆29 · Jan 8, 2021 · Updated 5 years ago
- A fast parallel implementation of RNN Transducer. ☆12 · Apr 8, 2025 · Updated 11 months ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning ☆23 · Sep 17, 2024 · Updated last year
- TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data ☆28 · Feb 28, 2024 · Updated 2 years ago
- CVPR25(Highlight)-Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior ☆81 · Jun 15, 2025 · Updated 9 months ago
- [ECCV2024] Immunizing text-to-image Models against Malicious Adaptation ☆17 · Jan 17, 2025 · Updated last year
- ☆14 · Jun 17, 2024 · Updated last year
- ☆30 · Jul 21, 2023 · Updated 2 years ago
- Jia-Hong Lee, Yi-Ming Chan, Ting-Yen Chen, and Chu-Song Chen, "Joint Estimation of Age and Gender from Unconstrained Face Images using Li… ☆40 · Dec 20, 2019 · Updated 6 years ago
- Collecting a list of datasets with day and night annotations ☆44 · Jul 30, 2018 · Updated 7 years ago
- Pytorch implementation of BigGAN Generator with pretrained weights ☆42 · Jun 2, 2020 · Updated 5 years ago
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025 ☆25 · Jan 25, 2025 · Updated last year
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning" ☆87 · Feb 14, 2025 · Updated last year
- A versatile generative model capable of designing topologies for wide range of analog circuits. ☆101 · Apr 26, 2025 · Updated 10 months ago
- The official dataset of the flowvqa project. ☆21 · Mar 26, 2024 · Updated 2 years ago
- ☆24 · Sep 20, 2024 · Updated last year
- ☆19 · Jun 3, 2024 · Updated last year
- ☆27 · Jun 5, 2024 · Updated last year