Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model"
☆44Feb 15, 2024Updated 2 years ago
Alternatives and similar repositories for AltDiffusion
Users that are interested in AltDiffusion are comparing it to the libraries listed below
Sorting:
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models☆12Oct 28, 2024Updated last year
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 2 years ago
- ☆106Jan 6, 2026Updated 2 months ago
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 9 months ago
- A curated collection of prompts for Grok Imagine by xAI☆25Oct 19, 2025Updated 5 months ago
- ☆15May 13, 2024Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Jul 12, 2023Updated 2 years ago
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Google's Conceptual Captions Dataset translated into Korean☆23Aug 28, 2022Updated 3 years ago
- ☆11Jan 16, 2020Updated 6 years ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting☆14Dec 19, 2025Updated 3 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- Unofficial PyTorch Reimplementation of UniformAugment.☆15Sep 7, 2020Updated 5 years ago
- Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets☆12May 25, 2023Updated 2 years ago
- The official PyTorch implementation of "Bridging the Domain Gap towards Generalization in Automatic Colorization", [ECCV 2022].☆36Jul 14, 2022Updated 3 years ago
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- ☆17Jul 9, 2024Updated last year
- Intuitive interface for fine-tuning and retraining a Tesseract OCR language model☆10Jul 4, 2025Updated 8 months ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- A curated list of Computer Vision related conferences with dates and paper registration deadlines.☆48Nov 2, 2025Updated 4 months ago
- ☆15Jan 9, 2026Updated 2 months ago
- ☆19Nov 7, 2023Updated 2 years ago
- TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers☆21Jul 26, 2022Updated 3 years ago
- ☆41Mar 27, 2024Updated last year
- ☆12Dec 14, 2024Updated last year
- [AAAI 2026 Oral] The official GitHub page of "PosterVerse: A Full-Workflow Framework for Commercial-Grade Poster Generation with HTML-Bas…☆42Jan 30, 2026Updated last month
- Generating Easy-to-Understand Referring Expressions for Target Identifications☆18Aug 30, 2019Updated 6 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 10 months ago
- ☆11Oct 16, 2023Updated 2 years ago
- Uncertainty-Guided Pseudo-Labelling with Model Averaging☆11Updated this week
- Code and Data for ManyModalQA: Modality Disambiguation and QA over Diverse Inputs☆17Mar 2, 2020Updated 6 years ago
- PyTorch implementation of STR models for transfer learning in Indic Languages☆16Sep 20, 2021Updated 4 years ago
- reimplement of "GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition"☆16Nov 10, 2020Updated 5 years ago
- “Style Transfer as Data Augmentation: A Case Study on Named Entity Recognition” (EMNLP 2022)☆16Feb 2, 2023Updated 3 years ago