aimagelab / HWD
☆15Updated 7 months ago
Related projects: ⓘ
- ☆71Updated 9 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"☆38Updated this week
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆143Updated 4 months ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆47Updated last month
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆61Updated 2 months ago
- [ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation☆39Updated last week
- DoodleFormer: Creative Sketch Drawing with Transformers (ECCV22)☆25Updated last year
- The official project of paper "Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing"☆39Updated 2 weeks ago
- ☆54Updated last year
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆37Updated 2 months ago
- The official implementation for BLIP4CIR with bi-directional training | Bi-directional Training for Composed Image Retrieval via Text Pro…☆23Updated 7 months ago
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆98Updated last month
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"☆54Updated last year
- Composed Video Retrieval☆42Updated 4 months ago
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)☆47Updated 3 months ago
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆119Updated 2 months ago
- [BMVC 2023] Zero-shot Composed Text-Image Retrieval☆42Updated last year
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆28Updated last month
- [ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities☆19Updated last month
- GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching☆18Updated 5 months ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆73Updated last month
- Official PyTorch Implementation of "DiffusionPen: Towards Controlling the Style of Handwritten Text Generation" - ECCV 2024☆12Updated 2 months ago
- The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…☆33Updated 2 years ago
- [CVPR 2023 (Highlight)] FAME-ViL: Multi-Tasking V+L Model for Heterogeneous Fashion Tasks☆48Updated 11 months ago
- Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023☆51Updated last year
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"☆17Updated last year
- ☆85Updated last month
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆58Updated last week
- Official implementation of High Fidelity Scene Text Synthesis.☆33Updated 3 weeks ago