hssip / FashionSAP
CVPR 2023 paper
☆52 Updated 2 years ago
Alternatives and similar repositories for FashionSAP
Users who are interested in FashionSAP are comparing it to the repositories listed below.
- [CVPR 2023 (Highlight)] FAME-ViL: Multi-Tasking V+L Model for Heterogeneous Fashion Tasks ☆55 Updated last year
- [ECCV 2022] FashionViL: Fashion-Focused V+L Representation Learning ☆61 Updated 2 years ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024) ☆87 Updated 7 months ago
- ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data. ☆85 Updated 2 years ago
- This is the official repository for the paper "OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data". … ☆69 Updated last year
- [CVPR(W) 2022] UIGR: Unified Interactive Garment Retrieval ☆22 Updated 3 years ago
- The official PyTorch implementation for arXiv'23 paper 'LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer' ☆100 Updated 4 months ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023) ☆108 Updated last year
- Official code of paper: MovingFashion: a Benchmark for the Video-to-Shop Challenge ☆46 Updated last year
- Text-Conditioned Fashion Image Editing ☆66 Updated 2 years ago
- Cheng-Fu Yang*, Wan-Cyuan Fan*, Fu-En Yang, Yu-Chiang Frank Wang, "LayoutTransformer: Scene Layout Generation with Conceptual and Spatial… ☆63 Updated 3 years ago
- Use CLIP to represent video for Retrieval Task ☆70 Updated 4 years ago
- ☆92 Updated 2 years ago
- (wip) Use LAION-AI's CLIP "conditioned prior" to generate CLIP image embeds from CLIP text embeds. ☆28 Updated 3 years ago
- Code for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-L… ☆37 Updated 3 years ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation ☆68 Updated last year
- FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions ☆55 Updated last year
- Masked Vision-Language Transformer in Fashion ☆35 Updated last year
- Modality-Agnostic Attention Fusion for visual search with text feedback ☆25 Updated 2 years ago
- [ICCV 2023] Controllable Person Image Synthesis with Pose‑Constrained Latent Diffusion ☆42 Updated last year
- Code for our ECCV-2022 work: Fashionformer: A simple, effective and unified baseline for human fashion segmentation and recognition ☆113 Updated last year
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models" ☆86 Updated 2 years ago
- [AAAI 2021] The official repo for the paper "KGDet: Keypoint-Guided Fashion Detection". ☆45 Updated 4 years ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024) ☆93 Updated 9 months ago
- Code for the Video Similarity Challenge. ☆80 Updated last year
- Training code for CLIP-FlanT5 ☆29 Updated last year
- ☆134 Updated last year
- Source code of the TextLap model, an LLM for text-2-layout generation. ☆15 Updated 10 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆34 Updated last year
- ☆41 Updated last year