Official implementation for "FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval"
☆19Oct 27, 2025Updated 4 months ago
Alternatives and similar repositories for FashionERN_AAAI2024
Users that are interested in FashionERN_AAAI2024 are comparing it to the libraries listed below
Sorting:
- ☆19Mar 5, 2025Updated last year
- ☆10Oct 25, 2024Updated last year
- ☆11May 17, 2024Updated last year
- ☆12Feb 2, 2023Updated 3 years ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆56May 27, 2025Updated 9 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- [CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis☆22Mar 23, 2025Updated 11 months ago
- Code for paper: "Region Proposals for Saliency Map Refinement for Weakly-supervised Disease Localisation and Classification"☆14Jun 29, 2021Updated 4 years ago
- In this work, we implement different cross-modal learning schemes such as Siamese Network, Correlational Network and Deep Cross-Modal Pro…☆11Aug 23, 2021Updated 4 years ago
- Dynamic Modality Interaction Modeling for Image-Text Retrieval. SIGIR'21☆70May 26, 2022Updated 3 years ago
- ☆86Apr 21, 2025Updated 10 months ago
- ☆19Jun 19, 2023Updated 2 years ago
- ☆17Oct 7, 2022Updated 3 years ago
- GRE 再要你命3K 背单词小程序☆22Jul 8, 2018Updated 7 years ago
- Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024☆21May 30, 2024Updated last year
- ☆24Dec 7, 2023Updated 2 years ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆27Dec 5, 2023Updated 2 years ago
- [NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing☆25Dec 8, 2024Updated last year
- Codes for the WACV 2022 paper: "SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning"☆23Apr 29, 2022Updated 3 years ago
- A vision-language-safety action architecture, named AEGIS, which contains a plug-and-play safety constraint layer formulated via control …☆64Feb 21, 2026Updated 2 weeks ago
- ☆66Updated this week
- This repository is the official implementation of FLUX-CustomID. It is capable of generating images based on your face image at a level e…☆25Nov 13, 2024Updated last year
- an unofficial implementation of dreamtuner☆24Feb 23, 2024Updated 2 years ago
- Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions".☆118Oct 9, 2025Updated 5 months ago
- EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing☆30Mar 29, 2024Updated last year
- State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP.☆45Nov 13, 2024Updated last year
- ☆32Apr 21, 2024Updated last year
- An Experimental Evaluation for Database Configuration Tuning☆28Mar 15, 2022Updated 3 years ago
- ☆34Feb 13, 2024Updated 2 years ago
- (EMNLP 2025 Main) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives☆37Dec 20, 2025Updated 2 months ago
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆38Dec 6, 2024Updated last year
- ☆43Aug 12, 2025Updated 6 months ago
- [AAAI 2025] LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation☆42Jan 7, 2025Updated last year
- Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…☆40Jun 26, 2024Updated last year
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆44Jul 14, 2024Updated last year
- [CVPR 2021] Official repository for "Prototype-supervised Adversarial Network for Targeted Attack of Deep Hashing"☆40Aug 28, 2022Updated 3 years ago
- Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)☆40Feb 15, 2023Updated 3 years ago
- An unofficial implementation of the paper “DiffEdit: Diffusion-based semantic image editing with mask guidance”☆39Jun 12, 2023Updated 2 years ago