xiangyu-mm/UniFashion

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xiangyu-mm/UniFashion)

xiangyu-mm / UniFashion

The official code for paper "UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation"

☆38

Alternatives and similar repositories for UniFashion

Users that are interested in UniFashion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

anishaman6206 / Fashion-Recommender-and-Outfit-Matcher
View on GitHub
AI Enhanced Fashion Recommender
☆10May 1, 2026Updated 2 months ago
qzp2018 / UniECS
View on GitHub
Official implement of CIKM2025: 《UniECS: Unified Multimodal E-Commerce Search Framework with Gated Cross-modal Fusion》
☆21Sep 17, 2025Updated 10 months ago
iLearn-Lab / ACM-MM25-PUMA
View on GitHub
[ACM MM 2025] PUMA: Layer-Pruned Language Model for Efficient Unified Multimodal Retrieval with Modality-Adaptive Learning
☆18Jun 6, 2026Updated last month
Ma-Hongbo / StyleTailor
View on GitHub
Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”
☆35Mar 1, 2026Updated 4 months ago
Taited / sgdiff
View on GitHub
Official implementation of SGDiff (ACM MM '23)
☆36Nov 26, 2023Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
LgQu / TIGeR
View on GitHub
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16Jul 19, 2026Updated last week
wz0919 / DreamRunner
View on GitHub
[AAAI 2026] Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation
☆78Jun 11, 2025Updated last year
Monoxide-Chen / uncertainty_retrieval
View on GitHub
ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization
☆74Jan 30, 2024Updated 2 years ago
georgiarg / Prompt2Fashion
View on GitHub
Prompt2Fashion: An automatically generated fashion dataset
☆16Aug 12, 2024Updated last year
chunmeifeng / SPRC
View on GitHub
【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval
☆94Apr 16, 2024Updated 2 years ago
hltcoe / rank-k
View on GitHub
Repository for the listwise reranker Rank-K
☆16May 23, 2025Updated last year
ChenAnno / FashionERN_AAAI2024
View on GitHub
Official implementation for "FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval"
☆20Oct 27, 2025Updated 9 months ago
songyiren98 / CLIPFont
View on GitHub
Implementation of paper: CLIPFont: Texture Guided Vector WordArt Generation
☆18Oct 8, 2022Updated 3 years ago
BIGKnight / Understanding-Training-free-Diffusion-Guidance
View on GitHub
☆19Mar 18, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
YingqingHe / ScaleCrafter-ptl
View on GitHub
☆14Oct 16, 2023Updated 2 years ago
wtybest / EnMMDiT
View on GitHub
[TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation
☆15Mar 7, 2026Updated 4 months ago
liujin112 / PortraitDiffusion
View on GitHub
☆25Dec 7, 2023Updated 2 years ago
yiren-jian / BLIText
View on GitHub
[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
☆26Dec 5, 2023Updated 2 years ago
Chiangsonw / CaLa
View on GitHub
The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"
☆15Sep 19, 2024Updated last year
TIGER-AI-Lab / ABC
View on GitHub
ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]
☆20Aug 21, 2025Updated 11 months ago
junfeng0288 / MathReal
View on GitHub
☆16Aug 11, 2025Updated 11 months ago
WeihuangLin / INF-LLaVA
View on GitHub
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
☆42Aug 4, 2024Updated last year
zipengxuc / SpectralCLIP
View on GitHub
Code for WACV 2024 paper ✨ "SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective".
☆19Nov 4, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
TempleX98 / EasyRef
View on GitHub
[ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM
☆73Jul 16, 2025Updated last year
PeterGriffinJin / LMIndexer
View on GitHub
Language Models as Semantic Indexers (ICML 2024)
☆43May 2, 2024Updated 2 years ago
WangWenhao0716 / TIP-I2V
View on GitHub
[ICCV 2025] TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation
☆41Nov 27, 2024Updated last year
LHL3341 / ContextBLIP
View on GitHub
ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions [ACL 2024]
☆11May 17, 2024Updated 2 years ago
AlbertiPot / nar
View on GitHub
codes for Neural Architecture Ranker and detailed cell information datasets based on NAS-Bench series
☆12Jul 11, 2022Updated 4 years ago
Leon1207 / 3DRefTR
View on GitHub
This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"
☆26Aug 24, 2023Updated 2 years ago
jerrylin0809 / pac-bayesian-dendrogram-cut
View on GitHub
☆12May 10, 2021Updated 5 years ago
showlab / DIM
View on GitHub
[ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing
☆28May 11, 2026Updated 2 months ago
hithqd / ReasonBrain
View on GitHub
【ICML2026】Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
☆27May 18, 2026Updated 2 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
VSAnimator / Sketch-a-Sketch
View on GitHub
Controlling diffusion-based image generation with just a few strokes
☆66Dec 21, 2023Updated 2 years ago
FMXExpress / ControlNet-Sketch-To-Image
View on GitHub
Sketch an image and generate a Stable Diffusion image from it using ControlNet Scribble.
☆17May 29, 2023Updated 3 years ago
SihuiJi / FashionComposer
View on GitHub
☆24Dec 23, 2024Updated last year
nayeon7lee / factuality_enhanced_lm_hf
View on GitHub
☆13Nov 11, 2022Updated 3 years ago
qzp2018 / AnyTrans
View on GitHub
AnyTrans: Translate AnyText in the Image with Large Scale Models (EMNLP2024 Findings)
☆25Dec 11, 2024Updated last year
deepglint / UniME
View on GitHub
[ACM MM 2025] The official code of "Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs"
☆105Dec 8, 2025Updated 7 months ago
OmkarThawakar / composed-video-retrieval
View on GitHub
Composed Video Retrieval
☆62May 2, 2024Updated 2 years ago