alibaba/mm-diff

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alibaba/mm-diff)

alibaba / mm-diff

MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration

☆28

Alternatives and similar repositories for mm-diff

Users that are interested in mm-diff are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alibaba / EffiVED
View on GitHub
The official repository of EffiVED
☆19Jun 5, 2024Updated 2 years ago
alibaba / UVOSAM
View on GitHub
The official repository of UVOSAM
☆13Jun 5, 2024Updated 2 years ago
WesLee88524 / C-Drag-Official-Repo
View on GitHub
☆14Feb 28, 2025Updated last year
alibaba / wan-toy-transform
View on GitHub
This is a LoRA model finetuned on Wan-I2V-14B-480P. It turns things in the image into fluffy toys.
☆19Nov 10, 2025Updated 8 months ago
Rbrq03 / ClassDiffusion
View on GitHub
[ICLR2025] ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"
☆45Mar 11, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
agwmon / MuDI
View on GitHub
[NeurIPS 2024] MuDI: Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models
☆96Jan 17, 2025Updated last year
suchot / DevConsiStory
View on GitHub
experimental implementation of Consistory
☆20Jul 15, 2024Updated 2 years ago
kodenii / ORES
View on GitHub
ORES: Open-vocabulary Responsible Visual Synthesis
☆14Dec 12, 2023Updated 2 years ago
zhenghao977 / RetinaNet-Pytorch-36.4AP
View on GitHub
A pure torch implement of RetinaNet 36.4AP
☆10Aug 16, 2020Updated 5 years ago
kay-ck / GCMA
View on GitHub
[ACM MM2023] Code Release of GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos
☆12Mar 29, 2024Updated 2 years ago
Harxis / G2Face
View on GitHub
Official PyTorch Implementation for G2Face: High-Fidelity Reversible Face Anonymization via Generative and Geometric Priors (TIFS-2024)
☆17Aug 27, 2024Updated last year
Lucanyc / VISTA-Gym
View on GitHub
☆27Mar 17, 2026Updated 4 months ago
Dev-Mrha / DualPriorsCorrection
View on GitHub
☆14Oct 17, 2024Updated last year
hqhQAQ / PatchDPO
View on GitHub
[CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation
☆46Jul 1, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
rajatkoner08 / InstanceFormer
View on GitHub
☆19May 27, 2023Updated 3 years ago
kodenii / Responsible-Visual-Editing
View on GitHub
Responsible Visual Editing
☆15Jul 10, 2024Updated 2 years ago
eclipse-t2i / lambda-eclipse-inference
View on GitHub
[TMLR] Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent…
☆53Nov 29, 2024Updated last year
AMLResearchProject / all-arduino-nano-33-ble-sense-classifier
View on GitHub
The ALL Arduino Nano 33 BLE Sense Classifier is an experiment to explore how low powered microcontrollers, specifically the Arduino Nano …
☆10Jul 21, 2021Updated 5 years ago
CUC-MIPG / UnifyEdit
View on GitHub
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model
☆13Dec 29, 2024Updated last year
witcherofresearch / Qwen-Image-Style-Transfer
View on GitHub
The first model supporting content-preserving style transfer for Qwen-Image
☆33Jun 15, 2026Updated last month
7LFB / QAP
View on GitHub
☆12Dec 26, 2024Updated last year
alif-munim / minOFT
View on GitHub
A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.
☆14Nov 17, 2023Updated 2 years ago
VidCapBench / VidCapBench
View on GitHub
☆13May 17, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
lavinal712 / control-lora-v3
View on GitHub
☆11Dec 15, 2025Updated 7 months ago
kennyworkman / janeway
View on GitHub
immunobiology
☆15Jan 19, 2026Updated 6 months ago
JosephGesnouin / Asymmetrical-Bi-RNNs-to-encode-pedestrian-trajectories
View on GitHub
Code for the paper: Asymmetrical Bi-RNNs to encode pedestrian trajectories on trajnet++ dataset
☆11Oct 7, 2021Updated 4 years ago
kodenii / Ref-Diff
View on GitHub
Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models
☆21May 29, 2025Updated last year
junhahyung / MagiCapture
View on GitHub
☆11Feb 26, 2024Updated 2 years ago
mengtang-lab / selfcross-guidance
View on GitHub
Code for Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects
☆13Mar 5, 2026Updated 4 months ago
Egg-Hu / LoRA-Recycle
View on GitHub
[CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
☆14Jun 20, 2025Updated last year
ludc506 / InternVL-X
View on GitHub
☆16Mar 26, 2025Updated last year
aim-uofa / FreeCustom
View on GitHub
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
☆177Sep 1, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
silviazuffi / awol
View on GitHub
☆13Oct 14, 2024Updated last year
kodenii / ImaginaryNet
View on GitHub
ImaginaryNet: Learning Object Detectors without Real Images and Annotations
☆26Mar 11, 2023Updated 3 years ago
RhapsodyAILab / Awesome-MiniCPMV-Projects
View on GitHub
☆11Aug 19, 2024Updated last year
drmaxchen-gbc / HCC-deep-learning
View on GitHub
HCC_Deep_learning
☆18Jun 8, 2020Updated 6 years ago
zwl666666 / infusion
View on GitHub
Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
☆14Dec 19, 2025Updated 7 months ago
Jam1ezhang / RankCLIP
View on GitHub
Ranking-Consistent Language-Image Pretraining
☆15Oct 24, 2025Updated 8 months ago
Zehong-Ma / OVMR
View on GitHub
OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)
☆36Jun 16, 2025Updated last year