JiaojiaoYe1994 / Awesome-DIffusionModels-paper
A curasted list of papers with the topic of Diffusion Models for Multi-Modal
☆26Updated last year
Alternatives and similar repositories for Awesome-DIffusionModels-paper:
Users that are interested in Awesome-DIffusionModels-paper are comparing it to the libraries listed below
- Stable Diffusion模型训练样例代码☆35Updated 10 months ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆289Updated last month
- pytorch复现stable diffusion☆166Updated last year
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆136Updated 10 months ago
- the repository of A survey on image-text multimodal models☆42Updated last year
- pytorch ddpm demo☆88Updated last year
- Materials for the Hugging Face Diffusion Models Course☆220Updated 2 years ago
- finetune stable diffusion with Dreambooth、LoRA、ControlNet☆56Updated last year
- Diffusion Transformers (DiTs) trained on MNIST dataset☆102Updated last year
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆120Updated 5 months ago
- a super easy clip model with mnist dataset for study☆109Updated last year
- Record some basic training on the stable diffusion series, including Lora, Controlnet, IP-adapter, and a bit of fun AIGC play!☆31Updated 8 months ago
- ☆53Updated 4 months ago
- 本仓库旨在介绍如何通过源码编译的方法成功安装mamba,可解决selective_scan_cuda和本地cuda环境冲突的问题☆80Updated 2 months ago
- Official repository of MLLA (NeurIPS 2024)☆312Updated 5 months ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆180Updated 11 months ago
- A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)☆304Updated last month
- Quality-aware multimodal fusion on ICML 2023☆97Updated last month
- A curated list of balanced multimodal learning methods.☆60Updated last week
- [MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models☆286Updated 2 months ago
- 多模态 MM +Chat 合集☆255Updated 2 months ago
- A collection of awesome text-to-image generation studies.☆585Updated last month
- Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models☆382Updated 2 weeks ago
- 历年ICLR论文和开源项目合集,包含ICLR2021、ICLR2022、ICLR2023、ICLR2024、ICLR2025.☆243Updated last month
- ☆150Updated last year
- 🎮 A Benchmark and Awesome Collection of Methods for Remote Sensing Image-Text Retrieval (RSITR)| Remote Sensing Cross-Model Retrieval (R…☆53Updated last month
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆61Updated 2 months ago
- ☆221Updated last month
- A curated list of papers on the applications of RWKV in computer vision.☆169Updated 2 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆161Updated 3 months ago