JiaojiaoYe1994 / Awesome-DIffusionModels-paper
A curasted list of papers with the topic of Diffusion Models for Multi-Modal
☆25Updated 11 months ago
Alternatives and similar repositories for Awesome-DIffusionModels-paper:
Users that are interested in Awesome-DIffusionModels-paper are comparing it to the libraries listed below
- finetune stable diffusion with Dreambooth、LoRA、ControlNet☆54Updated last year
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆114Updated 2 months ago
- pytorch ddpm demo☆80Updated last year
- Diffusion Transformers (DiTs) trained on MNIST dataset☆81Updated 9 months ago
- Record some basic training on the stable diffusion series, including Lora, Controlnet, IP-adapter, and a bit of fun AIGC play!☆28Updated 5 months ago
- Materials for the Hugging Face Diffusion Models Course☆206Updated last year
- 本仓库旨在介绍如何通过源码编译的方法成功安装mamba,可解决selective_scan_cuda和本地cuda环境冲突的问题☆55Updated 2 months ago
- pytorch复现stable diffusion☆145Updated last year
- a super easy clip model with mnist dataset for study☆89Updated 10 months ago
- the repository of A survey on image-text multimodal models☆41Updated 9 months ago
- 一份pytorch模型训练框架,方便快速设计和开始训练一个模型☆65Updated 2 years ago
- [ICCV 2023 Oral] Official implementation for "DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion."☆221Updated this week
- A collection of awesome text-to-image generation studies.☆493Updated this week
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆263Updated 3 weeks ago
- ☆139Updated last year
- 算法岗笔试面试大全,励志做算法届的《五年高考,三年模拟》!☆277Updated last month
- [ACL 2024 Best Paper] Deciphering Oracle Bone Language with Diffusion Models☆93Updated last month
- Stable Diffusion模型训练样例代码☆25Updated 7 months ago
- 文章阅读记录☆67Updated this week
- [Paper][AAAI2024]Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-modal Structured Representations☆126Updated 6 months ago
- Speechless at the original stable-diffusion☆91Updated 6 months ago
- 多模态 MM +Chat 合集☆239Updated last week
- CV算法工程师面试知识点整理☆26Updated last year
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆115Updated 8 months ago
- [CVPR'24 Highlight] Implementation of "Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models"☆13Updated 4 months ago
- Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models☆235Updated last week
- A collection of papers on Diffusion for Image-to-Image Translation and Style Transfer☆156Updated this week
- Official repository of Agent Attention (ECCV2024)☆578Updated 2 months ago
- 包含程序员面试大厂面试题和面试经验☆116Updated 3 weeks ago
- [MIR-2023-Survey] A continuously updated paper list for multi-modal pre-trained big models☆284Updated this week