DiffBlender: Scalable and Composable Multimodal Text-to-Image Diffusion Models
☆46Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for diffblender
Users that are interested in diffblender are comparing it to the libraries listed below
Sorting:
- ☆65Jun 2, 2023Updated 2 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Apr 29, 2025Updated 10 months ago
- Subject-Diffusion:Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning☆316Jul 11, 2024Updated last year
- Official pytorch implementation for SingleInsert☆28Apr 19, 2024Updated last year
- PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation☆37Oct 28, 2024Updated last year
- ☆67Jun 27, 2024Updated last year
- ☆31Jan 7, 2024Updated 2 years ago
- [CVPR2024] CapHuman: Capture Your Moments in Parallel Universes☆100Nov 20, 2024Updated last year
- Self-Contrastive Learning: Single-viewed Supervised Contrastive Framework using Sub-network (AAAI 2023)☆21Oct 28, 2023Updated 2 years ago
- Geometry-aware Novel View Synthesis with Pre-trained 2D Prior☆39Jun 3, 2023Updated 2 years ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- [WACV 2026] PyTorch code for 4D-Animal.☆27Nov 18, 2025Updated 3 months ago
- A PyTorch implementation of the paper "MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis".☆12Jan 16, 2023Updated 3 years ago
- ☆11Jan 16, 2024Updated 2 years ago
- TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views☆18Jan 14, 2026Updated last month
- Responsible Visual Editing☆15Jul 10, 2024Updated last year
- ☆32Jun 26, 2024Updated last year
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- ☆30May 9, 2024Updated last year
- ☆58Apr 11, 2024Updated last year
- ☆16Feb 21, 2025Updated last year
- ☆17Jul 30, 2024Updated last year
- ☆15Sep 10, 2023Updated 2 years ago
- 한국 정부 국가 AI 파운데이션 모델 5개 기관(Upstage, NAVER, SKT, NC, LG)의 공개 모델이 실제로 from scratch로 학습되었는지 검증하는 프로젝트☆47Jan 9, 2026Updated last month
- Repository for the PyOpenGL Project (LaunchPad Mirror)☆16Jul 9, 2019Updated 6 years ago
- [IROS 2025] CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting☆30Jul 25, 2025Updated 7 months ago
- ☆15Jan 8, 2024Updated 2 years ago
- Adaptive Nonlinear Latent Transformation for Conditional Face Editing (ICCV 2023)☆37Jul 30, 2023Updated 2 years ago
- ☆21Nov 21, 2024Updated last year
- ☆15Apr 29, 2025Updated 10 months ago
- [ECCV 2024] Official repository of ECCV 2024 paper: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion M…☆15May 24, 2025Updated 9 months ago
- ☆16Feb 23, 2025Updated last year
- ☆49Feb 9, 2026Updated 3 weeks ago
- ComfyUI-HiggsAudio is now available in ComfyUI, Higgs Audio v2 is a text-audio foundation model from Boson AI.☆22Jul 26, 2025Updated 7 months ago
- ☆18Nov 25, 2023Updated 2 years ago
- ControlNet control image preprocess library☆15Feb 27, 2023Updated 3 years ago
- ☆20Feb 9, 2026Updated 3 weeks ago
- Stable Diffusion-based image manipulation method with a sketch and reference image☆184Apr 23, 2023Updated 2 years ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆298Aug 29, 2025Updated 6 months ago