Yuanshi9815 / Subjects200K
Subjects200K dataset
☆129 · Jan 17, 2025 · Updated last year
Alternatives and similar repositories for Subjects200K
Users who are interested in Subjects200K are comparing it to the repositories listed below:
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer ☆1,903 · Jul 3, 2025 · Updated 7 months ago
- PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024. ☆15 · Jun 4, 2024 · Updated last year
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆108 · Sep 27, 2025 · Updated 4 months ago
- Vico: Compositional Video Generation as Flow Equalization ☆58 · Nov 15, 2024 · Updated last year
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆214 · Sep 27, 2025 · Updated 4 months ago
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease". ☆117 · May 3, 2025 · Updated 9 months ago
- ☆35 · Nov 5, 2024 · Updated last year
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation ☆18 · Sep 1, 2024 · Updated last year
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2. ☆34 · Mar 11, 2025 · Updated 11 months ago
- ☆109 · Nov 27, 2024 · Updated last year
- ☆572 · Nov 26, 2024 · Updated last year
- diffusers with search engine ☆12 · Jan 13, 2026 · Updated last month
- Image Tokenizer Needs Post-Training ☆24 · Oct 4, 2025 · Updated 4 months ago
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (… ☆173 · Feb 27, 2024 · Updated last year
- ☆29 · May 7, 2025 · Updated 9 months ago
- Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Con… ☆476 · Oct 21, 2024 · Updated last year
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections" ☆16 · May 31, 2024 · Updated last year
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning ☆1,350 · Sep 12, 2025 · Updated 5 months ago
- [ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance ☆307 · Jul 30, 2025 · Updated 6 months ago
- Vision Bridge Transformer at Scale ☆139 · Dec 1, 2025 · Updated 2 months ago
- Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning ☆314 · Jul 11, 2024 · Updated last year
- ☆17 · Dec 11, 2024 · Updated last year
- [NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance ☆131 · Oct 13, 2024 · Updated last year
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation ☆44 · May 28, 2024 · Updated last year
- Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers ☆14 · Feb 7, 2025 · Updated last year
- ☆32 · Oct 4, 2025 · Updated 4 months ago
- Trying to implement https://arxiv.org/abs/2305.08891 ☆34 · Jun 10, 2023 · Updated 2 years ago
- [ICCV 25] OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting ☆311 · Oct 23, 2025 · Updated 3 months ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024 ☆758 · Nov 16, 2023 · Updated 2 years ago
- A novel image harmonization method based on Implicit Neural Representation. ☆76 · Oct 29, 2025 · Updated 3 months ago
- Official PyTorch Implementation for Readout Guidance, CVPR 2024 ☆152 · Jun 26, 2025 · Updated 7 months ago
- (ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting" ☆74 · Feb 13, 2025 · Updated last year
- [NeurIPS 2023 Spotlight] Real-World Image Variation by Aligning Diffusion Inversion Chain ☆153 · Jan 2, 2024 · Updated 2 years ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization" ☆661 · Nov 10, 2025 · Updated 3 months ago
- [ICCV 2025] Official implementation for KV-Edit: Training-Free Image Editing for Precise Background Preservation ☆368 · May 21, 2025 · Updated 8 months ago
- Scalable and memory-optimized training of diffusion models ☆1,335 · Jun 4, 2025 · Updated 8 months ago
- More suitable IP-Adapter for the DiT architecture ☆31 · Jul 5, 2024 · Updated last year
- Official code for VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control ☆191 · Dec 31, 2024 · Updated last year
- ☆71 · Nov 18, 2024 · Updated last year