Yuanshi9815 / ViBT
Vision Bridge Transformer at Scale
☆126, updated 3 weeks ago
Alternatives and similar repositories for ViBT
Users who are interested in ViBT are comparing it to the repositories listed below.
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance (☆84, updated 3 months ago)
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation (☆146, updated 2 months ago)
- This is the project for 'Any2Caption', Interpreting Any Condition to Caption for Controllable Video Generation (☆50, updated 8 months ago)
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder" (☆113, updated last week)
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening (☆68, updated 7 months ago)
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset (☆238, updated 4 months ago)
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices (☆81, updated 3 weeks ago)
- Implementation Code for Omni-Effects (☆163, updated 2 weeks ago)
- pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation (☆224, updated this week)
- ☆29, updated 9 months ago
- DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder (☆172, updated 2 months ago)
- Transition Models (☆137, updated 2 months ago)
- Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation (☆193, updated last week)
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing (☆43, updated this week)
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback (☆192, updated last week)
- [WACV 2025] MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning (☆96, updated 8 months ago)
- [CVPR 2025 Oral] Alias-free Latent Diffusion Models (official implementation) (☆102, updated 2 weeks ago)
- Official implementation of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025) (☆95, updated 9 months ago)
- Blending Custom Photos with Video Diffusion Transformers (☆48, updated 11 months ago)
- ☆47, updated 8 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151 (☆87, updated 7 months ago)
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis (☆65, updated 7 months ago)
- 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation (☆81, updated 2 weeks ago)
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving (☆259, updated 4 months ago)
- [NeurIPS 2024 D&B Track] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models" (☆73, updated 11 months ago)
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection (☆53, updated 4 months ago)
- Pixel-Space Generative Models (☆284, updated 7 months ago)
- [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing (☆25, updated last month)
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing (☆72, updated 5 months ago)
- Distilling Diversity and Control in Diffusion Models (☆49, updated 7 months ago)