bytedance / ContentV
☆132 Updated 7 months ago
Alternatives and similar repositories for ContentV
Users who are interested in ContentV are comparing it to the libraries listed below.
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆121 Updated 11 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search ☆100 Updated 4 months ago
- Glance: Accelerating Diffusion Models with 1 Sample ☆152 Updated last month
- VideoCoF: Unified Video Editing with Temporal Reasoner ☆134 Updated last month
- ☆141 Updated 3 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset ☆244 Updated 5 months ago
- Finetuning and inference tools for the CogView4 and CogVideoX model series. ☆114 Updated 8 months ago
- DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder ☆178 Updated 4 months ago
- Blending Custom Photos with Video Diffusion Transformers ☆48 Updated last year
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe… ☆118 Updated 4 months ago
- Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback ☆222 Updated last week
- ☆81 Updated 3 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder". ☆126 Updated last month
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models ☆112 Updated 4 months ago
- DiT for VAE (and Video Generation) ☆35 Updated last year
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers" ☆165 Updated last year
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou… ☆33 Updated 4 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline ☆130 Updated 9 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆184 Updated 10 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving ☆269 Updated 6 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis" ☆78 Updated 5 months ago
- ☆92 Updated 5 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ☆165 Updated 7 months ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation ☆148 Updated 3 months ago
- 4-steps distilled version of Wan2.2-TI2V-5B ☆133 Updated last week
- Concat-ID: Towards Universal Identity-Preserving Video Synthesis ☆65 Updated 8 months ago
- [ICML 2025] Official Implementation of Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots ☆29 Updated 8 months ago
- Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing" ☆76 Updated last month
- An official implementation of SwapAnyone. ☆74 Updated 10 months ago
- [ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache ☆54 Updated last week