Video Diffusion Transformers are In-Context Learners
☆35Jan 6, 2025Updated last year
Alternatives and similar repositories for Video-In-Context
Users that are interested in Video-In-Context are comparing it to the libraries listed below
Sorting:
- Blending Custom Photos with Video Diffusion Transformers☆48Jan 21, 2025Updated last year
- ☆18Mar 21, 2025Updated 11 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆18May 2, 2025Updated 10 months ago
- [CVPR 2024] BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation☆45May 7, 2024Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆69May 18, 2025Updated 10 months ago
- CVPR 2025 Accepted Papers☆24Dec 20, 2025Updated 3 months ago
- Keypad Modules are expensive so to relace them in this video i am going to show you how you can make combinational lock using push button…☆10Mar 30, 2019Updated 6 years ago
- ☆22Mar 7, 2025Updated last year
- Loop your image from output to input in your ComfyUI workflow☆14Jan 16, 2026Updated 2 months ago
- (CVPR 2025) Scailing Down Text Encoders of Text-to-Image Diffusion Models☆52Sep 10, 2025Updated 6 months ago
- This repository extends the mask editor in Comfyui and supports lasso method for applying masks☆14Jul 23, 2025Updated 7 months ago
- ☆13Jul 10, 2024Updated last year
- ☆13Mar 8, 2024Updated 2 years ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆322Mar 30, 2025Updated 11 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆35Mar 12, 2024Updated 2 years ago
- ☆37Jan 26, 2026Updated last month
- Responsible Visual Editing☆15Jul 10, 2024Updated last year
- EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing☆30Mar 29, 2024Updated last year
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 8 months ago
- ☆52Dec 20, 2024Updated last year
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆79Jul 29, 2025Updated 7 months ago
- ☆17May 13, 2025Updated 10 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 7 months ago
- ☆82Oct 13, 2025Updated 5 months ago
- Benchmark dataset and code of MSRVTT-Personalization☆52Nov 10, 2025Updated 4 months ago
- ☆15Mar 30, 2025Updated 11 months ago
- Code for "SePPO: Semi-Policy Preference Optimization for Diffusion Alignment."☆18Oct 7, 2024Updated last year
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Nov 23, 2023Updated 2 years ago
- Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).☆100Feb 11, 2025Updated last year
- Official repository for the paper "Audio ControlNet for Fine-Grained Audio Generation and Editing".☆64Feb 7, 2026Updated last month
- Lora traing script for Lightricks LTX-video☆70Feb 12, 2025Updated last year
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- An experimental node☆44Oct 28, 2025Updated 4 months ago
- ☆16Feb 23, 2025Updated last year
- The official implementation of 'GRID: Visual Layout Generation.'☆21Dec 28, 2024Updated last year
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Jan 26, 2025Updated last year
- [ECCV 2024] Official code for: SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer☆113Jun 30, 2025Updated 8 months ago
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year