mihirp1998 / VADER
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
☆196Updated last month
Related projects: ⓘ
- Code repository for T2V-Turbo☆166Updated 2 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆199Updated 2 months ago
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)☆146Updated 5 months ago
- CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆210Updated 2 weeks ago
- ☆235Updated last month
- [CVPR2024] VideoBooth: Diffusion-based Video Generation with Image Prompts☆251Updated 3 months ago
- Official code for 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆122Updated 4 months ago
- [SIGGRAPH Asia 2024 (Journal Track)]StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter☆182Updated 2 months ago
- [Arxiv 2024] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion…☆139Updated 5 months ago
- Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)☆127Updated 3 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆90Updated 2 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆144Updated 2 weeks ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆125Updated 7 months ago
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models☆197Updated 8 months ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆155Updated last month
- The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆85Updated last month
- Official Pytorch Implementation for "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024)☆161Updated 5 months ago
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First☆81Updated 3 weeks ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆228Updated 6 months ago
- [CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"☆266Updated 3 months ago
- Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"☆174Updated 2 months ago
- ☆99Updated 6 months ago
- Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step☆133Updated 2 months ago
- ☆90Updated 6 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆93Updated 4 months ago
- TrailBlazer: Trajectory Control for Diffusion-Based Video Generation☆88Updated 3 months ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter☆364Updated 2 months ago
- ☆70Updated 2 months ago
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆87Updated 5 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆118Updated 2 weeks ago