mihirp1998 / VADERView external linksLinks
Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various reward models such as HPS, PickScore, VideoMAE, VJEPA, YOLO, Aesthetics etc.
☆306Mar 12, 2025Updated 11 months ago
Alternatives and similar repositories for VADER
Users that are interested in VADER are comparing it to the libraries listed below
Sorting:
- Code repository for T2V-Turbo and T2V-Turbo-v2☆310Jan 31, 2025Updated last year
- Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis☆645May 24, 2024Updated last year
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆420Sep 24, 2025Updated 4 months ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆310Nov 1, 2024Updated last year
- [CVPR 2025] Consistent and Controllable Image Animation with Motion Diffusion Models☆294May 17, 2025Updated 8 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated last year
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆262Apr 7, 2025Updated 10 months ago
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆503Sep 2, 2024Updated last year
- [AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆374Mar 26, 2025Updated 10 months ago
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆661Nov 10, 2025Updated 3 months ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation☆394May 30, 2025Updated 8 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆321Aug 10, 2024Updated last year
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆111Dec 4, 2025Updated 2 months ago
- (CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models☆1,202Aug 7, 2025Updated 6 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆512Dec 11, 2024Updated last year
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆625Oct 29, 2025Updated 3 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆180Jan 30, 2026Updated 2 weeks ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆613Jul 1, 2025Updated 7 months ago
- [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,876Jan 8, 2026Updated last month
- ☆636May 24, 2024Updated last year
- Official Implementation of VideoDPO☆160Jun 1, 2025Updated 8 months ago
- SEED-Voken: A Series of Powerful Visual Tokenizers☆993Nov 25, 2025Updated 2 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆85May 4, 2025Updated 9 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆90May 12, 2025Updated 9 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆80Apr 10, 2024Updated last year
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆242Apr 6, 2024Updated last year
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation☆516Jun 17, 2025Updated 7 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 4 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆638Oct 16, 2025Updated 3 months ago
- ☆414Mar 10, 2025Updated 11 months ago
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,634Sep 25, 2024Updated last year
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆286Dec 4, 2024Updated last year
- ☆13Jul 10, 2024Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆71Jul 16, 2025Updated 6 months ago
- VideoSys: An easy and efficient system for video generation☆2,017Aug 27, 2025Updated 5 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆671Updated this week
- Next-Token Prediction is All You Need☆2,339Jan 12, 2026Updated last month
- ☆66Jun 4, 2024Updated last year
- Scalable and memory-optimized training of diffusion models☆1,335Jun 4, 2025Updated 8 months ago