NJU-PCALab / OpenVid-1M
☆193Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for OpenVid-1M
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆91Updated 2 weeks ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆141Updated last month
- [NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models☆243Updated 2 weeks ago
- ☆93Updated 4 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆84Updated 4 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆94Updated 6 months ago
- [Arxiv 2024] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion…☆147Updated 7 months ago
- ☆127Updated 2 weeks ago
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆91Updated 2 weeks ago
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆122Updated 5 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)☆218Updated 4 months ago
- ☆123Updated last month
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆240Updated last month
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆340Updated last month
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆50Updated this week
- [Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation☆211Updated 2 weeks ago
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆371Updated 2 months ago
- Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)☆130Updated 6 months ago
- GenEval: An object-focused framework for evaluating text-to-image alignment☆120Updated 3 months ago
- ☆54Updated 3 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆250Updated 3 weeks ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆110Updated last month
- The HD-VG-130M Dataset☆108Updated 7 months ago
- ☆114Updated 4 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆212Updated 3 months ago
- ☆105Updated 8 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆47Updated 2 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆106Updated last month
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆75Updated 7 months ago
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆111Updated this week