KwaiVGI / Koala-36M
Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".
☆117Updated 2 months ago
Alternatives and similar repositories for Koala-36M:
Users that are interested in Koala-36M are comparing it to the libraries listed below
- Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers☆88Updated this week
- ☆221Updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 6 months ago
- ☆59Updated 5 months ago
- The HD-VG-130M Dataset☆114Updated 9 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆67Updated last month
- ☆98Updated 6 months ago
- Subjects200K dataset☆90Updated this week
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆150Updated 3 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆59Updated this week
- UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing☆101Updated 2 months ago
- CCEdit: Creative and Controllable Video Editing via Diffusion Models☆101Updated 7 months ago
- ☆124Updated 3 months ago
- [AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with…☆108Updated 3 months ago
- Code for ROICtrl: Boosting Instance Control for Visual Generation☆99Updated last month
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆31Updated last month
- ☆69Updated 7 months ago
- ☆182Updated last month
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models☆63Updated 4 months ago
- [ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation☆72Updated last month
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"☆67Updated last month
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆70Updated 3 months ago
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆66Updated 3 weeks ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆78Updated 9 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆93Updated 10 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆95Updated 11 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆123Updated 3 months ago
- Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆109Updated this week
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆45Updated 3 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆102Updated last month