A one-stop library to standardize the inference and evaluation of all the conditional video generation models.
☆51Feb 13, 2025Updated last year
Alternatives and similar repositories for VideoGenHub
Users that are interested in VideoGenHub are comparing it to the libraries listed below
Sorting:
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆17Aug 30, 2024Updated last year
- ☆17Jul 30, 2024Updated last year
- Animefy: ComfyUI workflow designed to convert images or videos into an anime-like style automatically.☆22Jul 2, 2024Updated last year
- ☆144Jun 30, 2024Updated last year
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]☆79Jul 1, 2025Updated 8 months ago
- ☆30May 9, 2024Updated last year
- [ACL 2025 Main] Open-source toolkit for automatic evaluation of text-to-image generation task, including training & test datasets and a d…☆16Jul 5, 2025Updated 8 months ago
- ☆20Jun 26, 2024Updated last year
- ☆15Jan 8, 2024Updated 2 years ago
- ☆14Oct 16, 2023Updated 2 years ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- ☆86Aug 21, 2024Updated last year
- FlexiFilm: Long Video Generation with Flexible Conditions☆31May 1, 2024Updated last year
- [NeurIPS 2024] Official Implementation of GrounDiT☆59Dec 12, 2024Updated last year
- ☆19Jul 11, 2024Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Jun 24, 2024Updated last year
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.☆200Jul 22, 2024Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆132Feb 7, 2024Updated 2 years ago
- Code repository for T2V-Turbo and T2V-Turbo-v2☆314Jan 31, 2025Updated last year
- Browser viewer for GaussianAvatars based on Brush☆26Dec 23, 2024Updated last year
- Skybox previewer and generator using BlockadeLabs☆15May 13, 2023Updated 2 years ago
- A curated list of papers and resources for text-to-image evaluation.☆30Sep 6, 2023Updated 2 years ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆21Feb 27, 2025Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …☆162Apr 7, 2024Updated last year
- [SIGGRAPH 2025] Official implementation of 'Motion Inversion For Video Customization'☆153Oct 22, 2024Updated last year
- ☆132Feb 13, 2024Updated 2 years ago
- CogVideoX-LoRAs is a centralized repository for all LoRA models created for CogVideoX, filling the gap for a unified sharing space. With …☆81Dec 4, 2024Updated last year
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆39Feb 17, 2025Updated last year
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated 2 years ago
- Dungeon procedural generator similar to whatabou's "One Page Dungeon"☆50Jan 4, 2026Updated 2 months ago
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation [TMLR 2024]☆260Jul 1, 2024Updated last year
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆44Aug 22, 2024Updated last year
- ☆20Sep 17, 2024Updated last year
- ☆16Apr 23, 2024Updated last year
- SliderSpace: Decomposing the Visual Capabilities of Diffusion Models☆118Nov 25, 2025Updated 3 months ago
- This respository contains the code for the CVPR 2024 paper AVID: Any-Length Video Inpainting with Diffusion Model.☆177Feb 27, 2024Updated 2 years ago
- Interactive Video Generation via Masked-Diffusion☆107Apr 15, 2024Updated last year
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆78Oct 15, 2024Updated last year