A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
★142 · Jan 22, 2024 · Updated 2 years ago
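The workflow described above (scan a directory, pick one random frame per video, derive a prompt for it) can be sketched roughly as follows. This is an illustrative outline, not the repository's actual code: the names `find_videos`, `pick_frame_index`, `build_prompts`, the extension set, and the output keys are all hypothetical, and the frame counter and captioner are passed in as callables so the sketch stays independent of any particular video decoder or captioning model.

```python
import random
from pathlib import Path
from typing import Callable, Dict, List, Optional

# Assumed set of extensions; the real script may accept a different list.
VIDEO_EXTENSIONS = {".mp4", ".mov", ".avi", ".mkv", ".webm"}


def find_videos(directory: str) -> List[Path]:
    """Return video files directly inside `directory`, sorted by name."""
    return sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and p.suffix.lower() in VIDEO_EXTENSIONS
    )


def pick_frame_index(frame_count: int, rng: random.Random) -> int:
    """Choose a uniformly random frame index in [0, frame_count)."""
    if frame_count <= 0:
        raise ValueError("video has no frames")
    return rng.randrange(frame_count)


def build_prompts(
    directory: str,
    count_frames: Callable[[Path], int],        # hypothetical: returns total frames
    caption_frame: Callable[[Path, int], str],  # hypothetical: returns a caption
    seed: Optional[int] = None,
) -> List[Dict[str, object]]:
    """Pair each video with one random frame index and a prompt for that frame."""
    rng = random.Random(seed)
    results = []
    for video in find_videos(directory):
        index = pick_frame_index(count_frames(video), rng)
        results.append({
            "video_path": str(video),
            "frame_index": index,
            "prompt": caption_frame(video, index),
        })
    return results
```

In practice `count_frames` would wrap a video decoder and `caption_frame` would wrap an image-captioning model such as BLIP-2; both are left as assumptions here.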
Alternatives and similar repositories for Video-BLIP2-Preprocessor
Users that are interested in Video-BLIP2-Preprocessor are comparing it to the libraries listed below.
- Finetune ModelScope's Text To Video model using Diffusers 🧨 · ★698 · Dec 14, 2023 · Updated 2 years ago
- ★17 · Jul 30, 2024 · Updated last year
- Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2… · ★85 · Apr 22, 2023 · Updated 3 years ago
- Text to Video · ★26 · Mar 28, 2023 · Updated 3 years ago
- Generate images from an initial frame and text · ★37 · Jul 28, 2023 · Updated 2 years ago
- Supercharged BLIP-2 that can handle videos · ★124 · Dec 1, 2023 · Updated 2 years ago
- The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising". · ★308 · Oct 19, 2025 · Updated 7 months ago
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability · ★956 · Nov 11, 2023 · Updated 2 years ago
- Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models" · ★405 · Jul 4, 2023 · Updated 2 years ago
- ★25 · Apr 15, 2023 · Updated 3 years ago
- AnimationDiff with train · ★125 · Feb 26, 2024 · Updated 2 years ago
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models. · ★1,047 · Aug 21, 2024 · Updated last year
- A modular graph based DataSet implementation for Pytorch · ★38 · May 1, 2026 · Updated last month
- ★470 · Feb 12, 2024 · Updated 2 years ago
- Retrieval-Augmented Video Generation for Telling a Story · ★258 · Feb 5, 2024 · Updated 2 years ago
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation" · ★862 · Oct 12, 2023 · Updated 2 years ago
- ★17 · Jun 15, 2022 · Updated 3 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation · ★81 · Apr 10, 2024 · Updated 2 years ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos · ★20 · Mar 3, 2025 · Updated last year
- implementation of AnimateDiff. · ★32 · Jul 14, 2023 · Updated 2 years ago
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties · ★133 · Nov 10, 2024 · Updated last year
- ★20 · Aug 23, 2025 · Updated 9 months ago
- ★47 · Mar 12, 2025 · Updated last year
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models · ★543 · Jan 18, 2024 · Updated 2 years ago
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper · ★170 · May 7, 2024 · Updated 2 years ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models · ★951 · Nov 13, 2024 · Updated last year
- RepText: Rendering Visual Text via Replicating 🔥 · ★140 · Jun 7, 2025 · Updated last year
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos · ★28 · Dec 8, 2023 · Updated 2 years ago
- [SIGGRAPH Asia 2024] TrailBlazer: Trajectory Control for Diffusion-Based Video Generation · ★102 · May 31, 2024 · Updated 2 years ago
- ★79 · May 18, 2026 · Updated 3 weeks ago
- Fine-Grained Open Domain Image Animation with Motion Guidance · ★965 · Oct 18, 2024 · Updated last year
- Stable Video Diffusion Training Code and Extensions. · ★732 · Jul 25, 2024 · Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion … · ★164 · Apr 7, 2024 · Updated 2 years ago
- [CVPR 2024] ViT-Lens: Towards Omni-modal Representations · ★190 · Feb 3, 2025 · Updated last year
- Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR … · ★468 · Feb 11, 2025 · Updated last year
- ★58 · Apr 24, 2024 · Updated 2 years ago
- ★31 · Jul 25, 2023 · Updated 2 years ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter · ★429 · Aug 25, 2025 · Updated 9 months ago
- Rembg is a tool to remove images background. · ★12 · Nov 29, 2022 · Updated 3 years ago