A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
☆142 · Jan 22, 2024 · Updated 2 years ago
Alternatives and similar repositories for Video-BLIP2-Preprocessor
Users that are interested in Video-BLIP2-Preprocessor are comparing it to the libraries listed below.
- Finetune ModelScope's Text To Video model using Diffusers 🧨 ☆699 · Dec 14, 2023 · Updated 2 years ago
- ☆17 · Jul 30, 2024 · Updated last year
- Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2… ☆85 · Apr 22, 2023 · Updated 3 years ago
- Text to Video ☆26 · Mar 28, 2023 · Updated 3 years ago
- Generate images from an initial frame and text ☆37 · Jul 28, 2023 · Updated 2 years ago
- Supercharged BLIP-2 that can handle videos ☆124 · Dec 1, 2023 · Updated 2 years ago
- The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising". ☆308 · Oct 19, 2025 · Updated 8 months ago
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability ☆957 · Nov 11, 2023 · Updated 2 years ago
- Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models" ☆405 · Jul 4, 2023 · Updated 2 years ago
- ☆25 · Apr 15, 2023 · Updated 3 years ago
- AnimationDiff with train ☆125 · Feb 26, 2024 · Updated 2 years ago
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models. ☆1,050 · Aug 21, 2024 · Updated last year
- A modular graph based DataSet implementation for Pytorch ☆38 · May 1, 2026 · Updated last month
- ☆469 · Feb 12, 2024 · Updated 2 years ago
- Retrieval-Augmented Video Generation for Telling a Story ☆258 · Feb 5, 2024 · Updated 2 years ago
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation" ☆862 · Oct 12, 2023 · Updated 2 years ago
- ☆17 · Jun 15, 2022 · Updated 4 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆81 · Apr 10, 2024 · Updated 2 years ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos ☆20 · Mar 3, 2025 · Updated last year
- implementation of AnimateDiff. ☆32 · Jul 14, 2023 · Updated 2 years ago
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties ☆133 · Nov 10, 2024 · Updated last year
- ☆20 · Aug 23, 2025 · Updated 10 months ago
- ☆47 · Mar 12, 2025 · Updated last year
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models ☆544 · Jan 18, 2024 · Updated 2 years ago
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper ☆172 · May 7, 2024 · Updated 2 years ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models ☆953 · Nov 13, 2024 · Updated last year
- RepText: Rendering Visual Text via Replicating 🔥 ☆140 · Jun 7, 2025 · Updated last year
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos ☆28 · Dec 8, 2023 · Updated 2 years ago
- [SIGGRAPH Asia 2024] TrailBlazer: Trajectory Control for Diffusion-Based Video Generation ☆102 · May 31, 2024 · Updated 2 years ago
- ☆78 · May 18, 2026 · Updated last month
- Fine-Grained Open Domain Image Animation with Motion Guidance ☆967 · Oct 18, 2024 · Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion … ☆163 · Apr 7, 2024 · Updated 2 years ago
- [CVPR 2024] ViT-Lens: Towards Omni-modal Representations ☆190 · Feb 3, 2025 · Updated last year
- Stable Video Diffusion Training Code and Extensions. ☆733 · Jul 25, 2024 · Updated last year
- Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR … ☆469 · Feb 11, 2025 · Updated last year
- ☆58 · Apr 24, 2024 · Updated 2 years ago
- ☆31 · Jul 25, 2023 · Updated 2 years ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafter ☆430 · Aug 25, 2025 · Updated 10 months ago
- Rembg is a tool to remove images background. ☆12 · Nov 29, 2022 · Updated 3 years ago