A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
โ143Jan 22, 2024Updated 2 years ago
Alternatives and similar repositories for Video-BLIP2-Preprocessor
Users that are interested in Video-BLIP2-Preprocessor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Finetune ModelScope's Text To Video model using Diffusers ๐งจโ698Dec 14, 2023Updated 2 years ago
- โ17Jul 30, 2024Updated last year
- Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2โฆโ86Apr 22, 2023Updated 3 years ago
- Text to Videoโ26Mar 28, 2023Updated 3 years ago
- Generate images from an initial frame and textโ37Jul 28, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean โข AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Supercharged BLIP-2 that can handle videosโ124Dec 1, 2023Updated 2 years ago
- The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".โ308Oct 19, 2025Updated 6 months ago
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllabilityโ955Nov 11, 2023Updated 2 years ago
- Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"โ404Jul 4, 2023Updated 2 years ago
- โ25Apr 15, 2023Updated 3 years ago
- AnimationDiff with trainโ126Feb 26, 2024Updated 2 years ago
- [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.โ1,048Aug 21, 2024Updated last year
- A modular graph based DataSet implementation for Pytorchโ38Mar 17, 2026Updated last month
- โ470Feb 12, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean โข AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Retrieval-Augmented Video Generation for Telling a Storyโ259Feb 5, 2024Updated 2 years ago
- โ17Jun 15, 2022Updated 3 years ago
- [ICLR 2024] Official pytorch implementation of "ControlVideo: Training-free Controllable Text-to-Video Generation"โ863Oct 12, 2023Updated 2 years ago
- T2VScore: Towards A Better Metric for Text-to-Video Generationโ81Apr 10, 2024Updated 2 years ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videosโ19Mar 3, 2025Updated last year
- implementation of AnimateDiff.โ32Jul 14, 2023Updated 2 years ago
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Propertiesโ132Nov 10, 2024Updated last year
- โ48Mar 12, 2025Updated last year
- [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Modelsโ544Jan 18, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer โข AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paperโ168May 7, 2024Updated last year
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Modelsโ951Nov 13, 2024Updated last year
- RepText: Rendering Visual Text via Replicating ๐ฅโ140Jun 7, 2025Updated 10 months ago
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videosโ28Dec 8, 2023Updated 2 years ago
- [SIGGRAPH Asia 2024] TrailBlazer: Trajectory Control for Diffusion-Based Video Generationโ102May 31, 2024Updated last year
- โ79Dec 12, 2025Updated 4 months ago
- Fine-Grained Open Domain Image Animation with Motion Guidanceโ967Oct 18, 2024Updated last year
- Stable Video Diffusion Training Code and Extensions.โ734Jul 25, 2024Updated last year
- [AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion โฆโ163Apr 7, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer โข AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2024] ViT-Lens: Towards Omni-modal Representationsโ190Feb 3, 2025Updated last year
- Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model (ICLR โฆโ470Feb 11, 2025Updated last year
- โ58Apr 24, 2024Updated 2 years ago
- [ICLR 2024] Code for FreeNoise based on VideoCrafterโ429Aug 25, 2025Updated 8 months ago
- โ31Jul 25, 2023Updated 2 years ago
- Rembg is a tool to remove images background.โ12Nov 29, 2022Updated 3 years ago
- Open-Sora: Democratizing Efficient Video Production for Allโ19Nov 7, 2024Updated last year