ExponentialML / Video-BLIP2-Preprocessor
A simple script that reads a directory of videos, grabs a random frame from each, and automatically generates a prompt (caption) for it.
☆133
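The pipeline the description outlines (scan a video directory, decode one random frame per file, caption it with BLIP-2) can be sketched as below. This is a hypothetical illustration, not the repository's actual code; the BLIP-2 checkpoint name `Salesforce/blip2-opt-2.7b`, the extension list, and the helper names are assumptions.

```python
import random
from pathlib import Path

# Assumed set of extensions to treat as videos (not from the repo).
VIDEO_EXTS = {".mp4", ".avi", ".mov", ".mkv", ".webm"}

def list_videos(video_dir):
    """Collect video files in a directory (non-recursive), sorted by name."""
    return sorted(p for p in Path(video_dir).iterdir()
                  if p.suffix.lower() in VIDEO_EXTS)

def grab_random_frame(video_path):
    """Decode one randomly chosen frame and return it as an RGB array."""
    import cv2  # lazy import: only needed when actually decoding video
    cap = cv2.VideoCapture(str(video_path))
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    cap.set(cv2.CAP_PROP_POS_FRAMES, random.randrange(max(total, 1)))
    ok, frame = cap.read()
    cap.release()
    if not ok:
        raise IOError(f"could not decode a frame from {video_path}")
    return cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)  # OpenCV decodes as BGR

def caption_frame(frame):
    """Generate a caption ('prompt') for one frame with a BLIP-2 model."""
    from PIL import Image
    from transformers import Blip2ForConditionalGeneration, Blip2Processor
    processor = Blip2Processor.from_pretrained("Salesforce/blip2-opt-2.7b")
    model = Blip2ForConditionalGeneration.from_pretrained("Salesforce/blip2-opt-2.7b")
    inputs = processor(images=Image.fromarray(frame), return_tensors="pt")
    out = model.generate(**inputs, max_new_tokens=30)
    return processor.decode(out[0], skip_special_tokens=True).strip()

if __name__ == "__main__":
    for video in list_videos("videos"):  # hypothetical input directory
        frame = grab_random_frame(video)
        print(video.name, "->", caption_frame(frame))
```

In practice the captions would be written to a JSON/CSV sidecar for training rather than printed; the heavy imports are kept inside the functions so the directory-scanning part works without OpenCV or `transformers` installed.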
Alternatives and similar repositories for Video-BLIP2-Preprocessor: users interested in Video-BLIP2-Preprocessor are comparing it to the repositories listed below.
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties (☆118)
- ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024) (☆237)
- Shot2Story: a new multi-shot video understanding benchmark with comprehensive video summaries and detailed shot-level captions (☆117)
- Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-Image Diffusion Models (ICLR 2024) (☆136)
- Official code for "Paragraph-to-Image Generation with Information-Enriched Diffusion Model" (☆102)
- A simple MagicAnimate pipeline including DensePose inference (☆34)
- Retrieval-Augmented Video Generation for Telling a Story (☆253)
- Supercharged BLIP-2 that can handle videos (☆117)
- [CVPR 2024] Intelligent Grimm: Open-ended Visual Storytelling via Latent Diffusion Models (☆237)
- The HD-VG-130M Dataset (☆116)
- [AAAI 2025] Official PyTorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …" (☆157)
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching (☆145)
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort (☆146)
- [IEEE TVCG 2024] Customized Video Generation Using Textual and Structural Guidance (☆188)
- (CVPR 2024) Official code for the paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models" (☆89)
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models (☆132)
- The official implementation of "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising" (☆295)
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation (☆38)
- Implementation of long video generation (☆78)
- [ICLR 2024] Code for FreeNoise based on VideoCrafter (☆398)
- Video Diffusion Alignment via Reward Gradients: improves a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope… (☆232)
- Official implementation of "VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning" (COLM 2024) (☆171)
- AnimateDiff I2V version (☆183)
- Official PyTorch implementation of "VidToMe: Video Token Merging for Zero-Shot Video Editing" (CVPR 2024) (☆214)
- [NeurIPS 2024 Spotlight] Official implementation of "MotionBooth: Motion-Aware Customized Text-to-Video Generation" (☆126)
- [CVPR 2024] VideoBooth: Diffusion-based Video Generation with Image Prompts (☆285)
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models (☆155)
- I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models (☆206)