sieve-community / describe
Incredibly descriptive audiovisual summaries for videos
☆40 Updated 8 months ago
Alternatives and similar repositories for describe:
Users who are interested in describe are comparing it to the repositories listed below.
- A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips. ☆66 Updated last year
- ☆30 Updated last year
- ☆29 Updated last year
- Gradio app to track objects in video and add visual effects ☆16 Updated 6 months ago
- ☆12 Updated 4 months ago
- ☆25 Updated last year
- ☆28 Updated last year
- A minimalistic, hackable code base to finetune Wan video generation model ☆37 Updated last week
- ☆12 Updated 5 months ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models ☆15 Updated last year
- ☆46 Updated last year
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting". ☆42 Updated 11 months ago
- [WIP] AI Try-On plugin for Chrome ☆27 Updated last year
- Fine-tune of Florence-2 for shot categorization. ☆22 Updated 3 weeks ago
- Community ComfyUI workflows running on fal.ai ☆57 Updated 7 months ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2 ☆118 Updated 4 months ago
- Site for sharing MusicGen + AudioGen Prompts and Creations ☆41 Updated last week
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort ☆150 Updated 4 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆85 Updated last year
- ☆24 Updated last year
- sd3 dreambooth lora training book, adapted from the diffusers doc ☆44 Updated 9 months ago
- ☆12 Updated last year
- LCM LoRA ☆38 Updated last year
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D… ☆35 Updated last month
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance" ☆126 Updated 4 months ago
- ☆78 Updated last year
- Gradio UI for a Cog API ☆66 Updated 11 months ago
- AIPE (AI Pipeline Engine) is a flexible and powerful tool for creating and executing complex AI workflows ☆21 Updated 7 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models. ☆48 Updated last month
- ☆55 Updated last year