sieve-community / describe
Incredibly descriptive audiovisual summaries for videos
☆40Updated 7 months ago
Alternatives and similar repositories for describe:
Users that are interested in describe are comparing it to the libraries listed below
- A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.☆66Updated last year
- Gradio app to track objects in video and add visual effects☆16Updated 5 months ago
- ☆30Updated last year
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D…☆34Updated last month
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆146Updated 3 months ago
- ☆12Updated 11 months ago
- ☆12Updated 3 months ago
- ☆28Updated last year
- AIPE (AI Pipeline Engine) is a flexible and powerful tool for creating and executing complex AI workflows☆21Updated 6 months ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆115Updated 3 months ago
- 🍳 AyaMCooking is a Voice-to-Voice Mutli-lingual RAG Agent that makes a perfect sous chef for your kitchen, in upto 10 Languages 🤌🧑🍳☆21Updated 4 months ago
- ☆12Updated 4 months ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆14Updated last year
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting".☆42Updated 10 months ago
- ☆46Updated last year
- ☆29Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts.☆46Updated 3 months ago
- ☆70Updated 5 months ago
- Community ComfyUI workflows running on fal.ai☆56Updated 6 months ago
- [CVPR 2025] A Hierarchical Movie Level Dataset for Long Video Generation☆39Updated 2 months ago
- ☆25Updated last year
- ☆18Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated 11 months ago
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆55Updated 3 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆135Updated last month
- sd3 dreambooth lora training book, adapted from the diffusers doc☆43Updated 8 months ago
- ☆55Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆47Updated 2 weeks ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 3 months ago
- ☆36Updated last year