kyegomez / StarlightVisionLinks
A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.
☆64Updated 2 years ago
Alternatives and similar repositories for StarlightVision
Users that are interested in StarlightVision are comparing it to the libraries listed below
Sorting:
- Incredibly descriptive audiovisual summaries for videos☆41Updated last year
- ☆31Updated 2 years ago
- Implementation of the premier Text to Video model from OpenAI☆56Updated last year
- ☆78Updated 2 years ago
- ☆55Updated 2 years ago
- ☆47Updated last year
- ☆208Updated 2 weeks ago
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort☆150Updated last year
- ☆12Updated last year
- ☆61Updated 2 years ago
- ☆26Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆132Updated 2 years ago
- Stable Fashion: A prompt based virtual try on repository☆89Updated 3 years ago
- Retrieval-Augmented Video Generation for Telling a Story☆259Updated 2 years ago
- Gradio app to track objects in video and add visual effects☆17Updated 6 months ago
- [TMLR23] Official implementation of UnIVAL: Unified Model for Image, Video, Audio and Language Tasks.☆232Updated 2 years ago
- ☆25Updated 2 years ago
- ☆29Updated 2 years ago
- ☆100Updated 2 years ago
- ☆13Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆70Updated last year
- General video interaction platform based on LLMs, including Video ChatGPT☆256Updated 2 years ago
- An open source, layer-based web interface for Collage Diffusion - use a familiar Photoshop-like interface and let the AI harmonize the de…☆68Updated 2 years ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆51Updated 11 months ago
- ☆29Updated 2 years ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆158Updated 2 years ago
- ☆25Updated 2 years ago
- Make-A-Video Latent Diffusion Model☆19Updated 2 years ago
- LCM LoRA☆35Updated 4 months ago
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.☆199Updated last year