kyegomez / StarlightVision
A multimodal AI model that generates high-quality novel videos from text, images, or video clips.
☆64 Updated last year
Alternatives and similar repositories for StarlightVision
Users that are interested in StarlightVision are comparing it to the libraries listed below
- Incredibly descriptive audiovisual summaries for videos ☆41 Updated last year
- Implementation of the premier Text to Video model from OpenAI ☆56 Updated 8 months ago
- ☆31 Updated last year
- ☆79 Updated last year
- ☆204 Updated last year
- ☆25 Updated last year
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort ☆152 Updated 8 months ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆155 Updated last year
- ☆46 Updated last year
- ☆55 Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆86 Updated last year
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting". ☆41 Updated last year
- Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0. ☆87 Updated 2 years ago
- Stable Fashion: A prompt based virtual try on repository ☆89 Updated 2 years ago
- ☆25 Updated last year
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models ☆15 Updated last year
- ☆29 Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions ☆128 Updated last year
- Retrieval-Augmented Video Generation for Telling a Story ☆258 Updated last year
- ☆28 Updated last year
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research ☆51 Updated 6 months ago
- Code for Text2Performer. Paper: Text2Performer: Text-Driven Human Video Generation ☆328 Updated last year
- ☆114 Updated last year
- ☆68 Updated 2 years ago
- ☆54 Updated last year
- ☆13 Updated last year
- Finetune any model on HF in less than 30 seconds ☆57 Updated 2 weeks ago
- ☆63 Updated 2 years ago
- General video interaction platform based on LLMs, including Video ChatGPT ☆252 Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence ☆70 Updated 11 months ago