jaketae / storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
☆515Updated last year
Alternatives and similar repositories for storyteller:
Users that are interested in storyteller are comparing it to the libraries listed below
- WavJourney: Compositional Audio Creation with LLMs☆531Updated last year
- Text To Video Synthesis Colab☆1,496Updated 11 months ago
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusion☆1,250Updated 8 months ago
- Learn to serve Stable Diffusion models on cloud infrastructure at scale. This Lightning App shows load-balancing, orchestrating, pre-prov…☆392Updated last year
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆765Updated 7 months ago
- ☆694Updated 2 years ago
- Video generation tool for Stable Diffusion.☆33Updated last year
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024☆744Updated last year
- Finetune ModelScope's Text To Video model using Diffusers 🧨☆682Updated last year
- Create AI-Generated Video Tutorials with Character Animation and Slides!☆268Updated last year
- Simple prompt generator for Midjourney, DALLe, Stable and Disco Diffusion, Flux and etc.☆150Updated last month
- The code for the bark-voicecloning model. Training and inference.☆687Updated last year
- ☆147Updated last year
- Text2Cinemagraph: Text-Guided Synthesis of Eulerian Cinemagraphs [SIGGRAPH ASIA 2023]☆383Updated last year
- Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation…☆641Updated last year
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch☆1,953Updated 10 months ago
- Create GIFs and Videos using Stable Diffusion☆222Updated last year
- Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)☆253Updated last year
- ☆538Updated last year
- extending stable diffusion prompts with suitable style cues using text generation☆176Updated 2 years ago
- One-click Face Swapper and Restoration powered by insightface 🔥☆592Updated 10 months ago
- Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"☆995Updated last year
- Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach☆467Updated last year
- [ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction☆933Updated 4 months ago
- ☆100Updated last year
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆576Updated last year
- App that leverages GPT-3 to facilitate new language listening and speaking practice.☆130Updated last year
- ☆132Updated last year
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆69Updated 3 months ago
- ☆406Updated 10 months ago