jaketae / storytellerLinks
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
☆525Updated last year
Alternatives and similar repositories for storyteller
Users that are interested in storyteller are comparing it to the libraries listed below
Sorting:
- Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)☆261Updated 2 years ago
- WavJourney: Compositional Audio Creation with LLMs☆538Updated last year
- Create GIFs and Videos using Stable Diffusion☆224Updated last year
- ☆543Updated last year
- TTS with The Massively Multilingual Speech (MMS) project☆233Updated last year
- App that leverages GPT-3 to facilitate new language listening and speaking practice.☆131Updated 2 years ago
- a web ui & api for 🤗 diffusers☆592Updated 2 years ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆582Updated 2 years ago
- The code for the bark-voicecloning model. Training and inference.☆703Updated last year
- extending stable diffusion prompts with suitable style cues using text generation☆176Updated 2 years ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆776Updated 11 months ago
- A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.☆249Updated 4 months ago
- Video generation tool for Stable Diffusion.☆33Updated 2 years ago
- ☆231Updated last year
- Text To Video Synthesis Colab☆1,514Updated last year
- Audio datasets, easier.☆84Updated last year
- Learn to serve Stable Diffusion models on cloud infrastructure at scale. This Lightning App shows load-balancing, orchestrating, pre-prov…☆391Updated last year
- Create AI-Generated Video Tutorials with Character Animation and Slides!☆281Updated last year
- Diffusers / Stable Diffusion in docker with a REST API, supporting various models, pipelines & schedulers.☆203Updated last year
- TorToiSe fine-tuning with DLAS☆225Updated 11 months ago
- ☆36Updated 2 years ago
- A family of diffusion models for text-to-audio generation.☆1,179Updated 6 months ago
- ☆696Updated 2 years ago
- Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach☆466Updated last year
- Create amazing Stable Diffusion prompts with minimal prompt knowledge. A vicuna based prompt engineering tool for stable diffusion☆90Updated 2 years ago
- Text prompt steered synthetic audio generators☆47Updated 3 months ago
- ☆149Updated 2 years ago
- Unofficial Fastapi implementation of Stable-Diffusion API☆82Updated 2 years ago
- ☆135Updated last year
- Add caption to any video☆199Updated last year