jaketae / storytellerLinks
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
☆534Updated 2 years ago
Alternatives and similar repositories for storyteller
Users that are interested in storyteller are comparing it to the libraries listed below
Sorting:
- WavJourney: Compositional Audio Creation with LLMs☆540Updated 2 years ago
- Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)☆262Updated 2 years ago
- TTS with The Massively Multilingual Speech (MMS) project☆235Updated last year
- Create GIFs and Videos using Stable Diffusion☆225Updated last year
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆587Updated 2 years ago
- The code for the bark-voicecloning model. Training and inference.☆709Updated 2 years ago
- ☆549Updated 2 years ago
- Unofficial demo app for CogVideo☆53Updated 3 years ago
- Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img, inpaint and upscale4x.☆603Updated 2 years ago
- a web ui & api for 🤗 diffusers☆597Updated 2 years ago
- Diffusers / Stable Diffusion in docker with a REST API, supporting various models, pipelines & schedulers.☆204Updated 2 years ago
- extending stable diffusion prompts with suitable style cues using text generation☆179Updated 3 years ago
- Video generation tool for Stable Diffusion.☆33Updated 2 years ago
- A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.☆252Updated 10 months ago
- Text To Video Synthesis Colab☆1,516Updated last year
- A user-friendly tool for generating high-quality text prompts for AI image generation models like Midjourney, DALL-E, Stable Diffusion, a…☆164Updated 7 months ago
- App that leverages GPT-3 to facilitate new language listening and speaking practice.☆130Updated 2 years ago
- Learn to serve Stable Diffusion models on cloud infrastructure at scale. This Lightning App shows load-balancing, orchestrating, pre-prov…☆391Updated 2 years ago
- A family of diffusion models for text-to-audio generation.☆1,227Updated 6 months ago
- Remove text from AI-generated images☆307Updated last year
- ☆233Updated 2 years ago
- Deploy Your Own Stable Diffusion Service☆201Updated last year
- ☆536Updated 2 years ago
- Text prompt steered synthetic audio generators☆52Updated 9 months ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆794Updated last year
- ☆148Updated 2 years ago
- ☆63Updated 3 years ago
- TorToiSe fine-tuning with DLAS☆226Updated last year
- Site for sharing Bark voices☆51Updated 10 months ago
- Notebook and tools for end-to-end automation of music video production with generative AI☆216Updated 2 years ago