jaketae / storytellerLinks
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
☆528Updated 2 years ago
Alternatives and similar repositories for storyteller
Users that are interested in storyteller are comparing it to the libraries listed below
Sorting:
- Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)☆263Updated 2 years ago
- ☆546Updated 2 years ago
- Create GIFs and Videos using Stable Diffusion☆224Updated last year
- Diffusers / Stable Diffusion in docker with a REST API, supporting various models, pipelines & schedulers.☆202Updated last year
- TTS with The Massively Multilingual Speech (MMS) project☆235Updated last year
- Learn to serve Stable Diffusion models on cloud infrastructure at scale. This Lightning App shows load-balancing, orchestrating, pre-prov…☆391Updated last year
- WavJourney: Compositional Audio Creation with LLMs☆539Updated last year
- App that leverages GPT-3 to facilitate new language listening and speaking practice.☆130Updated 2 years ago
- a web ui & api for 🤗 diffusers☆593Updated 2 years ago
- A user-friendly tool for generating high-quality text prompts for AI image generation models like Midjourney, DALL-E, Stable Diffusion, a…☆157Updated 2 months ago
- Video generation tool for Stable Diffusion.☆33Updated 2 years ago
- extending stable diffusion prompts with suitable style cues using text generation☆178Updated 2 years ago
- Unofficial Fastapi implementation of Stable-Diffusion API☆83Updated 2 years ago
- ☆697Updated 2 years ago
- A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.☆249Updated 5 months ago
- Text To Video Synthesis Colab☆1,515Updated last year
- ☆149Updated 2 years ago
- Turn text into video using Stable Diffusion and Google FILM☆42Updated 2 years ago
- Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach☆468Updated last year
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆780Updated last year
- The code for the bark-voicecloning model. Training and inference.☆704Updated last year
- Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img, inpaint and upscale4x.☆605Updated 2 years ago
- Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP☆243Updated 3 years ago
- Unofficial demo app for CogVideo☆53Updated 3 years ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆584Updated 2 years ago
- Finetune ModelScope's Text To Video model using Diffusers 🧨☆686Updated last year
- ☆135Updated 2 years ago
- ☆96Updated 2 years ago
- Remove text from AI-generated images☆295Updated 10 months ago
- Deploy Your Own Stable Diffusion Service☆201Updated 11 months ago