jaketae / storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
β515Updated last year
Alternatives and similar repositories for storyteller:
Users that are interested in storyteller are comparing it to the libraries listed below
- Create GIFs and Videos using Stable Diffusionβ222Updated last year
- Finetune ModelScope's Text To Video model using Diffusers π§¨β681Updated last year
- A large-scale text-to-image prompt gallery dataset based on Stable Diffusionβ1,250Updated 8 months ago
- β694Updated 2 years ago
- β610Updated 2 years ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorchβ765Updated 7 months ago
- Diffusers / Stable Diffusion in docker with a REST API, supporting various models, pipelines & schedulers.β203Updated last year
- Create AI-Generated Video Tutorials with Character Animation and Slides!β268Updated last year
- Rich-Text-to-Image Generationβ776Updated last year
- WavJourney: Compositional Audio Creation with LLMsβ531Updated last year
- Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentationβ¦β641Updated last year
- Unoffical implement for [StyleDrop](https://arxiv.org/abs/2306.00983)β578Updated last year
- Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"β995Updated last year
- Code for Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approachβ467Updated last year
- π BARK INFINITY GUI CMD πΆ Powered Up Bark Text-prompted Generative Audio Modelβ1,004Updated last year
- [ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editingβ1,420Updated last year
- Learn to serve Stable Diffusion models on cloud infrastructure at scale. This Lightning App shows load-balancing, orchestrating, pre-provβ¦β392Updated last year
- The code for the bark-voicecloning model. Training and inference.β687Updated last year
- Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllabilityβ926Updated last year
- β100Updated last year
- General fine tuning for Stable Diffusionβ510Updated last year
- A family of diffusion models for text-to-audio generation.β1,150Updated 2 months ago
- Simple prompt generator for Midjourney, DALLe, Stable and Disco Diffusion, Flux and etc.β150Updated last month
- Text To Video Synthesis Colabβ1,495Updated 11 months ago
- β406Updated 10 months ago
- [SIGGRAPH Asia 2023] An interactive story visualization tool that support multiple charactersβ262Updated 11 months ago
- Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024β744Updated last year
- Remove text from AI-generated imagesβ280Updated 4 months ago
- [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"β1,137Updated last year
- extending stable diffusion prompts with suitable style cues using text generationβ176Updated 2 years ago