nathanwang000 / genAI_storytellerLinks
Create storybooks generated using generative AI models from using LLMs for text to Stable Diffusion for illustrations (maybe also use text to speech for narration).
☆20Updated last year
Alternatives and similar repositories for genAI_storyteller
Users that are interested in genAI_storyteller are comparing it to the libraries listed below
Sorting:
- ✨ Experience the enchantment of Story Blocks: an open-source project merging AI text generation and image synthesis to create captivating…☆62Updated last year
- Ai generated music video with Riffusion and Gradio☆21Updated 2 years ago
- ☆79Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆71Updated 2 years ago
- Video Diffusion WebUI: Text2Video + Image2Video + Video2Video WebUI☆67Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆64Updated 9 months ago
- A TTS extension for oobabooga text WebUI☆32Updated last year
- Llama cute voice assistant☆28Updated last year
- 1-click launcher for AUTOMATIC1111/stable-diffusion-webui with full SDXL 1.0 support.☆21Updated last year
- ☆27Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆100Updated last week
- ☆83Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆45Updated 3 months ago
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆74Updated last year
- ☆40Updated last year
- ☆92Updated 8 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆54Updated last year
- Diffusion_TTS extension for booga☆67Updated last year
- ☆18Updated last year
- DeepFloyd IF web UI☆30Updated 2 years ago
- A simple, modular, customizable app to help you generate prompts quickly and easily for Stable Diffusion, Midjourney, and Dall-E 2.☆56Updated 2 years ago
- ☆31Updated last year
- A simple extension that uses Bark Text-to-Speech for audio output☆34Updated last year
- ☆73Updated last year
- ☆40Updated last year
- Porting BabyAGI to Oobabooba.☆31Updated 2 years ago
- A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.☆64Updated last year
- ☆231Updated last year
- An interactive storybook built with the help of ChatGPT and Stable Diffusion.☆14Updated 2 years ago
- ☆39Updated last year