nathanwang000 / genAI_storyteller
Create storybooks generated using generative AI models from using LLMs for text to Stable Diffusion for illustrations (maybe also use text to speech for narration).
☆20Updated last year
Alternatives and similar repositories for genAI_storyteller:
Users that are interested in genAI_storyteller are comparing it to the libraries listed below
- ✨ Experience the enchantment of Story Blocks: an open-source project merging AI text generation and image synthesis to create captivating…☆60Updated last year
- ☆79Updated last year
- ☆40Updated last year
- ☆44Updated last year
- Auto-Video maker handling many AI's☆10Updated last year
- A TTS extension for oobabooga text WebUI☆31Updated 11 months ago
- ☆17Updated last year
- ☆40Updated last year
- 1-click launcher for AUTOMATIC1111/stable-diffusion-webui with full SDXL 1.0 support.☆21Updated last year
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆44Updated 8 months ago
- ☆27Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆66Updated last year
- ☆28Updated last year
- A collection of handy helpers for AI art generation, AI writing and other experimental tools☆51Updated 6 months ago
- ☆83Updated 9 months ago
- Diffusion_TTS extension for booga☆67Updated 10 months ago
- ☆22Updated last year
- ☆22Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆45Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆59Updated 6 months ago
- ☆30Updated last year
- ☆54Updated last year
- This project aims to bring a more stable and user friendly check GPT interface designed to allow others to implement their own GPT prompt…☆12Updated last year
- Video Voiceover with gpt-4o-mini☆33Updated 6 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆95Updated last week
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆51Updated 10 months ago
- A simple extension that uses Bark Text-to-Speech for audio output☆35Updated last year
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- A simple framework for using a local Koboldcpp LLM to help with story-writing☆21Updated last year
- ☆24Updated last year