dawmro / teller_of_tales
Create a narrated video story from a book chapter using NLP, OpenAI, and Stable Diffusion.
☆18Updated last month
Alternatives and similar repositories for teller_of_tales
Users who are interested in teller_of_tales are comparing it to the libraries listed below.
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆73Updated 2 years ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆49Updated 10 months ago
- XTTSv2 Extension for oobabooga text-generation-webui☆156Updated 2 years ago
- DeepFloyd IF web UI☆30Updated 2 years ago
- Video Diffusion WebUI: Text2Video + Image2Video + Video2Video WebUI☆66Updated last year
- ☆83Updated last year
- ☆17Updated 10 months ago
- ☆18Updated 2 years ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆61Updated last year
- Optimum version of a UI for Stable Diffusion, running on ONNX models for faster inference, working on most common GPU vendors: NVIDIA,AMD…☆26Updated 2 years ago
- Oobabooga extension for Bark TTS☆120Updated 2 years ago
- Integrate image generation capabilities to text-generation-webui using Stable Diffusion.☆58Updated last year
- ☆54Updated 2 years ago
- 🎵 LyricWave – AI Music Composer (Proof of Concept) 🎶 A personal project exploring automatic generation of unique MP4 songs. LyricWave b…☆39Updated 4 months ago
- A custom extension for AUTOMATIC1111/stable-diffusion-webui to extend rest APIs to do some local operations, using in StableStudio.☆48Updated 2 years ago
- ☆78Updated 2 years ago
- ☆72Updated 5 months ago
- A simple extension that uses Bark Text-to-Speech for audio output☆33Updated 2 years ago
- IP Adapter FaceID demo webui☆20Updated 2 years ago
- ☆24Updated last year
- Visual Clip Picker: Trimming Clips by Face Recognition☆47Updated 2 years ago
- Archived 🚧|🌻Building ChatBot with LLMs.🌻 | Using async requests. | Adaptable to multiple LLMs | General-purpose large language model agent framework | Multi-persona, fully type-annotated☆40Updated 2 years ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated 2 years ago
- This is an implementation of iperov's DeepFaceLab and DeepFaceLive in Stable Diffusion Web UI 1111 by AUTOMATIC1111.☆110Updated last year
- Quick webui for audiocraft☆169Updated 10 months ago
- Llama cute voice assistant☆27Updated 2 years ago
- ☆27Updated 2 years ago
- ☆51Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆108Updated 3 weeks ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 5 months ago