lukaszliniewicz / VoiceCraft_APILinks
Windows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wild
☆19Updated last year
Alternatives and similar repositories for VoiceCraft_API
Users that are interested in VoiceCraft_API are comparing it to the libraries listed below
Sorting:
- ☆67Updated 2 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆96Updated 2 weeks ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆79Updated 7 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆53Updated 11 months ago
- Diffusion_TTS extension for booga☆68Updated 11 months ago
- XTTSv2 Extension for oobabooga text-generation-webui☆34Updated 10 months ago
- An API for VoiceCraft.☆25Updated 11 months ago
- ☆50Updated 6 months ago
- SoTA open-source TTS for Audiobook and Podcast Generation☆17Updated this week
- ☆52Updated 2 months ago
- A SwarmUI extension that adds parameters for ReActor to the the generate tab☆22Updated last week
- ACE-Step: A Step Towards Music Generation Foundation Model☆40Updated 2 weeks ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆37Updated last year
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆40Updated 4 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆15Updated 3 months ago
- This project aims to bring a more stable and user friendly check GPT interface designed to allow others to implement their own GPT prompt…☆12Updated last year
- A web search extension for Oobabooga's text-generation-webui (now with nougat)☆74Updated 10 months ago
- A simple extension that uses Bark Text-to-Speech for audio output☆35Updated last year
- An easy-to-use image editor extension for Stable Diffusion Web UI☆45Updated last year
- ☆26Updated last year
- A plugin for Oobabooga TextUI that allows you to search multiple search engines. Initially we're using Google API or DuckDuckGo.☆16Updated 2 years ago
- Node to load LLM, it can be used to generate prompt or enhance them☆66Updated last month
- Supercharge your AI/LLM prompts☆79Updated 7 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 2 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆44Updated 2 months ago
- Bridging wrapper for llama-cpp-python within ComfyUI☆56Updated 11 months ago
- Easily download and archive content from Civitai☆63Updated 3 weeks ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated last year
- A collection of simple training GUIs for SD1.5 and SDXL.☆47Updated last year
- A very basic bot for generating Stable Diffusion images via the text-generation-webui☆73Updated last year