SocAIty / SpeechCraft
π Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark
β57Updated last week
Alternatives and similar repositories for SpeechCraft:
Users that are interested in SpeechCraft are comparing it to the libraries listed below
- β58Updated 5 months ago
- Adds a web API to RVC to infer via json requestsβ20Updated 7 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sβ¦β52Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β76Updated 4 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ32Updated 3 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.β42Updated last year
- GradioUI for TortoiseTTS voice generationβ34Updated last year
- Audio datasets, easier.β82Updated last year
- β94Updated 9 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)β65Updated last year
- A Gradio UI for XTTSv2 and RVC.β67Updated 4 months ago
- Collection of the best Applio plugins.β29Updated 5 months ago
- A Gradio UI for XTTSv2 and RVC.β156Updated 8 months ago
- Diffusion_TTS extension for boogaβ65Updated 7 months ago
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ150Updated 7 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficientβ¦β44Updated last week
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animationβ39Updated 10 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on β¦β81Updated 3 weeks ago
- Slightly improved official version for finetune xttsβ70Updated 4 months ago
- Full GUI Versionβ30Updated last year
- AI powered speech denoising and enhancement. Adapted for windows and optimizedβ79Updated 7 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloningβ143Updated last year
- Oobabooga extension for Bark TTSβ118Updated last year
- Windows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wildβ20Updated 9 months ago
- XTTSv2 Extension for oobabooga text-generation-webuiβ151Updated last year
- β65Updated 4 months ago
- β99Updated 6 months ago
- A TTS extension for oobabooga text WebUIβ29Updated 9 months ago
- A UI for the Piper TTSβ79Updated 5 months ago