SocAIty / SpeechCraft
π Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark
β41Updated 2 months ago
Related projects β
Alternatives and complementary repositories for SpeechCraft
- Adds a web API to RVC to infer via json requestsβ17Updated 4 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ30Updated 2 weeks ago
- β51Updated 2 months ago
- Audio datasets, easier.β83Updated last year
- β87Updated 6 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)β64Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ138Updated 4 months ago
- β145Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sβ¦β50Updated 6 months ago
- Faster Tortoise inference then Tortoise Fast Forkβ122Updated 7 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animationβ39Updated 7 months ago
- β77Updated 4 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β115Updated 8 months ago
- β54Updated 10 months ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocosβ45Updated 4 months ago
- Diffusion_TTS extension for boogaβ63Updated 4 months ago
- Oobabooga extension for Bark TTSβ118Updated 11 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on β¦β73Updated last week
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.β41Updated 9 months ago
- One-shot face animation using webcam, capable of running in real time.β31Updated 5 months ago
- Slightly improved official version for finetune xttsβ236Updated 3 weeks ago
- β25Updated 7 months ago
- Hard Reload oobabooga text WebUI extensionsβ15Updated last year
- A simple extension that uses Bark Text-to-Speech for audio outputβ35Updated last year
- β93Updated 3 months ago
- A TTS extension for oobabooga text WebUIβ26Updated 6 months ago
- GradioUI for TortoiseTTS voice generationβ34Updated last year
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0β46Updated 5 months ago
- Collection of the best Applio plugins.β19Updated 2 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β71Updated last month