SocAIty / SpeechCraft
Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark
★66 · Updated last month
Alternatives and similar repositories for SpeechCraft
Users interested in SpeechCraft are comparing it to the libraries listed below.
- ★101 · Updated last year
- Audio datasets, easier. ★84 · Updated 2 years ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ★37 · Updated 3 months ago
- ★67 · Updated 5 months ago
- ★98 · Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ★122 · Updated 2 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ★80 · Updated 10 months ago
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ★159 · Updated last year
- GradioUI for TortoiseTTS voice generation ★34 · Updated last year
- Examples of using the llasa-tts models locally ★179 · Updated 4 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloning ★146 · Updated last year
- ★75 · Updated last year
- Oobabooga extension for Bark TTS ★119 · Updated last year
- ★70 · Updated 3 weeks ago
- ★166 · Updated 2 years ago
- ★83 · Updated last year
- ★68 · Updated 4 months ago
- ★169 · Updated last year
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui. ★41 · Updated last year
- A Gradio UI for XTTSv2 and RVC. ★160 · Updated last year
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI ★69 · Updated 2 years ago
- XTTSv2 Extension for oobabooga text-generation-webui ★155 · Updated last year
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech ★358 · Updated 8 months ago
- Adds a web API to RVC to infer via json requests ★27 · Updated last year
- ★27 · Updated 2 years ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ★68 · Updated 10 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0 ★54 · Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ★100 · Updated 2 weeks ago
- Diffusion_TTS extension for booga ★66 · Updated last year
- ★149 · Updated 2 years ago