SocAIty / SpeechCraft
Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark
☆69 Updated 7 months ago
Alternatives and similar repositories for SpeechCraft
Users who are interested in SpeechCraft are comparing it to the libraries listed below.
- Audio datasets, easier. ☆86 Updated 2 years ago
- Examples of using the llasa-tts models locally ☆182 Updated 9 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ☆108 Updated last month
- ☆72 Updated 6 months ago
- ☆73 Updated 10 months ago
- A Gradio UI for XTTSv2 and RVC. ☆161 Updated last year
- ☆101 Updated last year
- ☆170 Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆123 Updated 7 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆82 Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos ☆46 Updated last year
- Oobabooga extension for Bark TTS ☆120 Updated 2 years ago
- Quick webui for audiocraft ☆169 Updated 10 months ago
- Diffusion_TTS extension for booga ☆69 Updated 5 months ago
- GradioUI for TortoiseTTS voice generation ☆33 Updated 2 years ago
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆161 Updated last year
- Site for sharing Bark voices ☆51 Updated 10 months ago
- A simple extension that uses Bark Text-to-Speech for audio output ☆33 Updated 2 years ago
- ☆100 Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ☆74 Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆66 Updated last year
- A web search extension for Oobabooga's text-generation-webui (now with nougat) ☆72 Updated last year
- Slightly improved official version for finetune xtts ☆71 Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆37 Updated 8 months ago
- Gradio UI for YuE ☆89 Updated 10 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0 ☆56 Updated last year
- Webui for using XTTS and for finetuning it ☆114 Updated last year
- One-shot face animation using webcam, capable of running in real time. ☆41 Updated last year
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui. ☆42 Updated 2 years ago
- XTTSv2 Extension for oobabooga text-generation-webui ☆156 Updated 2 years ago