SocAIty / SpeechCraftLinks
π Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark
β67Updated 3 months ago
Alternatives and similar repositories for SpeechCraft
Users that are interested in SpeechCraft are comparing it to the libraries listed below
Sorting:
- Audio datasets, easier.β85Updated 2 years ago
- β101Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)β72Updated 2 years ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocosβ46Updated last year
- β67Updated 6 months ago
- One-shot face animation using webcam, capable of running in real time.β38Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β81Updated 11 months ago
- GradioUI for TortoiseTTS voice generationβ34Updated 2 years ago
- Examples of using the llasa-tts models locallyβ180Updated 5 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloningβ145Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ157Updated last year
- β71Updated 2 months ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.β68Updated 11 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β122Updated 3 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ37Updated 4 months ago
- β165Updated 2 years ago
- A Gradio UI for XTTSv2 and RVC.β159Updated last year
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0β54Updated last year
- β99Updated last year
- Quick webui for audiocraftβ165Updated 6 months ago
- Oobabooga extension for Bark TTSβ118Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wildβ60Updated last year
- Windows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wildβ21Updated last year
- β75Updated last year
- Site for sharing Bark voicesβ51Updated 6 months ago
- Full GUI Versionβ31Updated 2 years ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speechβ362Updated 9 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.β42Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sβ¦β52Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on β¦β101Updated 2 weeks ago