ShadowForests / VoiceToSpeechLinks
Live speech recognition to synthesized speech with hundreds of voices, TTS, language auto-translation, and socket support in-browser.
☆43Updated last year
Alternatives and similar repositories for VoiceToSpeech
Users that are interested in VoiceToSpeech are comparing it to the libraries listed below
Sorting:
- ☆75Updated last year
- Real-time end-to-end singing voice convertion☆23Updated last year
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆72Updated 8 months ago
- Google collab for testing SoftVC VITS Singing Voice Conversion for AI capable of changing the singer within music files.☆12Updated 2 years ago
- Retrieval-based-Voice-Conversion ( RVC ) modified and enhanced by codename;0☆13Updated last year
- On-device noise suppression powered by deep learning☆82Updated 2 weeks ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆87Updated 2 years ago
- Fork of AudioLDM as a TuneFlow plugin☆44Updated 2 years ago
- ☆33Updated 9 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Resources on AI applications in the music domain☆20Updated 4 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆49Updated 10 months ago
- ☆101Updated last year
- ☆11Updated 2 years ago
- Speech AI training and inference tools☆36Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- GradioUI for TortoiseTTS voice generation☆33Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆15Updated 4 years ago
- A fast MP3 decoder for python, using minimp3☆30Updated 3 years ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆46Updated last year
- An even smaller speech recognizer / force aligner☆37Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Updated 3 months ago
- Auto-Video maker handling many AI's☆11Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 4 months ago
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆54Updated 2 years ago
- RVC Inference with multiple model and huggingface support☆110Updated 3 months ago
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆88Updated last year
- Your one-stop solution for voice dataset creation☆128Updated 2 years ago