ShadowForests / VoiceToSpeechLinks
Live speech recognition to synthesized speech with hundreds of voices, TTS, language auto-translation, and socket support in-browser.
β43Updated last year
Alternatives and similar repositories for VoiceToSpeech
Users that are interested in VoiceToSpeech are comparing it to the libraries listed below
Sorting:
- π¬ "Realtime" voice transcription and cloning using ElevenLabs's API.β54Updated 2 years ago
- β75Updated last year
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)β87Updated 2 years ago
- GradioUI for TortoiseTTS voice generationβ33Updated 2 years ago
- A curated list of awesome OpenAI's Whisperβ102Updated 2 years ago
- text-to-audio-latent-diffusionβ37Updated 2 years ago
- Coqui AI TTS pluginβ85Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ100Updated last year
- Music remixer based on MusicGen-Chordβ102Updated 2 years ago
- β18Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creationsβ49Updated 10 months ago
- Make Kanye sing any song ya want π€π₯β25Updated 2 years ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.β72Updated 8 months ago
- β¨ A real-time voice changer application using WebSockets and ONNX/TensorFlow/PyTorchβ49Updated last year
- On-device speaker recognition engine powered by deep learningβ39Updated last week
- Listen to any audio stream on your machine and print out the transcribed or translated audio.β119Updated 2 years ago
- Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create aβ¦β46Updated 2 years ago
- Implements ML audio separation algorithm on audio from YouTube or Spotify resulting in "stems" for download (e.g. vocals, drums, bass) inβ¦β34Updated last month
- Fork of AudioLDM as a TuneFlow pluginβ43Updated 2 years ago
- A python library to find differences between audio and transcriptionsβ19Updated 2 years ago
- Google collab for testing SoftVC VITS Singing Voice Conversion for AI capable of changing the singer within music files.β12Updated 2 years ago
- A simple voice conversion toolβ19Updated 3 years ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocosβ46Updated last year
- On-device Speech-to-Index engine powered by deep learningβ37Updated 9 months ago
- An even smaller speech recognizer / force alignerβ37Updated last year
- The Open Source AI Musical Toolkitβ46Updated last week
- List of repositories relevant to VITS.β36Updated 2 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelizationβ25Updated 3 years ago
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.β66Updated 3 months ago
- Code for OpenAI Whisper Web App Demoβ93Updated 3 years ago