ShadowForests / VoiceToSpeechLinks
Live speech recognition to synthesized speech with hundreds of voices, TTS, language auto-translation, and socket support in-browser.
☆38Updated 11 months ago
Alternatives and similar repositories for VoiceToSpeech
Users that are interested in VoiceToSpeech are comparing it to the libraries listed below
Sorting:
- A Streamlit based web app which targets on converting voices into different languages (Hindi to English (for now)) keeping the voice in…☆8Updated 3 years ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆95Updated last year
- ☆72Updated last year
- List of repositories relevant to VITS.☆35Updated 2 years ago
- Heteronym to Phoneme Parser☆18Updated last year
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last week
- Text prompt steered synthetic audio generators☆47Updated last month
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆53Updated 2 years ago
- Real-time end-to-end singing voice convertion☆22Updated 7 months ago
- A fast MP3 decoder for python, using minimp3☆28Updated 2 years ago
- People who suffer from low vision, sight and visual impairment are not able to see words and letters in ordinary newsprint, books and mag…☆10Updated 4 years ago
- An even smaller speech recognizer / force aligner☆33Updated 5 months ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆73Updated last year
- ☆83Updated 11 months ago
- ☆18Updated 2 years ago
- A simple voice conversion tool☆17Updated 3 years ago
- text-to-audio-latent-diffusion☆37Updated last year
- Simple PyTorch Denoisers for Waveform Audio☆35Updated last month
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆14Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets☆14Updated 3 years ago
- Community framework for training tortoise☆41Updated 2 years ago
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music☆25Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- On-device speaker diarization powered by deep learning☆47Updated this week
- SandyPanda-MLDL / -Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-ModelsEvaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models☆17Updated last year
- web based editor for subtitles and transcripts☆133Updated 9 months ago