absadiki / easymmsLinks
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
☆54Updated 2 years ago
Alternatives and similar repositories for easymms
Users that are interested in easymms are comparing it to the libraries listed below
Sorting:
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- A lightweight end-to-end text-to-speech model☆125Updated 9 months ago
- ☆355Updated last year
- Fine Tune the Style-TTS2 Voice Model☆263Updated 6 months ago
- Running the F5-TTS by ONNX Runtime☆184Updated last month
- ☆261Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆256Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆72Updated 5 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆231Updated last year
- Official implementation of the TTS model Lina-Speech☆175Updated 11 months ago
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated last year
- Python bindings for whisper.cpp☆305Updated this week
- Your one-stop solution for voice dataset creation☆128Updated 2 years ago
- ☆275Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆99Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆411Updated last year
- 🐸 - A general purpose model trainer, as flexible as it gets☆230Updated last year
- Official Implementation of StyleTTS☆456Updated 11 months ago
- 🌻 VITS ONNX TTS server designed for fast inference 🔥☆129Updated 10 months ago
- SoTA open-source TTS☆117Updated 6 months ago
- ☆100Updated last year
- Open source inference code for Rev's model☆434Updated 7 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆69Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆97Updated 11 months ago
- Open models for Coqui STT☆148Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 6 months ago
- ☆158Updated 2 years ago
- Community framework for training tortoise☆44Updated 3 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆347Updated last year