absadiki / easymmsLinks
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
β54Updated 2 years ago
Alternatives and similar repositories for easymms
Users that are interested in easymms are comparing it to the libraries listed below
Sorting:
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ160Updated last year
- Fine Tune the Style-TTS2 Voice Modelβ264Updated 6 months ago
- β261Updated last year
- Running the F5-TTS by ONNX Runtimeβ188Updated 2 months ago
- Official Implementation of StyleTTSβ456Updated 11 months ago
- β357Updated last year
- TTS with The Massively Multilingual Speech (MMS) projectβ233Updated last year
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β257Updated last year
- Official implementation of the TTS model Lina-Speechβ175Updated last year
- Your one-stop solution for voice dataset creationβ128Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extractionβ105Updated 6 months ago
- ONNX-compatible Fast SeamlessM4TβMassively Multilingual & Multimodal Machine Translationβ43Updated 2 years ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.β589Updated 2 years ago
- Faster Tortoise inference then Tortoise Fast Forkβ128Updated last year
- Python bindings for whisper.cppβ313Updated last week
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3β431Updated last year
- Putting flows on top of neural transducers for better TTSβ64Updated last month
- A lightweight end-to-end text-to-speech modelβ125Updated 10 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ348Updated last year
- β186Updated last year
- β385Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.β356Updated 2 years ago
- F5-TTS ζ¨ηε ιοΌιεΊ¦ζεηΊ¦4εοΌβ120Updated last year
- openvino version of openai/whisperβ180Updated 2 years ago
- ONNX Inference of Pyannote Segmentationβ97Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β411Updated last year
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformersβ36Updated last year
- β175Updated 2 years ago
- Voice Conversion With Just Nearest Neighborsβ507Updated last year
- A ggml (C++) re-implementation of tortoise-ttsβ194Updated last year