absadiki / easymmsLinks
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
β53Updated 2 years ago
Alternatives and similar repositories for easymms
Users that are interested in easymms are comparing it to the libraries listed below
Sorting:
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ160Updated last year
- Running the F5-TTS by ONNX Runtimeβ181Updated last month
- Faster Tortoise inference then Tortoise Fast Forkβ127Updated last year
- Fine Tune the Style-TTS2 Voice Modelβ256Updated 4 months ago
- β262Updated last year
- Python bindings for whisper.cppβ296Updated last week
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β254Updated last year
- β349Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ346Updated 11 months ago
- A lightweight end-to-end text-to-speech modelβ123Updated 8 months ago
- finetune llm part for spark-tts modelβ111Updated 7 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contaiβ¦β39Updated 7 months ago
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.β589Updated 2 years ago
- openvino version of openai/whisperβ176Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.β119Updated 2 years ago
- Official Implementation of StyleTTSβ453Updated 9 months ago
- Cantonese Text to Speech with VITS implementationβ36Updated 2 years ago
- Your one-stop solution for voice dataset creationβ127Updated last year
- F5-TTS ζ¨ηε ιοΌιεΊ¦ζεηΊ¦4εοΌβ114Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ97Updated last year
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformersβ35Updated 10 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β405Updated last year
- β549Updated last year
- Official implementation of the TTS model Lina-Speechβ170Updated 9 months ago
- β172Updated 10 months ago
- Efficient approach to speaker diarization using voice characteristics extractionβ104Updated 4 months ago
- ONNX Inference of Pyannote Segmentationβ95Updated 10 months ago
- Putting flows on top of neural transducers for better TTSβ64Updated 3 weeks ago
- Voice Conversion With Just Nearest Neighborsβ502Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcriptionβ159Updated last year