absadiki / easymms
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
☆52Updated 2 years ago
Alternatives and similar repositories for easymms
Users that are interested in easymms are comparing it to the libraries listed below
- A lightweight end-to-end text-to-speech model☆115Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- Running the F5-TTS by ONNX Runtime☆161Updated last week
- ☆239Updated 3 weeks ago
- ☆260Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- Text-to-speech with multilingual support (20+ languages)☆47Updated 2 years ago
- Cantonese Text to Speech with VITS implementation☆31Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆97Updated 3 weeks ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆246Updated last year
- 🌻 VITS ONNX TTS server designed for fast inference 🔥☆127Updated 5 months ago
- Faster Tortoise inference than Tortoise Fast Fork☆128Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆35Updated 3 months ago
- Speech Diarization for scrum automation☆108Updated last year
- Your one-stop solution for voice dataset creation☆120Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆115Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆100Updated 9 months ago
- Open models for Coqui STT☆141Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- ☆62Updated 11 months ago
- SenseVoice-python: An enterprise-grade open-source multilingual ASR system from FunASR, with onnxruntime☆96Updated 9 months ago
- ☆156Updated 7 months ago
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and transformers☆31Updated 7 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆65Updated 3 weeks ago
- ☆336Updated last year
- Official implementation of the TTS model Lina-Speech☆166Updated 6 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆34Updated last month
- openvino version of openai/whisper☆168Updated last year
- TTS support with GGML☆127Updated 2 weeks ago