abdeladim-s / easymms
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
☆51Updated last year
Related projects: ⓘ
- ☆62Updated 4 months ago
- ☆161Updated last month
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆41Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆81Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆119Updated 2 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆56Updated 4 months ago
- Speech Diarization for scrum automation☆94Updated last year
- Live-Transcription (STT) with Whisper PoC☆140Updated 3 months ago
- Python bindings for whisper.cpp☆150Updated this week
- Your one-stop solution for voice dataset creation☆106Updated 9 months ago
- Faster Tortoise inference then Tortoise Fast Fork☆122Updated 4 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆103Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆66Updated 11 months ago
- TTS with The Massively Multilingual Speech (MMS) project☆224Updated 2 months ago
- A lightweight end-to-end text-to-speech model☆79Updated this week
- ☆278Updated 2 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆83Updated this week
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆194Updated 3 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆163Updated last week
- zero-shot voice conversion with in context learning☆135Updated this week
- Site for sharing Bark voices☆47Updated 2 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆51Updated 8 months ago
- Meta's "No Language Left Behind" models served as web app and REST API☆171Updated 3 weeks ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆122Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆124Updated 3 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆55Updated 2 weeks ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆29Updated last month
- We Speech Transcript based on LLM, in 300 lines of code.☆117Updated last month
- ☆74Updated 2 months ago
- ☆244Updated 6 months ago