abdeladim-s / easymms

A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project

☆52

Related projects ⓘ

Alternatives and complementary repositories for easymms

IIEleven11 / StyleTTS2FineTune
☆176Updated last month
sidharthrajaram / StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
☆138Updated 4 months ago
NeuralVox / StyleTTS2
☆87Updated 6 months ago
0417keito / VALL-E-X-Trainer-by-CustomData
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
☆66Updated last year
anhnh2002 / XTTSv2-Finetuning-for-New-Languages
☆73Updated last month
fabio-sim / Fast-SeamlessM4T-ONNX
ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation
☆40Updated last year
ex3ndr / supervoice-vall-e-2
VALL-E 2 reproduction
☆87Updated 4 months ago
KoljaB / WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
☆68Updated 6 months ago
manmay-nakhashi / tortoise-tts-fastest
Faster Tortoise inference then Tortoise Fast Fork
☆122Updated 7 months ago
davidmartinrius / speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆209Updated 5 months ago
balisujohn / tortoise.cpp
A ggml (C++) re-implementation of tortoise-tts
☆159Updated 3 months ago
PolyAI-LDN / pheme
☆254Updated 8 months ago
camenduru / coqui-XTTS-colab
☆77Updated 4 months ago
rioharper / VocalForge
Your one-stop solution for voice dataset creation
☆112Updated 11 months ago
pengzhendong / pyannote-onnx
ONNX Inference of Pyannote Segmentation
☆66Updated 2 months ago
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆83Updated last month
hedrergudene / asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆84Updated 6 months ago
epk2112 / fairseq_meta_mms_Google_Colab_implementation
The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)👇
☆18Updated last year
MahmoudAshraf97 / ctc-forced-aligner
Text to speech alignment using CTC forced alignment
☆141Updated 3 weeks ago
anyvoiceai / Barkify
Barkify: an unoffical training implementation of Bark TTS by suno-ai
☆126Updated last year
coqui-ai / xtts-streaming-server
☆296Updated 4 months ago
tonychenxyz / emoknob
This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…
☆43Updated last month
rhasspy / piper-phonemize
C++ library for converting text to phonemes for Piper
☆89Updated 8 months ago
ex3ndr / supervoice-voicebox
VoiceBox neural network implementation
☆96Updated 3 months ago
152334H / DL-Art-School
TorToiSe fine-tuning with DLAS
☆218Updated 3 months ago
WeberJulian / AI-voice-chat
☆171Updated 11 months ago
AGENDD / RWKV-ASR
This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …
☆32Updated last week
pengzhendong / streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
☆85Updated 2 weeks ago
theodorblackbird / lina-speech
Official implementation of the TTS model Lina-Speech
☆137Updated last week