abdeladim-s / easymms
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
โ52Updated last year
Related projects โ
Alternatives and complementary repositories for easymms
- โ176Updated last month
- ๐ ๐ค Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningโ138Updated 4 months ago
- โ87Updated 6 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.ioโ66Updated last year
- โ73Updated last month
- ONNX-compatible Fast SeamlessM4TโMassively Multilingual & Multimodal Machine Translationโ40Updated last year
- VALL-E 2 reproductionโ87Updated 4 months ago
- Efficient approach to speaker diarization using voice characteristics extractionโ68Updated 6 months ago
- Faster Tortoise inference then Tortoise Fast Forkโ122Updated 7 months ago
- ๐ Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. ๐ง๐ฅ๐ Advanced audio processing.โ209Updated 5 months ago
- A ggml (C++) re-implementation of tortoise-ttsโ159Updated 3 months ago
- โ254Updated 8 months ago
- โ77Updated 4 months ago
- Your one-stop solution for voice dataset creationโ112Updated 11 months ago
- ONNX Inference of Pyannote Segmentationโ66Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPโฆโ83Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesโ84Updated 6 months ago
- The Code shows How to Transcribe Audio to text using the fairseq_meta_mms (Google Colab Version)๐โ18Updated last year
- Text to speech alignment using CTC forced alignmentโ141Updated 3 weeks ago
- Barkify: an unoffical training implementation of Bark TTS by suno-aiโ126Updated last year
- โ296Updated 4 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,โฆโ43Updated last month
- C++ library for converting text to phonemes for Piperโ89Updated 8 months ago
- VoiceBox neural network implementationโ96Updated 3 months ago
- TorToiSe fine-tuning with DLASโ218Updated 3 months ago
- โ171Updated 11 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the โฆโ32Updated last week
- Pseudo Streaming SenseVoice with Hotwordsโ85Updated 2 weeks ago
- Official implementation of the TTS model Lina-Speechโ137Updated last week