wannaphong / ttsmms
TTS with The Massively Multilingual Speech (MMS) project
☆229 · Updated 9 months ago
Alternatives and similar repositories for ttsmms:
Users who are interested in ttsmms are comparing it to the libraries listed below.
- ☆230 · Updated last year
- Faster Tortoise inference than Tortoise Fast Fork ☆128 · Updated 11 months ago
- TorToiSe fine-tuning with DLAS ☆218 · Updated 8 months ago
- ☆156 · Updated last year
- Audio datasets, easier. ☆83 · Updated last year
- ☆36 · Updated last year
- ☆254 · Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆111 · Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆135 · Updated last year
- The code for the bark-voicecloning model. Training and inference. ☆694 · Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s… ☆52 · Updated 11 months ago
- ☆83 · Updated 9 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆159 · Updated 8 months ago
- Oobabooga extension for Bark TTS ☆118 · Updated last year
- ☆147 · Updated last year
- Fast TorToiSe inference (5x or your money back!) ☆807 · Updated 9 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆145 · Updated last year
- Code showing how to transcribe audio to text using fairseq_meta_mms (Google Colab version) ☆18 · Updated last year
- Code for OpenAI Whisper Web App Demo ☆93 · Updated 2 years ago
- ☆216 · Updated 3 weeks ago
- ☆166 · Updated last year
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project ☆52 · Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui ☆152 · Updated last year
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance. ☆578 · Updated last year
- Transcription with speaker diarization pipeline ☆92 · Updated last year
- openai/whisper + extra features ☆89 · Updated 2 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation ☆247 · Updated last year
- Performant and accurate speech recognition built on PyTorch ☆253 · Updated 2 years ago
- ☆269 · Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 · Updated 11 months ago