Vyvo-Labs / VyvoTTS
⭐ 186 · Updated last week
Alternatives and similar repositories for VyvoTTS
Users who are interested in VyvoTTS are comparing it to the libraries listed below.
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ⭐ 127 · Updated 2 months ago
- Collection of Open Source Speech Data ⭐ 161 · Updated 3 weeks ago
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency ⭐ 159 · Updated last week
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ⭐ 206 · Updated 3 weeks ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ⭐ 219 · Updated 5 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ⭐ 289 · Updated 4 months ago
- Open TTS models, built for streaming on the edge ⭐ 43 · Updated 7 months ago
- LongCat Audio Tokenizer and Detokenizer ⭐ 112 · Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ⭐ 68 · Updated this week
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ⭐ 286 · Updated 5 months ago
- Audio tokenization, in the fastest way possible! ⭐ 53 · Updated last year
- ⭐ 284 · Updated 3 months ago
- ⭐ 272 · Updated last month
- Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation ⭐ 265 · Updated last week
- ⭐ 314 · Updated 3 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ⭐ 213 · Updated 6 months ago
- VoiceHub: A Unified Inference Interface for TTS Models ⭐ 56 · Updated 2 weeks ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec. ⭐ 102 · Updated 2 weeks ago
- Voxtral: Convert Mistral into an end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP… ⭐ 36 · Updated 7 months ago
- Kyutai with an "eye" ⭐ 222 · Updated 6 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ⭐ 186 · Updated 6 months ago
- ⭐ 234 · Updated 5 months ago
- A lightweight end-to-end text-to-speech model ⭐ 123 · Updated 8 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ⭐ 126 · Updated 3 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ⭐ 131 · Updated 5 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification. ⭐ 24 · Updated 7 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD… ⭐ 183 · Updated last month
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers ⭐ 57 · Updated 5 months ago
- Official implementation of the TTS model Lina-Speech ⭐ 170 · Updated 9 months ago
- A simple, hackable text-to-speech system in PyTorch and MLX ⭐ 176 · Updated 2 months ago