Vyvo-Labs / VyvoTTS
★170 · Updated last month
Alternatives and similar repositories for VyvoTTS
Users who are interested in VyvoTTS are comparing it to the libraries listed below.
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ★125 · Updated last month
- Open TTS models, built for streaming on the edge ★43 · Updated 6 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ★68 · Updated 3 weeks ago
- Collection of Open Source Speech Data ★161 · Updated last week
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ★217 · Updated 4 months ago
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency ★139 · Updated this week
- ★253 · Updated last month
- ★283 · Updated 2 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ★280 · Updated 4 months ago
- Audio tokenization, in the fastest way possible! ★53 · Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ★185 · Updated 5 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers ★57 · Updated 4 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ★284 · Updated 4 months ago
- Kyutai with an "eye" ★219 · Updated 6 months ago
- ★62 · Updated last year
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification. ★24 · Updated 6 months ago
- ★304 · Updated 2 months ago
- ★262 · Updated last year
- Official implementation of the TTS model Lina-Speech ★170 · Updated 8 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ★212 · Updated 5 months ago
- ★232 · Updated 4 months ago
- A simple, hackable text-to-speech system in PyTorch and MLX ★174 · Updated 2 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD… ★179 · Updated last week
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ★101 · Updated this week
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ★128 · Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ★124 · Updated 2 months ago
- An unofficial PyTorch implementation of VALL-E ★88 · Updated 2 months ago
- StyleTTS 2 Optimized Training Fork ★33 · Updated 8 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ★42 · Updated 2 weeks ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts). ★85 · Updated this week