skirdey / voicerestoreLinks
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
☆196Updated 8 months ago
Alternatives and similar repositories for voicerestore
Users that are interested in voicerestore are comparing it to the libraries listed below
Sorting:
- ☆261Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆218Updated 8 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆325Updated 3 weeks ago
- ☆294Updated 5 months ago
- Fast audio super resolution from 16khz to 48khz.☆167Updated this week
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆131Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆127Updated 5 months ago
- ☆275Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆138Updated 3 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated 2 years ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆187Updated last year
- ☆385Updated last year
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆341Updated 5 months ago
- ☆345Updated 3 months ago
- Official implementation of the TTS model Lina-Speech☆175Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated 2 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- Very fast, accurate speaker diarization☆203Updated last week
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency☆181Updated 2 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆292Updated 7 months ago
- A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.☆414Updated 3 months ago
- Collection of Open Source Speech Data☆164Updated 3 months ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆217Updated 8 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆46Updated 3 months ago
- Real-time Speech-Text Foundation Model Toolkit (wip)☆249Updated 9 months ago
- ☆62Updated last year
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆226Updated 7 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆411Updated last year
- ☆339Updated 4 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆80Updated last year