Berkeley-Speech-Group / RT-VCLinks
☆15Updated 4 months ago
Alternatives and similar repositories for RT-VC
Users that are interested in RT-VC are comparing it to the libraries listed below
Sorting:
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆17Updated last year
- Speech Resynthesis and Language Modeling☆26Updated 2 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆22Updated 11 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Updated 4 months ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆23Updated last year
- DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-to-Speech☆38Updated 3 weeks ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Updated 2 years ago
- Digital Speech Processing in PyTorch.☆14Updated 3 years ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆38Updated 9 months ago
- ☆13Updated 5 months ago
- Sequence alignement methods with helpers for PyTorch.☆24Updated 2 years ago
- Streaming Vocos☆29Updated 2 months ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Updated last year
- Just another FastSpeech 2 but cleaner code :)☆27Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆28Updated 11 months ago
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆39Updated 2 weeks ago
- A toolkit dedicate for speech evaluation.☆21Updated 11 months ago
- ☆13Updated 9 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Updated last year
- ☆11Updated 3 years ago
- My vocoder experiments☆31Updated last month
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆33Updated last year
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12Updated last year
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆48Updated 2 weeks ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆21Updated last month
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆20Updated 2 weeks ago
- ☆16Updated 11 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago