gallilmaimon / DISSC
Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730
☆128Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for DISSC
- Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation☆104Updated last year
- The official code for the SALMon🍣 benchmark☆40Updated 2 months ago
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆66Updated last month
- An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"☆133Updated last year
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆114Updated 5 months ago
- [AAAI 2024] Code for CTX-vec2wav in UniCATS☆122Updated 5 months ago
- Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion☆134Updated last year
- Official implementation of SpeechSplit2☆128Updated 2 years ago
- Reference-aware automatic speech evaluation toolkit☆109Updated 9 months ago
- ☆50Updated 9 months ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆112Updated 9 months ago
- This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"☆128Updated 11 months ago
- Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS☆162Updated 7 months ago
- CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆183Updated 6 months ago
- Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆104Updated last month
- An 16kHz implementation of HiFi-GAN for soft-vc.☆93Updated last year
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆23Updated 8 months ago
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆160Updated 4 months ago
- Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training☆130Updated last year
- ☆62Updated last year
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆139Updated last year
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆81Updated last year
- The open source code for SimpleSpeech series☆111Updated last month
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆144Updated 10 months ago
- This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)☆205Updated 4 months ago
- ☆62Updated 10 months ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆132Updated 3 months ago
- ☆69Updated last year
- ☆163Updated 2 years ago