Li-JEN / PEL-accent-adaptaionLinks
The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"
☆13Updated 2 years ago
Alternatives and similar repositories for PEL-accent-adaptaion
Users that are interested in PEL-accent-adaptaion are comparing it to the libraries listed below
Sorting:
- ☆19Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated 2 years ago
- ☆22Updated last year
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27Updated 6 months ago
- ☆32Updated last year
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆62Updated 2 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆89Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Updated last year
- ☆45Updated 2 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12Updated last year
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆82Updated 2 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated last year
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆51Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Updated last year
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Updated 2 years ago
- List of direct speech-to-speech translation papers.☆38Updated 2 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆45Updated 4 years ago
- ☆31Updated 2 years ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆29Updated 2 years ago
- Collection of works for evaluating (and analyzing) large audio-language models (LALMs)☆40Updated 4 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 7 months ago
- ☆26Updated last year
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Updated 2 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Updated last year
- Phoneme segmentation using pre-trained speech models☆55Updated 3 years ago
- MSP-Podcast Challenge Baseline Code☆30Updated last year
- Official implementation of MelHuBERT☆68Updated last year
- ☆37Updated 3 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆43Updated 2 years ago