TTS-Research / PEL-TTS
☆14Updated last year
Related projects: ⓘ
- ☆11Updated 2 weeks ago
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆17Updated last year
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆14Updated 2 months ago
- Taiwanese Speech Synthesis with Tacotron2☆18Updated last year
- ☆11Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆12Updated last year
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆11Updated last month
- A spoken version of the textual story cloze benchmark☆12Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆20Updated last month
- ☆33Updated 5 months ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆32Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Updated 10 months ago
- A neural speech codec based on discrete WavLM representations☆14Updated 3 weeks ago
- ☆30Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆40Updated 2 months ago
- text to speech☆10Updated 6 months ago
- Collection of scripts from mHuBERT-147.☆21Updated 2 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 9 months ago
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆10Updated 3 weeks ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated 11 months ago
- GPT for FACodec☆13Updated 5 months ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆41Updated last year
- Phonemes and durations labeling based on whisper small☆12Updated 2 months ago
- wake-up word emotion recognition [APSIPA 2022]☆17Updated last year
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆10Updated 2 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated last month
- Survey on speech generation work.☆11Updated 9 months ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆11Updated 6 months ago