HJ-Ok / AudioBERT
AudioBERT 📢 : Audio Knowledge Augmented Language Model
☆14Updated this week
Related projects: ⓘ
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆22Updated this week
- Codebase and project page for EDMSound☆29Updated 10 months ago
- Collection of scripts from mHuBERT-147.☆21Updated 2 months ago
- ☆21Updated last year
- ☆35Updated 3 weeks ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆13Updated last month
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆45Updated 2 months ago
- ☆33Updated 5 months ago
- ☆28Updated this week
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆14Updated last year
- GPT for FACodec☆13Updated 5 months ago
- Zero-Shot Emotion Style Transfer☆33Updated 5 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆50Updated 10 months ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 7 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆40Updated 2 months ago
- ☆27Updated 6 months ago
- 4G GPU & 10 Minutes for train☆12Updated last year
- Supervoice diffusion enhance☆24Updated 2 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- Unofficial implementation of wavenext vocoder☆28Updated 3 weeks ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆107Updated last month
- An unofficial PyTorch implementation of VALL-E☆68Updated last week
- ☆58Updated 10 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆14Updated 3 weeks ago
- ☆48Updated last month
- GPT-style network for phonemization with durations of text☆61Updated 6 months ago
- ☆37Updated 3 weeks ago
- Unsupervised Rhythm Modeling for Voice Conversion☆78Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆18Updated 2 months ago
- ☆12Updated 9 months ago