nii-yamagishilab / vctk-silence-labels
☆25Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for vctk-silence-labels
- ☆27Updated last year
- ☆22Updated 7 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆49Updated 2 weeks ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆82Updated 2 months ago
- ☆47Updated last week
- ☆87Updated 2 years ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆57Updated 3 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆39Updated last week
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆37Updated 3 years ago
- ☆30Updated last year
- ☆64Updated last year
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- ☆48Updated 5 months ago
- ☆12Updated last year
- ☆29Updated last year
- ☆62Updated last year
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆28Updated last year
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆27Updated last year
- Objective metrics used in several text-to-speech (TTS) papers.☆46Updated 2 years ago
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- ☆33Updated 2 years ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆78Updated 4 months ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆24Updated last month
- ☆48Updated last year
- ☆22Updated 2 years ago
- Implementation of SpatialCodec.☆54Updated last year
- ☆21Updated 6 months ago
- ☆67Updated 3 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆34Updated 11 months ago