dwgnr / speech-conversion
Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE
☆13Updated last year
Related projects: ⓘ
- SandyPanda-MLDL / -Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-ModelsEvaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models☆11Updated last year
- Code for Interspeech2022 paper DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion☆13Updated last year
- ☆22Updated 3 years ago
- Nonparallel Emotional Speech Conversion with MUNIT. Introduction: This is a tensorflow implementation of paper(https://arxiv.org/pdf/1811…☆14Updated 2 years ago
- I-Vector Speaker recognition system implemented with MSRIT in matlab☆14Updated 8 years ago
- Voice Alignment and Conversion with Neural Networks and the WORLD codec.☆20Updated 5 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆13Updated 2 weeks ago
- A probabilistic scoring backend for length-normalized embeddings.☆10Updated 4 months ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.☆25Updated 5 years ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆101Updated last year
- ☆24Updated last year
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆20Updated 3 years ago
- Tensorflow implementation of VQVAE for voice conversion☆12Updated 6 years ago
- ☆39Updated last year
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications☆40Updated 2 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆31Updated last year
- Script to perform statistical significance test between ASR hypotheses.☆19Updated 7 years ago
- This repository is the official implementation of "Unimodal Aggregation for CTC-based Speech Recognition".☆13Updated 9 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆42Updated this week
- Audio based speaker diarization☆16Updated 5 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆15Updated 2 years ago
- ☆47Updated 3 months ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 3 years ago
- ☆18Updated 3 months ago
- ☆26Updated last year
- A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization☆62Updated 2 weeks ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆23Updated last year
- ☆86Updated 3 years ago
- ☆11Updated 8 months ago
- Dual-Path Attention and Recurrent Network for speech separation☆15Updated last week