aminEdraki / py-intelligibility
Python implementation of a few speech intelligibility prediction algorithms
☆10Updated 3 months ago
Related projects: ⓘ
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated 11 months ago
- ☆18Updated 10 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated 6 months ago
- ☆13Updated 11 months ago
- Da - ECHO - RetrievAl - daTasEt☆22Updated 2 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- ☆15Updated 3 years ago
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021☆37Updated 2 years ago
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆20Updated 3 years ago
- A python implementation of Speech intelligibility in bits (SIIB)☆24Updated 2 years ago
- ☆11Updated last year
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆17Updated last year
- Deep Speech Distances PyTorch☆27Updated 2 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆13Updated 2 weeks ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆29Updated 3 years ago
- ☆18Updated 2 years ago
- ☆47Updated 3 months ago
- ☆57Updated last year
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆21Updated 3 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆49Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆43Updated last month
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆14Updated 10 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 9 months ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44Updated last year
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆22Updated last year
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆23Updated 3 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆28Updated 3 years ago
- This repository contains the audio samples and the source code that accompany the paper: "MixCycle: Unsupervised Speech Separation via Cy…☆23Updated last year
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆10Updated 2 weeks ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆40Updated last month