meokz / looking-to-listenLinks
Deep neural network (DNN) for noise reduction, removal of background music, and speech separation
☆173Updated 2 years ago
Alternatives and similar repositories for looking-to-listen
Users that are interested in looking-to-listen are comparing it to the libraries listed below
Sorting:
- The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.☆202Updated 3 years ago
 - A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated last year
 - Deep Neural Network for Speaker Count Estimation☆156Updated 5 years ago
 - ESPnet Model Zoo☆256Updated 2 years ago
 - VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Updated last year
 - Speech Denoising with Deep Feature Losses☆189Updated 5 years ago
 - PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆231Updated 3 years ago
 - Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆221Updated 2 years ago
 - Speech noise reduction which was generated using existing post-production techniques implemented in Python☆181Updated 3 years ago
 - A PyTorch implementation of DNN-based source separation.☆305Updated 3 years ago
 - VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆260Updated last year
 - Implementation code of non-parallel sequence-to-sequence VC☆248Updated 2 years ago
 - Audio Denoising with Deep Network Priors☆163Updated 5 years ago
 - Audio Source Separation Without Any Training Data.☆164Updated last year
 - ☆261Updated 2 years ago
 - A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Updated 4 years ago
 - The Cone of Silence:☆155Updated 3 years ago
 - Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆110Updated last year
 - python wrapper for rnnoise library☆48Updated 2 years ago
 - Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Updated 2 years ago
 - Deep learning based speech source separation using Pytorch☆319Updated 4 years ago
 - VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 3 years ago
 - A suite of speech signal processing tools☆241Updated last month
 - Python framework for Speech and Music Detection using Keras.☆107Updated 2 years ago
 - The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems☆272Updated 2 years ago
 - A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆67Updated 6 years ago
 - An open-source speech separation and enhancement library☆213Updated 5 years ago
 - Voice Activity Detection based on Deep Learning & TensorFlow☆369Updated 2 years ago
 - Multi-voice singing voice synthesis☆238Updated 2 years ago
 - Include some core functions and model to handle speech separation☆155Updated 4 years ago