Real-time Speech Separation, Noise Suppression & Speaker Recognition
☆18Apr 17, 2019Updated 6 years ago
Alternatives and similar repositories for audiovision
Users who are interested in audiovision are comparing it to the repositories listed below
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion.☆11Jun 22, 2020Updated 5 years ago
- ☆16Jan 20, 2021Updated 5 years ago
- Microphone array speech generator (MASG) in room acoustics☆39Jan 2, 2020Updated 6 years ago
- This is my graduation project in BIT. Title: Noise Reduction Using GRU.☆31May 25, 2023Updated 2 years ago
- CS230 Final Project - Audio Super Resolution☆13Jun 18, 2018Updated 7 years ago
- ☆15Jun 15, 2022Updated 3 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Feb 6, 2025Updated last year
- Code and audio files associated with the paper "Speech Enhancement with Variance Constrained Autoencoders" presented at Interspeech 2019☆15Oct 10, 2019Updated 6 years ago
- Sound field estimation based on physics-constrained neural kernel☆21Jun 9, 2025Updated 8 months ago
- DCCRN: Deep Complex Convolution Recurrent Network☆13Nov 26, 2021Updated 4 years ago
- ☆36Feb 23, 2022Updated 4 years ago
- Consistent dictionary learning algorithm for signal declipping (Python code)☆20Oct 24, 2018Updated 7 years ago
- ☆136Oct 25, 2021Updated 4 years ago
- Audio source separation (mixture to vocal) using the Wavenet☆21Sep 6, 2017Updated 8 years ago
- Generalized RNN beamformer for speech separation☆18Jan 11, 2022Updated 4 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆26Jan 11, 2022Updated 4 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- Supercollider Real-Time Sound Spatialization Framework☆23Apr 4, 2015Updated 10 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix) features of a spectrogram…☆25Jul 14, 2020Updated 5 years ago
- This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (…☆28Aug 8, 2022Updated 3 years ago
- ☆60Sep 26, 2020Updated 5 years ago
- MultiSV: scripts for data preparation☆30Jan 18, 2025Updated last year
- Code for the paper: Separate but together: Unsupervised Federated Learning for Speech Enhancement from non-iid data☆42Nov 1, 2021Updated 4 years ago
- PyTorch implementation of LiMuSE☆32Oct 11, 2022Updated 3 years ago
- multi-scale time domain speaker extraction☆71Jun 7, 2021Updated 4 years ago
- The code for the ISMIR 2019 paper “Supervised symbolic music style translation using synthetic data”.☆28Nov 21, 2022Updated 3 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago
- An open-source speech separation and enhancement library☆214May 13, 2020Updated 5 years ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆34Mar 22, 2021Updated 4 years ago
- Python toolbox for decorrelating and upmixing audio signals.☆36Sep 11, 2019Updated 6 years ago
- Implements Python programs to train and test a Recurrent Neural Network with TensorFlow☆72Feb 3, 2020Updated 6 years ago
- Generate audio signals corresponding to moving sources/receivers in a shoebox-shaped room (MATLAB)☆39Jan 25, 2021Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- music demixing with the sliCQ Transform and PyTorch☆34Nov 10, 2023Updated 2 years ago
- Supporting code for the paper "A study on more realistic room simulation for far-field keyword spotting".☆34Oct 27, 2020Updated 5 years ago