meokz / looking-to-listenView external linksLinks
Deep neural network (DNN) for noise reduction, removal of background music, and speech separation
☆173Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for looking-to-listen
Users that are interested in looking-to-listen are comparing it to the libraries listed below
Sorting:
- Executable code based on Google articles☆166Dec 8, 2022Updated 3 years ago
- Include some core functions and model to handle speech separation☆156Jun 24, 2021Updated 4 years ago
- An open-source speech separation and enhancement library☆214May 13, 2020Updated 5 years ago
- Converts spatial videos to red-cyan anaglyph videos.☆10Jan 23, 2024Updated 2 years ago
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- single channel speech separation for music vocal and accompany separate、voice reduce noise☆14Jul 9, 2019Updated 6 years ago
- Collection of EM algorithms for blind source separation of audio signals☆298May 19, 2025Updated 8 months ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆181Nov 16, 2021Updated 4 years ago
- Speech to text library for Rhasspy using Kaldi☆15Dec 9, 2023Updated 2 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆223Mar 24, 2023Updated 2 years ago
- Looking to listen at cocktail party☆36Mar 24, 2023Updated 2 years ago
- Removing various types of noises present in the speech using Deep Neural Networks☆30Apr 17, 2021Updated 4 years ago
- https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques …☆26May 5, 2017Updated 8 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Aug 6, 2019Updated 6 years ago
- This folder contains Matlab programs for a toolbox for supervised speech separation using deep neural networks (DNNs).☆45Feb 7, 2017Updated 9 years ago
- Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.☆521Feb 17, 2022Updated 4 years ago
- Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.☆694Jul 28, 2023Updated 2 years ago
- Removes silence segments from wav audio files☆29Feb 29, 2020Updated 5 years ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆343Sep 5, 2020Updated 5 years ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆16May 14, 2022Updated 3 years ago
- Visual haptic using depth image☆19Dec 20, 2021Updated 4 years ago
- wake word engine benchmark framework☆151Jan 7, 2026Updated last month
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…☆755Apr 6, 2023Updated 2 years ago
- MATLAB real-time/interactive speech tools. This series is obsolete. SP3ARK is the up-to-date series (will be).☆59Mar 4, 2021Updated 4 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- Speech separation with utterance-level PIT experiments☆105Jul 12, 2018Updated 7 years ago
- Python package for noise supression in audio based on DNN☆22Mar 24, 2023Updated 2 years ago
- a python library for speech enhancement☆82Jun 26, 2024Updated last year
- SRP-PHAT using TL-SSC☆22Jun 2, 2015Updated 10 years ago
- Recurrent neural network for audio noise reduction☆5,350Feb 22, 2025Updated 11 months ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 6 years ago
- Unofficial PyTorch implementation of Google AI's VoiceFilter system☆1,191Jul 25, 2024Updated last year
- Convolutional neural nets for single channel speech enhancement☆143Dec 15, 2020Updated 5 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Mar 19, 2024Updated last year
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago
- Unsupervised Representation Learning for Singing Voice Separation☆22Feb 10, 2023Updated 3 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Feb 20, 2018Updated 7 years ago
- octave multi-channel signal processing☆10May 11, 2014Updated 11 years ago