FIGLAB / DirectionOfVoiceLinks
Direction-of-Voice (DoV) Estimation for Intuitive Speech Interaction with Smart Devices Ecosystems
☆34Updated 3 years ago
Alternatives and similar repositories for DirectionOfVoice
Users that are interested in DirectionOfVoice are comparing it to the libraries listed below
Sorting:
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- The Cone of Silence:☆155Updated 3 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆110Updated last year
- Include some core functions and model to handle speech separation☆155Updated 4 years ago
- Keras framework for speech enhancement using relativistic GANs☆52Updated 5 years ago
- ☆59Updated 7 years ago
- Benchmark for sound event localization task of DCASE 2019 challenge☆77Updated 4 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆82Updated 5 years ago
- Speech Denoising with Deep Feature Losses☆187Updated 5 years ago
- Deep Neural Network for Speaker Count Estimation☆155Updated 5 years ago
- The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.☆203Updated 2 years ago
- Accompanying repository for Ubicoustics: Plug-and-Play Acoustic Activity Recognition☆174Updated 2 years ago
- A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/…☆213Updated 2 years ago
- ☆60Updated 4 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 5 years ago
- Audio Denoising with Deep Network Priors☆163Updated 4 years ago
- Implements python programs to train and test a Recurrent Neural Network with Tensorflow☆72Updated 5 years ago
- Tensorflow implementation for Speech Enhancement (DDAE)☆48Updated 7 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆221Updated 2 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆130Updated 3 years ago
- A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"☆152Updated 5 years ago
- Voice Activity Detection (VAD) using deep learning.☆198Updated 5 years ago
- ICASSP2019 Tutorial: Detection and Classification of Acoustic Scenes and Events / Code examples☆42Updated 3 months ago
- Voice Conversion using Cycle GAN's For Non-Parallel Data☆123Updated 6 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆74Updated 4 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆72Updated 6 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆55Updated 4 years ago
- Speaker identification with VGGVox network☆84Updated 6 years ago
- ☆65Updated 2 years ago
- Executable code based on Google articles☆164Updated 2 years ago