FIGLAB / DirectionOfVoiceLinks
Direction-of-Voice (DoV) Estimation for Intuitive Speech Interaction with Smart Devices Ecosystems
☆35Updated 3 years ago
Alternatives and similar repositories for DirectionOfVoice
Users that are interested in DirectionOfVoice are comparing it to the libraries listed below
Sorting:
- The Cone of Silence:☆156Updated 3 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 3 years ago
- Benchmark for sound event localization task of DCASE 2019 challenge☆78Updated 5 years ago
- Include some core functions and model to handle speech separation☆156Updated 4 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.☆29Updated 6 years ago
- ☆59Updated 7 years ago
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆111Updated last year
- Deep Neural Network for Speaker Count Estimation☆157Updated 5 years ago
- End-to-End Speech Recognition Using Tensorflow☆41Updated 2 years ago
- The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.☆209Updated 3 years ago
- Speech Denoising with Deep Feature Losses☆189Updated 5 years ago
- Audio Denoising with Deep Network Priors☆163Updated 5 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆74Updated 4 years ago
- Baseline method for sound event localization task of DCASE 2020 challenge☆57Updated 5 years ago
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆83Updated 5 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 7 years ago
- Keras framework for speech enhancement using relativistic GANs☆52Updated 5 years ago
- A implementation of Power Normalized Cepstral Coefficients: PNCC☆54Updated 6 years ago
- DOA, VAD and KWS for ReSpeaker Microphone Array☆324Updated 7 years ago
- Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…☆222Updated 2 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆114Updated 6 years ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆89Updated 3 years ago
- Voice Conversion using Cycle GAN's For Non-Parallel Data☆124Updated 7 years ago
- ☆60Updated 5 years ago
- The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmente…☆126Updated 2 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆132Updated 5 years ago
- Voice Activity Detection (VAD) using deep learning.☆204Updated 6 years ago
- ICASSP2019 Tutorial: Detection and Classification of Acoustic Scenes and Events / Code examples☆42Updated 7 months ago
- Tensorflow implementation for Speech Enhancement (DDAE)☆48Updated 7 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Updated 2 years ago