phanxuanphucnd / wav2kwsLinks
Wav2kws is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Google Speech Commands datasets V1 and V2.
☆12Updated 4 years ago
Alternatives and similar repositories for wav2kws
Users that are interested in wav2kws are comparing it to the libraries listed below
Sorting:
- WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement☆40Updated 4 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆48Updated 6 months ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆32Updated last month
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 4 years ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆48Updated last year
- ☆49Updated 4 years ago
- This repo provides the processed samples of the manuscript "a Mask Free Neural Network for Monaural Speech Enhancement", which was accep…☆37Updated 2 years ago
- fast SpecAugmentation code with numpy and scipy☆31Updated 5 years ago
- ☆51Updated 3 years ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆62Updated 2 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆93Updated 2 years ago
- SpEx+(tied) source code☆86Updated last year
- ☆41Updated 5 years ago
- ☆51Updated 4 years ago
- Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."☆43Updated 6 years ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆115Updated 2 years ago
- MultiSV: scripts for data preparation☆27Updated 5 months ago
- Clustering-based methods for overlapping diarization☆80Updated last year
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆65Updated 3 years ago
- PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"☆19Updated 5 years ago
- This is the implementation of the paper ''Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement'', wh…☆68Updated 3 years ago
- ☆54Updated last year
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Updated 3 years ago
- ☆84Updated last year
- Unofficial Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation(MISO-BF-MISO)☆52Updated 3 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆56Updated 9 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆28Updated 6 months ago