roman-vygon / triplet_loss_kws

Learning Efficient Representations for Keyword Spotting with Triplet Loss

☆111

Alternatives and similar repositories for triplet_loss_kws

Users who are interested in triplet_loss_kws are comparing it to the repositories listed below.

dobby-seo / Wav2Keyword
Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It reports state-of-the-art results on the Speech Commands dataset V1 and V2.
☆107 Updated 2 years ago
ranchlai / speaker-verification
Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN
☆91 Updated 3 years ago
ARM-software / keyword-transformer
Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769
☆131 Updated 3 years ago
iiscleap / NeuralPlda
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆99 Updated 5 years ago
ArchitParnami / Few-Shot-KWS
Few-Shot Keyword Spotting
☆64 Updated 4 years ago
lenovo-voice / THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
☆51 Updated 4 years ago
nicklashansen / voice-activity-detection
Voice Activity Detection (VAD) using deep learning.
☆196 Updated 5 years ago
zycv / awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
☆260 Updated 3 years ago
yufan-aslp / AliMeeting
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆120 Updated 3 years ago
Qualcomm-AI-research / bcresnet
☆65 Updated 2 years ago
mashrurmorshed / Torch-KWT
Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021.
☆37 Updated 2 years ago
juanmc2005 / SpeakerEmbeddingLossComparison
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…
☆60 Updated 4 years ago
Xflick / EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
☆108 Updated 2 years ago
lawlict / ECAPA-TDNN
☆103 Updated 3 years ago
VITA-Group / AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …
☆208 Updated 2 years ago
seongmin-kye / meta-SR
PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
☆74 Updated 4 years ago
zhenghuatan / rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …
☆135 Updated last year
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆106 Updated 3 years ago
RicherMans / Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
☆94 Updated last year
zyzisyz / mfa_conformer
☆150 Updated 2 years ago
alibabasglab / FRCRN
☆153 Updated 7 months ago
nikvaessen / w2v2-speaker
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
☆145 Updated 3 years ago
yuyq96 / D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
☆87 Updated 2 years ago
HolgerBovbjerg / data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆28 Updated 3 months ago
KrishnaDN / x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in PyTorch
☆105 Updated 4 years ago
burchim / EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆216 Updated 2 years ago
skgusrb12 / voice_activity_detection
PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆26 Updated 4 years ago
Le-Xiaohuai-speech / DPCRN_DNS3
Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"
☆205 Updated last year
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character-level LSTM
☆39 Updated 2 years ago
jymsuper / VAD_tutorial
Simple DNN-based Voice Activity Detection (VAD) using PyTorch
☆40 Updated 5 years ago