ryuuji06 / keyword-spotting

In this repository, I implement a system for detecting specific spoken words in speech signal. When reading a speech signal, I detect not only the presence, but also the time position of the keyword. For this purpose, I use a CNN-RNN network, with a CTC (Connectionist Temporal Classification) loss function.

☆18

Alternatives and similar repositories for keyword-spotting

Users that are interested in keyword-spotting are comparing it to the libraries listed below

Sorting:

Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆39Updated 2 years ago
HolgerBovbjerg / data2vec-KWS
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆28Updated 2 months ago
mrusci / ondevice-learning-kws
Test Framework for few-shot open set KWS
☆31Updated 6 months ago
re9ulus / BC-ResNet
BC-ResNet for Keyword Spotting
☆38Updated 3 years ago
Qualcomm-AI-research / bcresnet
☆62Updated last year
aizhiqi-work / MM-KWS
Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"
☆29Updated last week
swagshaw / TorchKWS
Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.
☆25Updated last year
ncsoft / PhonMatchNet
Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)
☆48Updated 11 months ago
dianwen-ng / Keyword-Spotting-ConvMixer
☆31Updated 2 years ago
kaistmm / Metric-UD-KWS
Official code for Metric learning for user-defined keyword spotting
☆31Updated last year
jsvir / vad
[Tiny VAD] SG-VAD: Stochastic Gates Based Speech Activity Detection
☆29Updated last month
skgusrb12 / voice_activity_detection
Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)
☆26Updated 4 years ago
hongfeixue / StutteringSpeechChallenge
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆13Updated 11 months ago
zhaoyi2 / audio_augment
A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN
☆22Updated 4 years ago
gusrud1103 / LibriPhrase
Recipe for LibriPhrase
☆28Updated last year
Jasson-Chen / Add_noise_and_rir_to_speech
The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generat…
☆29Updated 3 years ago
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆81Updated last year
DataSenseiAryan / GoogleSpeechCommandLowFootprint
This repository contains the Code for SOTA model on Google Speech Command V2 dataset.
☆15Updated last year
mayank-git-hub / ETE-Speech-Recognition
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch
☆25Updated 9 months ago
tango4j / llm_speaker_tagging
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆14Updated 11 months ago
BUTSpeechFIT / DVBx
Discriminative Training of VBx Diarization
☆24Updated 7 months ago
VoxBlink2 / ScriptsForVoxBlink2
Official Repository For VoxBlink2
☆67Updated 9 months ago
Xiaobin-Rong / SEtrain
A training code template for DNN-based speech enhancement.
☆92Updated last month
tarun360 / SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal.
☆14Updated 2 years ago
jiay7 / wenet_onlinedecode
Went online decode demo
☆29Updated 4 years ago
dmlguq456 / NeXt_TDNN_ASV
Official repository of NeXt-TDNN for speaker verification
☆71Updated 7 months ago
Diamondfan / Child-ASR-Paper
A list of papers for child ASR
☆40Updated 7 months ago
VoxBlink / ScriptsForVoxBlink
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.
☆26Updated last year
Audio-WestlakeU / UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
☆27Updated 5 months ago
AmirmohammadRostami / KeywordsSpotting-EfficientNet-A0
EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting
☆23Updated 2 years ago