michel-meneses / keyword-miner

A framework for generating labeled audio recordings of single-spoken keywords via automatic forced alignment.

☆11

Alternatives and similar repositories for keyword-miner

Users who are interested in keyword-miner are comparing it to the repositories listed below.

edufonseca / shift_sec
Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".
☆13 Updated 2 years ago
DanielLin94144 / Test-time-adaptation-ASR-SUTA
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…
☆19 Updated 3 years ago
aispeech-lab / TinyWASE
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11 Updated 4 years ago
ffxiong / uaspeech
Baseline kaldi script for UA-SPEECH corpus
☆30 Updated 9 months ago
OSU-slatelab / mimic-enhance
Speech enhancement using mimic loss
☆16 Updated 5 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31 Updated 3 years ago
haoxiangsnr / audioinfo
A small tool to calculate the distribution of audio durations in a directory
☆14 Updated 2 years ago
robflynnyh / long-context-asr
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆10 Updated 2 months ago
fakufaku / 2020_interspeech_gmdp
Generalized Minimal Distortion Principle for Blind Source Separation
☆21 Updated 4 years ago
desh2608 / css
PyTorch implementation of Continuous Speech Separation
☆13 Updated 2 years ago
pyf98 / speech-model-compression
A collection of papers related to speech model compression
☆26 Updated last year
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆11 Updated 8 months ago
akhilmathurs / libriadapt
Instructions on downloading and using the LibriAdapt dataset
☆46 Updated 3 years ago
iiscleap / DIHARD-2019-baseline
☆16 Updated 6 years ago
dhimasryan / TMHINT-QI-VoiceMOS2023
☆17 Updated last year
RicherMans / PSL
Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"
☆30 Updated 3 years ago
Lhx94As / E2E-language-diarization
Source code for the paper "End-to-End Language Diarization for Bilingual Code-switching Speech"
☆19 Updated 3 years ago
popcornell / OSDC
☆16 Updated 4 years ago
haoheliu / DCASE_2022_Task_5
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28 Updated 3 years ago
glory20h / FitHuBERT
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)
☆17 Updated last year
k2-fsa / multi_quantization
☆44 Updated last year
X-LANCE / BER
Balanced Error Rate for Speaker Diarization
☆32 Updated 2 years ago
popcornell / MicRank
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.
☆22 Updated 4 years ago
hmohebbi / disentangling_representations
☆12 Updated 9 months ago
kamo-naoyuki / pytorch_complex
A temporal module for PyTorch-ComplexTensor
☆44 Updated last year
alecokas / BiLatticeRNN-Confidence
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…
☆16 Updated 5 years ago
RicherMans / UIT_Mobile
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆23 Updated 2 years ago
desh2608 / pytorch-tdnn
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆40 Updated 4 years ago
nobutaka-ito / pulse
Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)
☆43 Updated last year
wanganran / HybridBeam
Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing
☆17 Updated last year