michel-meneses / keyword-minerLinks
A framework for generating labeled audio recordings of single-spoken keywords via automatic forced alignment.
☆11Updated 2 years ago
Alternatives and similar repositories for keyword-miner
Users that are interested in keyword-miner are comparing it to the libraries listed below
Sorting:
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated 2 years ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆19Updated 3 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Updated 4 years ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 9 months ago
- Speech enhancement using mimic loss☆16Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆31Updated 3 years ago
- A small tool to calculate the distribution of audio durations in a directory☆14Updated 2 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆10Updated 2 months ago
- Generalized Minimal Distortion Principle for Blind Source Separation☆21Updated 4 years ago
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- A collection of papers related to speech model compression☆26Updated last year
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆11Updated 8 months ago
- Instructions on downloading and using the LibriAdapt dataset☆46Updated 3 years ago
- ☆16Updated 6 years ago
- ☆17Updated last year
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 3 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago
- ☆16Updated 4 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)☆17Updated last year
- ☆44Updated last year
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Updated 4 years ago
- ☆12Updated 9 months ago
- A temporal module for PyTorch-ComplexTensor☆44Updated last year
- Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…☆16Updated 5 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆40Updated 4 years ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Updated last year
- Source code for AAAI 22 paper: Hybrid Neural Networks for On-Device Directional Hearing☆17Updated last year