LCF2764 / autoKWS2021_1st_solution
Auto-KWS 2021 Challenge 1st place solution.
☆9Updated 3 years ago
Related projects: ⓘ
- ☆13Updated 3 years ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆15Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆37Updated last year
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- This is a mandarin version of speech separation dataset like WSJMix and LibriMix☆11Updated 2 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆23Updated 2 years ago
- Production first, nn-based on-device signal processing toolkit.☆63Updated last year
- ☆28Updated 2 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Updated 3 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- Speechflow for emotion recognition related information decomposition☆9Updated 3 years ago
- Computes the MWER (minimum WER) Loss with beam search and negative sampling strategy.☆17Updated last year
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 2 months ago
- Speech samples and code of BEdit-TTS☆32Updated 11 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆27Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- End-to-end diarization loss☆19Updated 3 years ago
- ☆20Updated 3 years ago
- A SPMI Lab toolkit for language models.☆11Updated 7 years ago
- MagicData-RAMC Dataset and Baseline☆49Updated 2 years ago
- ☆26Updated last year
- Neural network density models for speech separation.☆20Updated 3 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Updated 4 years ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆20Updated 3 years ago
- The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"☆53Updated 2 years ago
- Target Speaker Extraction Toolkit☆58Updated 2 weeks ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated 6 months ago