LCF2764 / autoKWS2021_1st_solution
Auto-KWS 2021 Challenge 1st place solution.
☆9Updated 3 years ago
Alternatives and similar repositories for autoKWS2021_1st_solution:
Users who are interested in autoKWS2021_1st_solution compare it to the repositories listed below:
- ☆13Updated 3 years ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Updated last year
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆25Updated 3 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆20Updated 3 months ago
- A list of papers for child ASR☆35Updated 3 months ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- ☆29Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character-level LSTM☆39Updated 2 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN☆22Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 3 months ago
- Objective metrics used in several text-to-speech (TTS) papers.☆46Updated 2 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset☆25Updated 4 months ago
- A SPMI Lab toolkit for language models.☆11Updated 7 years ago
- This is the implementation of the paper "Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement", wh…☆66Updated 2 years ago
- Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".☆36Updated 4 years ago
- ☆20Updated 4 years ago
- ☆25Updated 2 months ago
- ☆32Updated 2 years ago
- Speechflow for emotion recognition related information decomposition☆10Updated 3 years ago
- ☆29Updated 2 years ago
- WeNet online decode demo☆29Updated 3 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- Streaming Audio Transformers for online audio tagging☆43Updated 7 months ago
- Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/☆34Updated last year