Curriculum Vitae of Quan Wang
☆15Dec 13, 2025Updated 2 months ago
Alternatives and similar repositories for CurriculumVitae
Users that are interested in CurriculumVitae are comparing it to the libraries listed below
Sorting:
- A fast parallel implementation of RNN Transducer.☆12Apr 8, 2025Updated 10 months ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Nov 12, 2020Updated 5 years ago
- ☆13Mar 25, 2021Updated 4 years ago
- ☆33Aug 6, 2021Updated 4 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Jun 17, 2022Updated 3 years ago
- A repository for Chinese text normalization.☆20May 2, 2021Updated 4 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- ☆18Oct 31, 2022Updated 3 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 3 months ago
- ☆20Nov 22, 2020Updated 5 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆43Nov 18, 2022Updated 3 years ago
- ☆24Mar 13, 2020Updated 5 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago
- using microphone☆17Sep 2, 2021Updated 4 years ago
- Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.☆23Jan 24, 2021Updated 5 years ago
- MagicData-RAMC Dataset and Baseline☆58Sep 13, 2022Updated 3 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Mar 3, 2020Updated 6 years ago
- The case study and multilingfual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆25Jul 6, 2017Updated 8 years ago
- ☆32May 30, 2021Updated 4 years ago
- Went online decode demo☆31Apr 28, 2021Updated 4 years ago
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- Add motion-based magic to your React Native apps! ThinkSys Mediapipe Plugin offers real-time pose detection for iOS, with easy integratio…☆32Jan 19, 2026Updated last month
- ASCEND Chinese-English code-switching dataset☆30Jul 12, 2022Updated 3 years ago
- An Attention-based Neural Network Approach for Single Channel Speech Enhancement☆25Dec 1, 2019Updated 6 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆80Jan 9, 2025Updated last year
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆50Sep 2, 2025Updated 6 months ago
- ☆32Oct 28, 2022Updated 3 years ago
- ☆10Apr 16, 2020Updated 5 years ago
- The voice project of Embedded System use STM32☆11Dec 25, 2013Updated 12 years ago
- ☆16Feb 7, 2019Updated 7 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year