wentaozhu / speechnas
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
☆30Updated last year
Related projects: ⓘ
- Streaming Audiotransformers for online Audio tagging☆39Updated 3 months ago
- with alignment learning and continuous wavelet transform☆19Updated 2 years ago
- ☆11Updated 2 years ago
- ☆13Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆30Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- A small tool to calculate the distribution of audio durations in a directory☆13Updated last year
- End-to-end diarization loss☆19Updated 3 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 3 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆13Updated last year
- Temporary anonymous version☆22Updated 6 months ago
- ☆13Updated 2 months ago
- ☆13Updated this week
- PyTorch implementation of Continuous Speech Separation☆13Updated last year
- ☆16Updated 2 years ago
- A library of speech gadgets.☆13Updated last year
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated 11 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated 8 months ago
- ☆15Updated 3 years ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆23Updated last month
- RepVgg + HiFiGAN☆33Updated 2 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated last year
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆19Updated 3 years ago
- Spherical residual vector quantization (SRVQ)☆26Updated 3 weeks ago
- ☆35Updated 2 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆8Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 2 years ago
- ☆13Updated this week