celebrity-audio-collection / videoprocess
CN-Celeb, a large-scale Chinese celebrities dataset published by the Center for Speech and Language Technology (CSLT) at Tsinghua University.
☆70Updated 4 years ago
Related projects:
- ☆27Updated this week
- ☆69Updated 3 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- PyTorch implementation of Densely Connected Time Delay Neural Network☆83Updated last year
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆111Updated 2 years ago
- A PyTorch implementation of x-vector embedding☆78Updated 4 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi☆62Updated 5 years ago
- PyTorch implementation of RPNSD☆60Updated 3 months ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆53Updated 3 years ago
- ☆63Updated this week
- Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- Streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Seeing Wake Words: Audio-visual Keyword Spotting☆62Updated 4 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 5 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆75Updated 3 years ago
- Region proposal network based small-footprint keyword spotting (PyTorch)☆51Updated 10 months ago
- ☆55Updated 4 years ago
- py-webrtcvad wrapper for trimming speech clips☆47Updated 2 years ago
- ☆41Updated 3 years ago
- PyTorch implementation of a self-attentive speaker embedding☆16Updated 4 years ago
- Speech separation with utterance-level PIT experiments☆100Updated 6 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆29Updated 4 years ago
- ☆114Updated 3 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆98Updated 4 years ago
- Audio LPC (linear prediction code) using mel spectrogram, compatible with LPCNet☆61Updated 3 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 4 years ago
- ☆59Updated 3 years ago
- A summary of speech data augmentation algorithms☆64Updated 3 years ago