bootphon / shennong
A Python toolbox for speech features extraction
☆161Updated 2 years ago
Alternatives and similar repositories for shennong:
Users who are interested in shennong are comparing it to the libraries listed below:
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. This model shows state-of-the-art results on the Speech Commands dataset V1 and V2.☆106Updated 2 years ago
- Feature extractor for DL speech processing.☆65Updated 3 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 3 years ago
- ☆185Updated last year
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆59Updated 4 years ago
- Implementation of audio degradation processes☆102Updated 9 years ago
- ☆60Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆141Updated 2 years ago
- target speaker extraction and verification for multi-talker speech☆178Updated 4 years ago
- A python IO interface for data accessing in kaldi☆39Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆81Updated 3 years ago
- Speech enhancement system for the CHiME-5 dinner party scenario☆109Updated 3 months ago
- A personal toolkit for single/multi-channel speech recognition & enhancement & separation.☆142Updated last year
- Keras-based python framework to compute phonological posterior probabilities from audio files☆43Updated 2 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Updated 2 years ago
- A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an …☆71Updated 3 weeks ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 3 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Updated last year
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Updated last year
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆143Updated last year
- A PyTorch implementation of End-to-End Neural Diarization☆109Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- ☆91Updated 2 years ago
- Python implementation of the SRMR toolbox☆122Updated 10 months ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Updated 4 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated 2 years ago