MarvinLvn / voice-type-classifier
A deep learning model for classifying audio frames into [SPEECH, KCHI, CHI, MAL, FEM] classes.
☆43Updated 3 months ago
Alternatives and similar repositories for voice-type-classifier:
Users that are interested in voice-type-classifier are comparing it to the libraries listed below
- Tools to process the UltraSuite data☆11Updated 5 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆38Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- ☆59Updated 4 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆79Updated 3 years ago
- A Python toolbox for speech features extraction☆160Updated last year
- ☆40Updated 2 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated last year
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆85Updated 2 years ago
- Feature extractor for DL speech processing.☆65Updated 2 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆83Updated 5 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 5 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆45Updated 4 years ago
- ☆185Updated 8 months ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated last year
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆34Updated 6 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆89Updated 4 years ago
- Implementation of audio degradation processes☆101Updated 9 years ago
- ☆28Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- ☆37Updated 2 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 4 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆101Updated last year
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆140Updated 2 years ago
- ☆51Updated last year
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆57Updated 3 years ago
- ☆25Updated 3 years ago
- Phonetically-Oriented Word Error Rate☆33Updated 5 years ago