jcvasquezc / DisVoice
feature extraction from speech signals
☆347Updated last month
Related projects:
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆311Updated 9 months ago
- Python package for openSMILE☆241Updated 5 months ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆281Updated 2 months ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.☆232Updated last year
- These are Praat scripts I use in my research, implemented in Parselmouth for Python for use in Binder☆122Updated 2 years ago
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat☆281Updated 10 months ago
- Diarization scoring tools.☆213Updated last year
- Speaker embedding (d-vector) trained with GE2E loss☆270Updated 8 months ago
- An open source dataset for source separation☆363Updated 7 months ago
- Variational Bayes HMM over x-vectors diarization☆251Updated 8 months ago
- End-to-End Neural Diarization☆367Updated 3 years ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆437Updated 2 months ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆363Updated this week
- This is the GitHub page for publicly available emotional speech data.☆314Updated 2 years ago
- ☆128Updated 3 weeks ago
- A library for speech data augmentation in time-domain☆635Updated 3 years ago
- Problem Agnostic Speech Encoder☆439Updated last year
- ☆180Updated 4 months ago
- A statistical model-based Voice Activity Detection☆187Updated 5 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR☆887Updated last year
- Voice Activity Detection (VAD) using deep learning.☆190Updated 4 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems☆248Updated 11 months ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆325Updated 2 months ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆122Updated 2 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆303Updated 3 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆464Updated 3 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆184Updated last year
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆474Updated last week
- Voice Activity Detection based on Deep Learning & TensorFlow☆348Updated last year
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆229Updated 4 years ago