jcvasquezc / DisVoice
feature extraction from speech signals
☆376 · Updated last month
Alternatives and similar repositories for DisVoice
Users who are interested in DisVoice are comparing it to the libraries listed below.
- A Python library for working with Praat, TextGrids, time-aligned audio transcripts, and audio files. It is primarily used for extracting … ☆331 · Updated last year
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech. ☆257 · Updated 2 years ago
- Python package for openSMILE ☆284 · Updated 7 months ago
- These are Praat scripts I use in my research, implemented in Parselmouth for Python for use in Binder ☆132 · Updated 3 years ago
- My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need for a transcription. … ☆321 · Updated 3 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech. ☆348 · Updated 9 months ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language. ☆123 · Updated last year
- Speaker embedding (d-vector) trained with GE2E loss ☆282 · Updated last year
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings' ☆134 · Updated 6 months ago
- Voice Activity Detection based on Deep Learning & TensorFlow ☆368 · Updated 2 years ago
- A Python wrapper for Speech Signal Processing Toolkit (SPTK). ☆444 · Updated last year
- This is the GitHub page for publicly available emotional speech data. ☆357 · Updated 3 years ago
- Variational Bayes HMM over x-vectors diarization ☆272 · Updated last year
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text ☆242 · Updated 5 years ago
- Charsiu: A neural phonetic aligner. ☆307 · Updated 2 years ago
- End-to-End Neural Diarization ☆403 · Updated 3 years ago
- Diarization scoring tools. ☆251 · Updated 2 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems ☆271 · Updated last year
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat ☆293 · Updated last year
- A library for speech data augmentation in time-domain ☆666 · Updated 3 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion" ☆371 · Updated 11 months ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆216 · Updated 4 months ago
- A Cooperative Voice Analysis Repository for Speech Technologies ☆365 · Updated 4 years ago
- An open source dataset for source separation ☆432 · Updated last year
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆212 · Updated 3 years ago
- Problem Agnostic Speech Encoder ☆442 · Updated 2 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196 ☆317 · Updated 4 years ago
- Wav2Vec for speech recognition, classification, and audio classification ☆264 · Updated 3 years ago