gveres / donateacry-corpusLinks
An infant cry audio corpus that's being built through the Donate-a-cry campaign - see http://donateacry.com
☆188Updated 5 years ago
Alternatives and similar repositories for donateacry-corpus
Users that are interested in donateacry-corpus are comparing it to the libraries listed below
Sorting:
- Voice Activity Detection (VAD) using deep learning.☆201Updated 6 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆217Updated 2 years ago
- Deep Neural Network for Speaker Count Estimation☆156Updated 5 years ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆180Updated 3 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆211Updated 5 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 4 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…☆58Updated 6 years ago
- LogMMSE speech enhancement/noise reduction☆89Updated 5 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆113Updated 6 years ago
- Include some core functions and model to handle speech separation☆155Updated 4 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆102Updated last month
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆110Updated last year
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆141Updated 2 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆67Updated 4 years ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆89Updated 3 years ago
- Urdu Language Speech Emotional Corpus☆46Updated 6 years ago
- A Python toolbox for speech features extraction☆165Updated 2 years ago
- Visualization toolbox for Sound Event Detection☆123Updated last year
- Repo associated to the DESED dataset, download and creation of data☆139Updated last year
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆110Updated 3 years ago
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)☆91Updated 6 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- A statistical model-based Voice Activity Detection☆194Updated 6 years ago
- Recognition of baby cry audio signal☆279Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Updated 6 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆369Updated 2 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago