Subtitle-Synchronizer / jlibrosa
A Java equivalent of Librosa for processing audio files and extracting features from them.
☆94Updated 4 months ago
Related projects:
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆68Updated 5 years ago
- Speex Echo Canceller Python Library☆112Updated 6 years ago
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)☆87Updated 5 years ago
- Noise15, Noisex-92 and Nonspeech☆29Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆86Updated 3 years ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Updated last year
- A demo of Android keyword spotting based on the TensorFlow tutorial example☆25Updated 4 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆126Updated 4 months ago
- ☆59Updated 3 years ago
- Voice activity detection (VAD) papers and code (from 198*~) and their classification.☆84Updated 7 months ago
- Voice Activity Detection (VAD) using deep learning.☆190Updated 4 years ago
- A statistical model-based Voice Activity Detection☆187Updated 5 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆57Updated 3 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆127Updated 8 months ago
- An open-source speech separation and enhancement library☆211Updated 4 years ago
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆173Updated last year
- Voice Activity Detection LSTM-RNN learning model☆50Updated 6 years ago
- LogMMSE speech enhancement/noise reduction☆87Updated 4 years ago
- Unofficial Keras implementation of Google AI VoiceFilter☆34Updated last year
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆140Updated last year
- Voice Activity Detection☆29Updated 6 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆217Updated last month
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications☆40Updated 2 years ago
- Various audio processing tasks☆22Updated 2 years ago
- ☆99Updated 4 years ago
- DCCRN with various loss functions☆88Updated last year
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆122Updated 4 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆98Updated last year
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆39Updated 4 years ago
- Repo associated to the DESED dataset, download and creation of data☆121Updated 2 months ago