Subtitle-Synchronizer / jlibrosa
Librosa equivalent Java library to process audio file adn extract features from it.
☆98Updated 8 months ago
Alternatives and similar repositories for jlibrosa:
Users that are interested in jlibrosa are comparing it to the libraries listed below
- Voice Activity Detection (VAD) using deep learning.☆193Updated 5 years ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Updated 2 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆102Updated 2 years ago
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)☆88Updated 5 years ago
- a python library for speech enhancement☆77Updated 6 months ago
- Speex Echo Canceller Python Library☆114Updated 6 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆90Updated 3 years ago
- An open source dataset for source separation☆391Updated 11 months ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆317Updated 4 years ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- This repository is a collection of TTS Models in TFLite☆189Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆349Updated 5 months ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated last year
- simple dnn based vad☆70Updated 6 years ago
- ☆103Updated 4 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆84Updated last month
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆129Updated last month
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆130Updated 11 months ago
- This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approac…☆212Updated 7 months ago
- implementation of rnnoise_16k☆128Updated 3 years ago
- Repo associated to the DESED dataset, download and creation of data☆131Updated 6 months ago
- Python implementation of the Short Term Objective Intelligibility measure☆331Updated last year
- The codebase for Data-driven general-purpose voice activity detection.☆93Updated last year
- Unofficial Keras implementation of Google AI VoiceFilter☆37Updated last year
- Tools for Speech Enhancement integrated with Kaldi☆405Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- LogMMSE speech enhancement/noise reduction☆88Updated 4 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆359Updated last year
- A unofficial Pytorch implementation of Google's VoiceFilter☆99Updated last year