bryanwuAC / audio2vecLinks
☆10Updated 6 years ago
Alternatives and similar repositories for audio2vec
Users that are interested in audio2vec are comparing it to the libraries listed below
Sorting:
- DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.☆378Updated 5 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆145Updated 3 years ago
- Convolutional Neural Network for auto-tagging of audio clips on MagnaTagATune dataset☆59Updated 3 years ago
- Metadata, scripts and baselines for the MTG-Jamendo dataset☆359Updated 3 weeks ago
- Deep Learning experiments for audio classification☆148Updated 8 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆246Updated 6 years ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆450Updated last year
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting …☆343Updated 3 weeks ago
- ☆123Updated 6 years ago
- An open-source speech separation and enhancement library☆214Updated 5 years ago
- Pronunciation Evaluation☆98Updated 6 months ago
- Deep learning for MIR☆245Updated last year
- Deep learning based speech source separation using Pytorch☆319Updated 5 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆371Updated 2 years ago
- (Unofficial) Pytorch Implementation of Music Mood Detection Based On Audio And Lyrics With Deep Neural Net☆112Updated 6 years ago
- A collection of python scripts for extracting and analyzing acoustics from audio files.☆102Updated 2 years ago
- Trains a convolutional autoencoder on Mel Spectrogram images for a list of songs, then displays the encoded latent features using t-SNE.☆21Updated 8 years ago
- A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"☆154Updated 6 years ago
- ☆157Updated 5 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆237Updated last month
- This repository collects information about different data sets for Music Emotion Recognition.☆257Updated 3 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Updated 2 years ago
- Python implementation of the Short Term Objective Intelligibility measure☆357Updated 2 years ago
- A multi-channel neural network audio classifier using Keras☆268Updated 4 years ago
- A Machine Learning Approach of Emotional Model☆248Updated last year
- A library for augmenting annotated audio data☆236Updated 4 years ago
- ☆130Updated 4 years ago
- Speech Denoising with Deep Feature Losses☆189Updated 5 years ago
- Source Separation Project For ML Jeju Camp 2017☆48Updated 8 years ago
- Simple collection of MIR datasets with metadata and links☆250Updated 5 months ago