bryanwuAC / audio2vec
☆10 Updated 6 years ago
Alternatives and similar repositories for audio2vec
Users who are interested in audio2vec are comparing it to the libraries listed below
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained TensorFlow models and scikit-learn ☆144 Updated 2 years ago
- PyTorch implementation of WaveNet autoencoder https://arxiv.org/pdf/1704.01279.pdf ☆12 Updated 7 years ago
- An open-source speech separation and enhancement library ☆213 Updated 5 years ago
- A Python toolbox for speech features extraction ☆165 Updated 2 years ago
- Python library for handling audio datasets. ☆138 Updated 2 years ago
- ☆15 Updated 2 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆229 Updated 2 months ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis" ☆169 Updated 2 years ago
- Trains a convolutional autoencoder on Mel Spectrogram images for a list of songs, then displays the encoded latent features using t-SNE. ☆21 Updated 8 years ago
- Paper: https://arxiv.org/abs/1702.02285 ☆64 Updated 6 years ago
- Source Separation Project For ML Jeju Camp 2017 ☆48 Updated 8 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events ☆132 Updated 7 months ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 Updated 5 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems ☆275 Updated 2 years ago
- A WaveNet-based vocoder for fast inference ☆162 Updated 7 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19) ☆148 Updated 2 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text ☆245 Updated 6 years ago
- Evaluation toolbox for Sound Event Detection ☆156 Updated last year
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆95 Updated 2 years ago
- Empirical Evaluation of Speaker Adaptation on DNN-based Acoustic Model ☆13 Updated 5 years ago
- ☆131 Updated 7 years ago
- VoxCeleb1 i-vector based speaker recognition system ☆44 Updated 7 years ago
- TensorFlow implementation of the models used in "End-to-end learning for music audio tagging at scale" ☆152 Updated 6 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams ☆68 Updated 7 years ago
- Speech Enhancement using Bayesian WaveNet ☆98 Updated 7 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and train CNN, LSTM, or CNN+LST… ☆53 Updated 2 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and dec… ☆38 Updated 7 years ago
- Python implementation of the Short Term Objective Intelligibility measure ☆351 Updated last year
- A statistical model-based Voice Activity Detection ☆194 Updated 6 years ago
- PyTorch implementation of Generalized End-to-End Loss for speaker verification ☆87 Updated 6 years ago