deephdc / audio-classification-tf
A module to classify audio samples.
☆21Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for audio-classification-tf
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆29Updated last year
- SpeechYOLO Interspeech 2019☆42Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆41Updated last year
- Tensorflow Implementation of WaveGlow☆37Updated 4 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated last year
- Control mechanisms to the U-Net architecture for doing multiple source separation instruments☆48Updated 4 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 5 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Updated 2 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆98Updated last year
- Baseline systems for the FSD50K dataset☆67Updated 3 years ago
- ☆58Updated 6 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆26Updated 10 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- End-to-End Speech Recognition Using Tensorflow☆41Updated last year
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 5 years ago
- Python framework for Speech and Music Detection using Keras.☆101Updated last year
- Urban Sound Classification : striving towards a fair comparison☆17Updated 3 years ago
- A pytorch implementation of FFTNet.☆36Updated 6 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- ☆41Updated 2 months ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 2 years ago
- Audio data augmentation examples☆35Updated 6 years ago
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".☆13Updated last year
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- Audio Keyword Search☆12Updated 5 years ago
- A Convolutional Neural Network based Voice Activity Detector for Smartphones☆70Updated 5 years ago