MikeMpapa / CNNs-Speech-Music-DiscriminationLinks
A deep learning framework for Speech-Music discrimination of continuous audio streams
☆68Updated 7 years ago
Alternatives and similar repositories for CNNs-Speech-Music-Discrimination
Users that are interested in CNNs-Speech-Music-Discrimination are comparing it to the libraries listed below
Sorting:
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 6 years ago
- Evaluation toolbox for Sound Event Detection☆149Updated last year
- A library for augmenting annotated audio data☆235Updated 4 years ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆101Updated 2 years ago
- Visualization toolbox for Sound Event Detection☆122Updated last year
- Python framework for Speech and Music Detection using Keras.☆108Updated 2 years ago
- DCASE 2017 Baseline system☆82Updated 5 years ago
- Voice Activity Detector☆73Updated 2 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- CNN-based singing voice detection experiments☆37Updated 7 years ago
- DCASE 2018 Baseline systems☆129Updated 5 years ago
- Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"☆150Updated 5 years ago
- ☆26Updated 7 years ago
- simple-minded audio classifier in python (using MFCC and GMM)☆83Updated 2 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events☆132Updated 4 months ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆143Updated 2 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Updated 4 years ago
- A platform for the collaborative creation of open audio collections labeled by humans and based on Freesound content.☆140Updated last year
- DCASE 2016 Baseline system, python implementation☆53Updated 8 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit☆82Updated 4 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆130Updated 3 years ago
- Voice Activity Detection system (Matlab-based implementation)☆108Updated 8 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Updated 7 years ago
- ☆59Updated 7 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- Deep Neural Network for Speaker Count Estimation☆153Updated 4 years ago
- ☆27Updated 7 years ago
- Phoneme Recognition using RecNet☆97Updated 8 years ago
- Human Voice Wave Samples☆84Updated 10 years ago
- SiSEC MUS 2018 Submission System☆43Updated 5 years ago