IBM / MAX-Audio-Classifier
Identify sounds in short audio clips
☆154Updated last year
Related projects ⓘ
Alternatives and complementary repositories for MAX-Audio-Classifier
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆98Updated last year
- ☆58Updated 6 years ago
- Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"☆148Updated 5 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆127Updated 2 years ago
- ☆223Updated 4 years ago
- A library for augmenting annotated audio data☆233Updated 3 years ago
- Machine Learning Sound Classifier☆134Updated 5 years ago
- 8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)☆114Updated 4 years ago
- Visualization toolbox for Sound Event Detection☆116Updated 8 months ago
- Deep Learning experiments for audio classification☆149Updated 7 years ago
- Environmental Sound Classification with Convolutional Neural Networks - paper replication data☆75Updated 7 years ago
- Evaluation toolbox for Sound Event Detection☆145Updated 5 months ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆76Updated 6 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams☆68Updated 6 years ago
- Python framework for Speech and Music Detection using Keras.☆101Updated last year
- Single and multichannel sound event detection using convolutional recurrent neural networks. DCASE 2017 real-life sound event detection w…☆185Updated 2 years ago
- ☆130Updated 3 years ago
- Source Separation Project For ML Jeju Camp 2017☆48Updated 7 years ago
- Fetch and use Google's AudioSet dataset☆124Updated 7 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆142Updated 2 years ago
- Learning embeddings for laughter categorization☆34Updated 6 years ago
- DCASE 2018 Baseline systems☆129Updated 5 years ago
- Voice Activity Detection (VAD) using deep learning.☆192Updated 5 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated last year
- DCASE 2017 Baseline system☆82Updated 4 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆98Updated last year
- TensorFlow implementation of "SoundNet".☆145Updated 6 years ago
- Repo associated to the DESED dataset, download and creation of data☆128Updated 4 months ago