Implementation of music genre classification, audio-to-vec, song recommender, and music search in mxnet
☆56Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for mxnet-audio
Users that are interested in mxnet-audio are comparing it to the libraries listed below
Sorting:
- ☆12May 1, 2019Updated 6 years ago
- ☆12Jun 8, 2017Updated 8 years ago
- keras project for audio deep learning☆40Apr 10, 2018Updated 7 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender☆13Oct 27, 2017Updated 8 years ago
- Granular Synthesis in React☆13Oct 14, 2017Updated 8 years ago
- Materials for the Computational Music Creativity course at UPF-MTG (Spring 2020)☆13Jun 22, 2022Updated 3 years ago
- c++ implementation for ssh detector for object detect. something likes ssd☆14Jan 14, 2019Updated 7 years ago
- C++ Version Code for 'Recurrent Scale Approximation for Object Detection in CNN' in ICCV 2017☆16Jan 29, 2018Updated 8 years ago
- this repo attemps to reproduce DSOD: Learning Deeply Supervised Object Detectors from Scratch use gluon reimplementation☆14Aug 18, 2018Updated 7 years ago
- Windows port for caffe using cmake. Deprecated. Use Microsoft's port instead.☆15Jul 14, 2014Updated 11 years ago
- A CNN for denoising speech.☆17Jun 2, 2019Updated 6 years ago
- Mxnet video io reading performance optimization☆43Mar 29, 2018Updated 7 years ago
- Scene Classification using Audio in the nearby Environment.☆19Sep 4, 2019Updated 6 years ago
- A repository for a presentation on debugging and performances tricks with MXNet Gluon☆24Sep 17, 2018Updated 7 years ago
- Visual interface for exploring Freesound content and creating music in a 2-dimensional space☆21Sep 15, 2025Updated 5 months ago
- Focal loss for mxnet SSD example.☆23Dec 19, 2017Updated 8 years ago
- tools for MegaFace evaluation, e.g. plotting evalaution results☆21Jul 24, 2018Updated 7 years ago
- This is the supplemental repository for ISMIR 2019 paper GENERATING STRUCTURED DRUM PATTERN USING VARIATIONAL AUTOENCODER AND SELF-SIMILA…☆23Oct 28, 2019Updated 6 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- ☆27Apr 21, 2017Updated 8 years ago
- A summary of my recently surveyed papers. Some papers on Arxiv with unimpressive results are not included.☆25Apr 18, 2018Updated 7 years ago
- Pytorch implementation of "Sample-level Deep Convolutional Neural Networks for Music Auto-tagging Using Raw Waveforms"☆59Jul 23, 2023Updated 2 years ago
- MXNet implementation of WaveNet☆19Oct 20, 2016Updated 9 years ago
- ☆24Oct 12, 2018Updated 7 years ago
- open unmix - music source separation for tensorflow☆22Nov 27, 2019Updated 6 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆31Jun 17, 2024Updated last year
- Revisiting Singing Voice Detection : a Quantitative Review and the Future Outlook☆68Nov 21, 2022Updated 3 years ago
- Im2Flow: Motion Hallucination from Static Images for Action Recognition (CVPR 2018)☆56Sep 4, 2018Updated 7 years ago
- This is an PyTorch implementation of PredNet paper.☆29Jan 3, 2018Updated 8 years ago
- A TFLite-compatible fork of YAMNet from tensorflow/models☆31Jun 13, 2020Updated 5 years ago
- This repository contains the migrated code of Spleeter from Deezer in TF2.0☆28Jan 20, 2021Updated 5 years ago
- Image Retrieval Experiment Using Triplet Loss☆26Dec 12, 2016Updated 9 years ago
- This contains my M.Tech project work on using Deep Leanring for learning graph representations. Data will be provided on request☆33Apr 4, 2017Updated 8 years ago
- This repo contains code for *FD-MobileNet: Improved MobileNet with A Fast Downsampling Strategy*.☆35May 3, 2018Updated 7 years ago
- Music genre classification model using CRNN☆71Sep 27, 2018Updated 7 years ago
- PyTorch implementation of the 1D-Triplet-CNN neural network model described in Fusing MFCC and LPC Features using 1D Triplet CNN for Spea…☆32Jan 13, 2020Updated 6 years ago
- 一个比较复杂的生成真实场景文字的Python项目。原项目只能生成英文。 经过修改之后能够生成中文。 并且我也添加了图片中文字的切割和对应label的保存代码。☆33May 4, 2017Updated 8 years ago
- Use Cafffe to do Face Attributes MultiTask Classification based on CelebA data sets☆35Apr 6, 2022Updated 3 years ago