jackgle / YAMNet-transfer-learningLinks
Transfer learning and fine-tuning with YAMNet
☆17Updated 2 years ago
Alternatives and similar repositories for YAMNet-transfer-learning
Users that are interested in YAMNet-transfer-learning are comparing it to the libraries listed below
Sorting:
- Audio classification with VGGish as feature extractor in TensorFlow☆130Updated 3 years ago
- A TFLite-compatible fork of YAMNet from tensorflow/models☆31Updated 5 years ago
- General purpose sound recognition demo☆158Updated last year
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆67Updated 4 years ago
- An implementation of vggish in keras with tf backend☆121Updated 4 years ago
- Speaker recognition ,Voiceprint recognition☆53Updated 5 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆129Updated 4 years ago
- Environmental sound classification using Deep Learning with extracted features☆165Updated 5 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆74Updated 2 years ago
- ☆15Updated 4 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆217Updated 2 years ago
- PyTorch transcribed audioset classifier, including VGGish and YAMNet, along with utils to manipulate autioset category ontology.☆90Updated 5 months ago
- CNN 1D vs 2D audio classification☆103Updated 6 years ago
- Freesound Audio Tagging 2019☆95Updated 6 years ago
- SpeechYOLO Interspeech 2019☆44Updated 3 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆77Updated 4 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆389Updated 4 years ago
- Include some core functions and model to handle speech separation☆155Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- Voice Activity Detection (VAD) using deep learning.☆199Updated 5 years ago
- ☆109Updated 5 years ago
- Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper☆142Updated 2 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆74Updated 4 years ago
- Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBOOST, Random Forest.☆73Updated last year
- Speaker Recognition System using MFCC and GMM.☆24Updated 7 years ago
- A neural attention model for speech command recognition☆186Updated 2 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- Simple real-time Sound Event Detector based on YAMNet and pyaudio.☆23Updated 5 years ago
- Detect specific type of sound in audio signals☆13Updated last year
- Tensorflow 2.0 implementation of the paper: A Fully Convolutional Neural Network for Speech Enhancement☆257Updated 4 years ago