harritaylor / torchvggish
PyTorch port of Google Research's VGGish model, used for extracting audio features.
☆408 · Nov 3, 2021 · Updated 4 years ago
Alternatives and similar repositories for torchvggish
Users who are interested in torchvggish are comparing it to the libraries listed below.
- Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test. ☆87 · May 16, 2019 · Updated 6 years ago
- ☆1,662 · Jul 25, 2024 · Updated last year
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch. ☆392 · Jun 16, 2021 · Updated 4 years ago
- ☆231 · Feb 9, 2020 · Updated 6 years ago
- An implementation of VGGish in Keras with the TensorFlow backend. ☆123 · Apr 11, 2021 · Updated 4 years ago
- ☆20 · Nov 3, 2021 · Updated 4 years ago
- VGGSound: A Large-scale Audio-Visual Dataset. ☆350 · Sep 13, 2021 · Updated 4 years ago
- ☆435 · Nov 1, 2023 · Updated 2 years ago
- PyTorch implementation of the paper "A Global-local Attention Framework for Weakly Labelled Audio Tagging". ☆13 · Feb 6, 2021 · Updated 5 years ago
- OpenL3: Open-source deep audio and image embeddings. ☆578 · Jun 17, 2023 · Updated 2 years ago
- ☆508 · Jun 25, 2024 · Updated last year
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer". ☆1,419 · May 21, 2023 · Updated 2 years ago
- PyTorch code for training and evaluating MOVE, musically-motivated version embeddings. ☆50 · Jul 6, 2023 · Updated 2 years ago
- ESC-50: Dataset for Environmental Sound Classification. ☆1,738 · Mar 20, 2024 · Updated last year
- A CNN audio classifier via spectrogram images. ☆10 · Jul 21, 2017 · Updated 8 years ago
- ☆17 · Jun 17, 2021 · Updated 4 years ago
- Quasi-Periodic WaveNet PyTorch implementation. ☆13 · Mar 27, 2021 · Updated 4 years ago
- TensorFlow and Kaldi implementation of our paper "VAE-based regularization for deep speaker embedding". ☆11 · Mar 24, 2023 · Updated 2 years ago
- Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018. ☆202 · Apr 3, 2021 · Updated 4 years ago
- (2020) Video Classification Neural Network. ☆30 · Feb 18, 2020 · Updated 5 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning. ☆1,132 · Nov 24, 2025 · Updated 2 months ago
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch. ☆73 · Sep 27, 2021 · Updated 4 years ago
- ☆99 · Nov 25, 2021 · Updated 4 years ago
- A lightweight library for Frechet Audio Distance calculation. ☆308 · Updated this week
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab. ☆2,230 · Dec 27, 2025 · Updated last month
- ☆58 · Nov 2, 2020 · Updated 5 years ago
- Code for https://arxiv.org/abs/1712.00254 ☆16 · Dec 6, 2017 · Updated 8 years ago
- Kaggle | 1st place solution for Freesound Audio Tagging 2019. ☆316 · Jun 22, 2022 · Updated 3 years ago
- Audio classification with VGGish as feature extractor in TensorFlow. ☆131 · Dec 4, 2021 · Updated 4 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation". ☆149 · Jul 13, 2023 · Updated 2 years ago
- Code of PhoenixLin (3rd place) in the 2nd YouTube-8M Video Understanding Challenge. ☆208 · Aug 1, 2019 · Updated 6 years ago
- A GPU-optional modular synthesizer in PyTorch, 16200x faster than realtime, for audio ML researchers. ☆366 · Feb 2, 2026 · Updated last week
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained TensorFlow models and scikit-learn. ☆145 · Nov 21, 2022 · Updated 3 years ago
- Audio processing using a PyTorch 1D convolution network. ☆1,117 · Dec 7, 2025 · Updated 2 months ago
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing. ☆16 · Aug 26, 2022 · Updated 3 years ago
- PyTorch implementation of NetVLAD & Online Hardest Triplet Loss. ☆463 · Nov 3, 2018 · Updated 7 years ago
- Efficient Training of Audio Transformers with Patchout. ☆371 · Jan 12, 2024 · Updated 2 years ago
- Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging. ☆676 · Dec 11, 2023 · Updated 2 years ago
- Interspeech 2019 tutorial materials. ☆49 · Sep 26, 2019 · Updated 6 years ago