Pytorch port of Google Research's VGGish model used for extracting audio features.
☆409Nov 3, 2021Updated 4 years ago
Alternatives and similar repositories for torchvggish
Users that are interested in torchvggish are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.☆87May 16, 2019Updated 6 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆393Jun 16, 2021Updated 4 years ago
- ☆1,685Jul 25, 2024Updated last year
- ☆231Feb 9, 2020Updated 6 years ago
- An implementation of vggish in keras with tf backend☆123Apr 11, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆20Nov 3, 2021Updated 4 years ago
- VGGSound: A Large-scale Audio-Visual Dataset☆355Sep 13, 2021Updated 4 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,440May 21, 2023Updated 2 years ago
- OpenL3: Open-source deep audio and image embeddings☆582Jun 17, 2023Updated 2 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Feb 1, 2019Updated 7 years ago
- ☆437Nov 1, 2023Updated 2 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Dec 4, 2021Updated 4 years ago
- ☆509Jun 25, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018☆207Apr 3, 2021Updated 4 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- (2020) Video Classification Neural Network☆30Feb 18, 2020Updated 6 years ago
- ESC-50: Dataset for Environmental Sound Classification☆1,770Mar 20, 2024Updated 2 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 3 years ago
- ☆99Nov 25, 2021Updated 4 years ago
- PyTorch code for training and evaluating MOVE, musically-motivated version embeddings☆50Jul 6, 2023Updated 2 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆150Jul 13, 2023Updated 2 years ago
- ☆17Jun 17, 2021Updated 4 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Generate embedding vectors from audio files☆60Sep 17, 2025Updated 6 months ago
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch☆73Sep 27, 2021Updated 4 years ago
- Code of PhoenixLin(3rd place) in the 2nd Youtube8M Video Understanding Challenge☆208Aug 1, 2019Updated 6 years ago
- This is the material for paper "IMPROVING AUTOMATIC DRUM TRANSCRIPTION USING LARGE-SCALE AUDIO-TO-MIDI ALIGNED DATA"☆15Dec 11, 2020Updated 5 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,140Nov 24, 2025Updated 4 months ago
- A lightweight library for Frechet Audio Distance calculation.☆312Feb 11, 2026Updated last month
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆500Jun 11, 2021Updated 4 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,241Dec 27, 2025Updated 3 months ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆145Nov 21, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- ☆47Nov 13, 2021Updated 4 years ago
- Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.☆685Dec 11, 2023Updated 2 years ago
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 4 years ago
- Efficient Training of Audio Transformers with Patchout☆371Jan 12, 2024Updated 2 years ago
- PyTorch implementation of NetVLAD & Online Hardest Triplet Loss.☆463Nov 3, 2018Updated 7 years ago
- A collection of Audio and Speech pre-trained models.☆193Jul 21, 2020Updated 5 years ago