Pytorch port of Google Research's VGGish model used for extracting audio features.
☆410Nov 3, 2021Updated 4 years ago
Alternatives and similar repositories for torchvggish
Users that are interested in torchvggish are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.☆87May 16, 2019Updated 6 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆394Jun 16, 2021Updated 4 years ago
- ☆1,711Jul 25, 2024Updated last year
- ☆231Feb 9, 2020Updated 6 years ago
- An implementation of vggish in keras with tf backend☆123Apr 11, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆20Nov 3, 2021Updated 4 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- VGGSound: A Large-scale Audio-Visual Dataset☆357Sep 13, 2021Updated 4 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,448May 21, 2023Updated 2 years ago
- OpenL3: Open-source deep audio and image embeddings☆582Jun 17, 2023Updated 2 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Feb 1, 2019Updated 7 years ago
- ☆436Nov 1, 2023Updated 2 years ago
- Audio classification with VGGish as feature extractor in TensorFlow☆131Dec 4, 2021Updated 4 years ago
- ☆508Jun 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018☆208Apr 3, 2021Updated 5 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- (2020) Video Classification Neural Network☆30Feb 18, 2020Updated 6 years ago
- ESC-50: Dataset for Environmental Sound Classification☆1,794Mar 20, 2024Updated 2 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 3 years ago
- ☆99Nov 25, 2021Updated 4 years ago
- PyTorch code for training and evaluating MOVE, musically-motivated version embeddings☆50Jul 6, 2023Updated 2 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆150Jul 13, 2023Updated 2 years ago
- ☆17Jun 17, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Generate embedding vectors from audio files☆60Sep 17, 2025Updated 7 months ago
- Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch☆73Sep 27, 2021Updated 4 years ago
- Code of PhoenixLin(3rd place) in the 2nd Youtube8M Video Understanding Challenge☆208Aug 1, 2019Updated 6 years ago
- This is the material for paper "IMPROVING AUTOMATIC DRUM TRANSCRIPTION USING LARGE-SCALE AUDIO-TO-MIDI ALIGNED DATA"☆16Dec 11, 2020Updated 5 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,143Nov 24, 2025Updated 4 months ago
- A lightweight library for Frechet Audio Distance calculation.☆313Feb 11, 2026Updated 2 months ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆500Jun 11, 2021Updated 4 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,252Dec 27, 2025Updated 3 months ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆146Nov 21, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- ☆47Nov 13, 2021Updated 4 years ago
- Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.☆691Dec 11, 2023Updated 2 years ago
- The code for DCASE2021 task5 submission.☆20Feb 21, 2022Updated 4 years ago
- Efficient Training of Audio Transformers with Patchout☆374Jan 12, 2024Updated 2 years ago
- A collection of Audio and Speech pre-trained models.☆193Jul 21, 2020Updated 5 years ago
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes☆99Dec 4, 2024Updated last year