harritaylor/torchvggish

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/harritaylor/torchvggish)

harritaylor / torchvggish

Pytorch port of Google Research's VGGish model used for extracting audio features.

☆410

Alternatives and similar repositories for torchvggish

Users that are interested in torchvggish are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tcvrick / audioset-vggish-tensorflow-to-pytorch
View on GitHub
Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.
☆85May 16, 2019Updated 7 years ago
ksanjeevan / crnn-audio-classification
View on GitHub
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
☆391Jun 16, 2021Updated 5 years ago
qiuqiangkong / audioset_tagging_cnn
View on GitHub
☆1,766Jul 25, 2024Updated 2 years ago
qiuqiangkong / audioset_classification
View on GitHub
☆229Feb 9, 2020Updated 6 years ago
DTaoo / VGGish
View on GitHub
An implementation of vggish in keras with tf backend
☆123Apr 11, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
biboamy / AVASpeech_Music_Labels
View on GitHub
☆20Nov 3, 2021Updated 4 years ago
WangHelin1997 / GL-AT
View on GitHub
Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.
☆13Feb 6, 2021Updated 5 years ago
hche11 / VGGSound
View on GitHub
VGGSound: A Large-scale Audio-Visual Dataset
☆359Sep 13, 2021Updated 4 years ago
YuanGongND / ast
View on GitHub
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
☆1,465May 21, 2023Updated 3 years ago
marl / openl3
View on GitHub
OpenL3: Open-source deep audio and image embeddings
☆599Jun 17, 2023Updated 3 years ago
carl03q / AudioClassifier
View on GitHub
A CNN audio classifier via spectrogram images.
☆10Jul 21, 2017Updated 9 years ago
jordipons / neural-classifiers-with-few-audio
View on GitHub
Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274
☆60Feb 1, 2019Updated 7 years ago
minzwon / sota-music-tagging-models
View on GitHub
☆439Nov 1, 2023Updated 2 years ago
luuil / Tensorflow-Audio-Classification
View on GitHub
Audio classification with VGGish as feature extractor in TensorFlow
☆131Dec 4, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
YapengTian / AVE-ECCV18
View on GitHub
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
☆210Apr 3, 2021Updated 5 years ago
zhangyaoyuan / NextVLAD-Attention-Model
View on GitHub
(2020) Video Classification Neural Network
☆30Feb 18, 2020Updated 6 years ago
qiuqiangkong / torchlibrosa
View on GitHub
☆512Jun 25, 2024Updated 2 years ago
JMGaljaard / VGGish-pytorch
View on GitHub
☆16Jun 17, 2021Updated 5 years ago
CSLT-THU / IS2019-VAE
View on GitHub
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 3 years ago
karolpiczak / ESC-50
View on GitHub
ESC-50: Dataset for Environmental Sound Classification
☆1,849Mar 20, 2024Updated 2 years ago
minzwon / semi-supervised-music-tagging-transformer
View on GitHub
☆99Nov 25, 2021Updated 4 years ago
furkanyesiler / move
View on GitHub
PyTorch code for training and evaluating MOVE, musically-motivated version embeddings
☆50Jul 6, 2023Updated 3 years ago
YuanGongND / psla
View on GitHub
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
☆150Jul 13, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
IBM / MAX-Audio-Embedding-Generator
View on GitHub
Generate embedding vectors from audio files
☆58Sep 17, 2025Updated 10 months ago
linrongc / youtube-8m
View on GitHub
Code of PhoenixLin(3rd place) in the 2nd Youtube8M Video Understanding Challenge
☆208Aug 1, 2019Updated 6 years ago
ekazakos / auditory-slow-fast
View on GitHub
Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch
☆73Sep 27, 2021Updated 4 years ago
Sma1033 / adt_with_a2md
View on GitHub
This is the material for paper "IMPROVING AUTOMATIC DRUM TRANSCRIPTION USING LARGE-SCALE AUDIO-TO-MIDI ALIGNED DATA"
☆16Dec 11, 2020Updated 5 years ago
gudgud96 / frechet-audio-distance
View on GitHub
A lightweight library for Frechet Audio Distance calculation.
☆317Feb 11, 2026Updated 5 months ago
iver56 / torch-audiomentations
View on GitHub
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,162Nov 24, 2025Updated 8 months ago
JustinYuu / MM_Pyramid
View on GitHub
[ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
☆15Aug 26, 2022Updated 3 years ago
iver56 / audiomentations
View on GitHub
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
☆2,301Apr 13, 2026Updated 3 months ago
hyakuchiki / diffsynth
View on GitHub
☆48Nov 13, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zcaceres / spec_augment
View on GitHub
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆501Jun 11, 2021Updated 5 years ago
nttcslab / eval-audio-repr
View on GitHub
EVAR ~ Evaluation package for Audio Representations
☆81Feb 19, 2026Updated 5 months ago
yangdongchao / DCASE2021Task5
View on GitHub
The code for DCASE2021 task5 submission.
☆20Feb 21, 2022Updated 4 years ago
kkoutini / PaSST
View on GitHub
Efficient Training of Audio Transformers with Patchout
☆386Jan 12, 2024Updated 2 years ago
jordipons / sklearn-audio-transfer-learning
View on GitHub
A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn
☆148Nov 21, 2022Updated 3 years ago
jordipons / musicnn
View on GitHub
Pronounced as "musician", musicnn is a set of pre-trained deep convolutional neural networks for music audio tagging.
☆712Dec 11, 2023Updated 2 years ago
balavenkatesh3322 / audio-pretrained-model
View on GitHub
A collection of Audio and Speech pre-trained models.
☆192Jul 21, 2020Updated 6 years ago