tcvrick/audioset-vggish-tensorflow-to-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tcvrick/audioset-vggish-tensorflow-to-pytorch)

tcvrick / audioset-vggish-tensorflow-to-pytorch

Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.

☆85

Alternatives and similar repositories for audioset-vggish-tensorflow-to-pytorch

Users that are interested in audioset-vggish-tensorflow-to-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

harritaylor / torchvggish
View on GitHub
Pytorch port of Google Research's VGGish model used for extracting audio features.
☆410Nov 3, 2021Updated 4 years ago
JMGaljaard / VGGish-pytorch
View on GitHub
☆16Jun 17, 2021Updated 5 years ago
azarmehri / lung-sound-vggish
View on GitHub
Implementation of IEEE Access paper - Lung Sound Recognition Algorithm Based on VGGish-BiGRU
☆30Feb 1, 2020Updated 6 years ago
JennyXieJiayi / HMMVED
View on GitHub
The implementation of HMMVED.
☆18Jul 20, 2022Updated 4 years ago
kingback2019 / Speech_MFCC_GFCC_Python
View on GitHub
求取语音的MFCC参数和GFCC参数，可用于语音信号特征提取
☆10Jul 19, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
DTaoo / Discriminative-Sounding-Objects-Localization
View on GitHub
Code for Discriminative Sounding Objects Localization (NeurIPS 2020)
☆61Jan 19, 2022Updated 4 years ago
wheidima / MDN
View on GitHub
☆12Feb 23, 2021Updated 5 years ago
icon-lab / HST
View on GitHub
Official implementation of Hierarchical Spectrogram Transformers (HST)
☆20Oct 10, 2022Updated 3 years ago
lixiangucas01 / GLAM
View on GitHub
This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…
☆49Apr 11, 2022Updated 4 years ago
YJango / speech-emotion-recognition-exercise
View on GitHub
2018年7⽉30⽇-8⽉13⽇持续2周的AI训练营中语⾳情感识别营的项目报告。
☆96Sep 10, 2018Updated 7 years ago
zhongzhh8 / Video-classification-with-knowledge-distillation
View on GitHub
Video classification, youtube8m, Knowledge distillation, Tensorflow, NeXtVLAD
☆26Sep 5, 2019Updated 6 years ago
facebookresearch / AVID-CMA
View on GitHub
Audio Visual Instance Discrimination with Cross-Modal Agreement
☆133Aug 13, 2021Updated 4 years ago
JuanFMontesinos / Solos
View on GitHub
Solos: A Dataset for Audio-Visual Music Analysis
☆24Feb 17, 2023Updated 3 years ago
furkanyesiler / move
View on GitHub
PyTorch code for training and evaluating MOVE, musically-motivated version embeddings
☆50Jul 6, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
DTaoo / Simplified_DMC
View on GitHub
A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)
☆19May 27, 2020Updated 6 years ago
midas-research / speechmix
View on GitHub
☆12Oct 2, 2020Updated 5 years ago
hohsiangwu / rethinking-visual-sound-localization
View on GitHub
Official implementation of the paper How to Listen? Rethinking Visual Sound Localization
☆18Apr 25, 2022Updated 4 years ago
dharwath / DAVEnet-pytorch
View on GitHub
Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch
☆66Aug 31, 2018Updated 7 years ago
hbredin / DomainAdversarialVoiceActivityDetection
View on GitHub
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆23Mar 3, 2020Updated 6 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
suzuki256 / dog-dataset
View on GitHub
☆47Jul 15, 2022Updated 4 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
ilaria-manco / music-audio-tagging-pytorch
View on GitHub
A PyTorch implementation of the musicnn model for music audio tagging
☆37Jul 25, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ardasnck / learning_to_localize_sound_source
View on GitHub
Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes
☆102Dec 4, 2024Updated last year
caillonantoine / NIME_workshop
View on GitHub
☆14Sep 21, 2022Updated 3 years ago
mdx-tutorial / mdx-tutorial.github.io
View on GitHub
Tutorial covering Open Source tools for Source Separation.
☆15Nov 12, 2021Updated 4 years ago
qiuqiangkong / audioset_source_separation
View on GitHub
☆17Feb 14, 2020Updated 6 years ago
eloimoliner / unconditional-diff-STFT
View on GitHub
Unconditional music synthesis using a diffusion model in the STFT domain
☆12May 31, 2022Updated 4 years ago
hearbenchmark / hear-baseline
View on GitHub
Simple baseline model for the HEAR benchmark
☆23Feb 17, 2026Updated 5 months ago
ws-choi / Conditioned-Source-Separation-LaSAFT
View on GitHub
A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…
☆87Nov 13, 2022Updated 3 years ago
lessonxmk / head_fusion
View on GitHub
☆18May 7, 2020Updated 6 years ago
djmoffat / pyCompressor
View on GitHub
A python implementation of a traditional Dynamic Range Compressor
☆14Oct 30, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
paarthneekhara / advoc
View on GitHub
Vocode spectrograms to audio with generative adversarial networks
☆64Aug 8, 2019Updated 6 years ago
ta603 / RefinPaint
View on GitHub
☆12Jul 5, 2024Updated 2 years ago
jdasam / traeumerAI
View on GitHub
Repository of TräumerAI, based on PyTorch implementation of StyleGAN 2
☆31Aug 1, 2021Updated 4 years ago
aframires / TIVlib
View on GitHub
TIVlib is an open-source library for the content-based tonal description of musical audio signals.
☆55Sep 17, 2024Updated last year
pomonam / AttentionCluster
View on GitHub
TensorFlow Implementation of "Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification".
☆41Sep 12, 2018Updated 7 years ago
sucv / ABAW2
View on GitHub
☆15Sep 24, 2021Updated 4 years ago
keunwoochoi / music4all_contrib
View on GitHub
☆32Dec 29, 2020Updated 5 years ago