converting the pretrained tensorflow SoundNet model to pytorch
☆14Jun 15, 2022Updated 3 years ago
Alternatives and similar repositories for SoundNet_Pytorch
Users that are interested in SoundNet_Pytorch are comparing it to the libraries listed below
Sorting:
- ☆30Feb 21, 2019Updated 7 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Proximal Asynchronous SAGA☆13Nov 30, 2017Updated 8 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- Learn and L3 embedding from audio/video pairs☆89Apr 24, 2022Updated 3 years ago
- Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses☆15Mar 18, 2020Updated 5 years ago
- Graph Convolutional Module for Temporal Action Localization in Videos☆10Jul 4, 2020Updated 5 years ago
- ☆10Nov 10, 2021Updated 4 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- ☆12Jul 18, 2018Updated 7 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- A fine multimodality fusion network :)☆11Aug 9, 2021Updated 4 years ago
- ☆13Feb 8, 2017Updated 9 years ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging☆11Dec 2, 2021Updated 4 years ago
- This sample includes simeple CNN classifier for music and audio-folder dataloader just like ImageFolder in torchvision.☆11Oct 30, 2018Updated 7 years ago
- ☆12Mar 3, 2025Updated 11 months ago
- Multimodal Affective Analysis Using Hierarchical Attention Strategy☆12Dec 7, 2018Updated 7 years ago
- The material is covered in my YouTube playlist "Data Wrangling with Python" available on YUNIKARN.☆15Dec 9, 2025Updated 2 months ago
- A Tensorflow implementation of Speech Emotion Recognition using Audio signals and Text Data☆12May 16, 2022Updated 3 years ago
- ☆11May 18, 2022Updated 3 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model☆11Jan 3, 2018Updated 8 years ago
- Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.☆13Dec 18, 2021Updated 4 years ago
- 中文文本近似计算☆13Jan 22, 2019Updated 7 years ago
- Collection of ML scripts for tabular datasets☆14Mar 29, 2023Updated 2 years ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆17Dec 20, 2022Updated 3 years ago
- Facebook Hatebook Memes Challenge☆12Jan 28, 2021Updated 5 years ago
- This repo is for action recognition using Kinetics dataset with pytorch☆11Aug 5, 2019Updated 6 years ago
- ☆16Apr 27, 2025Updated 10 months ago
- Source code of ICLR2020 submisstion: Zeno++: Robust Fully Asynchronous SGD☆14Feb 2, 2020Updated 6 years ago
- Multimodal emotion recognition system of attention based vision network + audio network☆14Jul 21, 2020Updated 5 years ago
- Chapter 9: Attention and Memory Augmented Networks☆13Jul 23, 2019Updated 6 years ago
- Implementation of the multi-time-scale convolution layer used in the paper Multi-Time-Scale Convolution for Emotion Recognition from Spee…☆11Oct 22, 2019Updated 6 years ago
- Bank Marketing data classification☆12Oct 2, 2020Updated 5 years ago
- ☆14Nov 11, 2025Updated 3 months ago
- PyTorch Implementation of paper "Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification"☆11Mar 28, 2020Updated 5 years ago
- ☆11Nov 18, 2021Updated 4 years ago
- cross-modal model between audio(MFCC) and text(KoBERT)☆12Jan 14, 2021Updated 5 years ago
- cnn bilstm crf 作中文命名实体识别☆13Sep 25, 2020Updated 5 years ago