Converting the pretrained TensorFlow SoundNet model to PyTorch
☆14Jun 15, 2022Updated 3 years ago
Alternatives and similar repositories for SoundNet_Pytorch
Users that are interested in SoundNet_Pytorch are comparing it to the libraries listed below
- For easier and more readable TensorFlow code☆13Sep 1, 2019Updated 6 years ago
- ☆18Feb 21, 2019Updated 7 years ago
- This sample includes a simple CNN classifier for music and an audio-folder dataloader, just like ImageFolder in torchvision.☆11Oct 30, 2018Updated 7 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- Graph Convolutional Module for Temporal Action Localization in Videos☆10Jul 4, 2020Updated 5 years ago
- Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering☆18Oct 30, 2019Updated 6 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- an R package to perform synchronization analysis on motion energy time-series☆16Dec 13, 2024Updated last year
- Octave Convolution Implementation in PyTorch☆19Jul 6, 2023Updated 2 years ago
- This repository provides the dataset introduced by our WSSTG paper☆13Jul 21, 2019Updated 6 years ago
- Bank Marketing data classification☆12Oct 2, 2020Updated 5 years ago
- Extract video features. Currently the models include I3D; the repository will be continuously updated.☆12Jun 4, 2020Updated 5 years ago
- TensorFlow implementation of "SoundNet".☆145Mar 26, 2018Updated 7 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Implementation of our PR 2020 paper: Unsupervised Text-to-Image Synthesis☆13Jul 9, 2020Updated 5 years ago
- Chinese text similarity computation☆13Jan 22, 2019Updated 7 years ago
- Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses☆15Mar 18, 2020Updated 6 years ago
- Torch video loading library with support for GPU decoding☆18Dec 8, 2021Updated 4 years ago
- ☆13Mar 25, 2021Updated 4 years ago
- Style transfer using WCT transforms☆18Nov 5, 2019Updated 6 years ago
- A series of models applying memory augmented neural networks to machine translation☆15May 3, 2018Updated 7 years ago
- ☆16Apr 27, 2025Updated 10 months ago
- Memory Augmented Neural Networks (Pytorch)☆14Sep 2, 2018Updated 7 years ago
- Pytorch implementation of "Compact Global Descriptor for Neural Networks" (CGD).☆25Jan 9, 2025Updated last year
- The official repository of paper "Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark"☆20Jun 20, 2025Updated 9 months ago
- ☆11May 18, 2022Updated 3 years ago
- A Transformer implementation that follows The Annotated Transformer, with detailed explanations added; suitable for beginners☆11Feb 2, 2020Updated 6 years ago
- PyTorch implementation of Memory Augmented Neural Network☆10Jun 27, 2020Updated 5 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging☆11Dec 2, 2021Updated 4 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model☆11Jan 3, 2018Updated 8 years ago
- Video object detection related papers after 2017 and my reading notes.☆16Oct 24, 2018Updated 7 years ago
- ☆18Jun 20, 2020Updated 5 years ago
- Inference code for PaSST, using the HEAR API.☆32Jan 2, 2024Updated 2 years ago
- A PyTorch implementation of "SlowFast Networks for Video Recognition"☆22Aug 20, 2019Updated 6 years ago
- LAEO-Net++☆21Mar 24, 2021Updated 4 years ago
- ☆20Sep 28, 2020Updated 5 years ago
- Tensorflow Reproduction of the EMNLP-2018 paper "Temporally Grounding Natural Sentence in Video"☆17Nov 21, 2022Updated 3 years ago