This sample includes simeple CNN classifier for music and audio-folder dataloader just like ImageFolder in torchvision.
☆11Oct 30, 2018Updated 7 years ago
Alternatives and similar repositories for AudioFolder-Dataloader-PyTorch
Users that are interested in AudioFolder-Dataloader-PyTorch are comparing it to the libraries listed below
Sorting:
- My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.☆12Nov 12, 2022Updated 3 years ago
- ☆18May 28, 2025Updated 9 months ago
- Image Manipulation Detection and Localization☆10Aug 10, 2023Updated 2 years ago
- Pytorch implementation of (2+1)D spatiotemporal convolutions☆12Sep 13, 2018Updated 7 years ago
- TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition☆26Feb 5, 2026Updated last month
- An implementation of Neural Style Transfer for Audio using Pytorch.☆10Dec 14, 2017Updated 8 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath☆13Mar 18, 2022Updated 4 years ago
- ☆12May 30, 2023Updated 2 years ago
- This repo is for action recognition using Kinetics dataset with pytorch☆11Aug 5, 2019Updated 6 years ago
- Learning differentiable temporal resolution on time-series data.☆37Nov 12, 2022Updated 3 years ago
- lazy_dataset: Process large datasets as if it was an iterable.☆18Dec 1, 2025Updated 3 months ago
- converting the pretrained tensorflow SoundNet model to pytorch☆14Jun 15, 2022Updated 3 years ago
- ☆15Sep 4, 2020Updated 5 years ago
- This is the implementation of our paper: Conditional Prior Networks for Optical Flow☆20Jul 15, 2019Updated 6 years ago
- Fast-Slow Recurrent Neural Networks☆14Jan 31, 2018Updated 8 years ago
- RAZR – Room acoustics simulator for Mathwork’s MATLAB☆19Dec 13, 2017Updated 8 years ago
- EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 w/ some my ways :)☆15Oct 4, 2019Updated 6 years ago
- Code release for Your “An Erudite Fine-Grained Visual Classification Model (CVPR 2023)"☆17Jun 2, 2023Updated 2 years ago
- ☆19Mar 10, 2023Updated 3 years ago
- Browser-native machine learning app using ONNX Runtime Web☆28Aug 16, 2021Updated 4 years ago
- Capture, scan, and explore your surroundings in 3D with just your iPhone or iPad!☆40Apr 29, 2025Updated 10 months ago
- Octave Convolution Implementation in PyTorch☆19Jul 6, 2023Updated 2 years ago
- Torch video loading library with support for GPU decoding☆18Dec 8, 2021Updated 4 years ago
- Adapting a ConvNeXt model to audio classification on AudioSet☆27Feb 19, 2025Updated last year
- Style transfer using WCT transforms☆18Nov 5, 2019Updated 6 years ago
- Pytorch implementation of "Compact Global Descriptor for Neural Networks" (CGD).☆25Jan 9, 2025Updated last year
- ☆19Oct 1, 2024Updated last year
- Package for deploying deep learning models from TAO Toolkit☆24Dec 5, 2025Updated 3 months ago
- Official PyTorch implementation of PyramidTabNet: Transformer-based Table Recognition in Image-based Documents☆28Oct 5, 2024Updated last year
- Pytorch implement of the paper "VLDeformer: Vision Language Decomposed Transformer for Fast Cross-modal Retrieval", KBS 2022☆21Sep 18, 2022Updated 3 years ago
- MIST: Multiple Instance Spatial Transformer☆25Aug 24, 2021Updated 4 years ago
- A PyTorch implementation of "SlowFast Networks for Video Recognition"☆22Aug 20, 2019Updated 6 years ago
- This is a pytorch version of Realtime_Multi-Person_Pose_Estimation, origin code is here https://github.com/ZheC/Realtime_Multi-Person_Pos…☆20Apr 8, 2017Updated 8 years ago
- [CVPR 2026] This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Ref…☆98Feb 28, 2026Updated 2 weeks ago
- An adversarial autoencoder implementation in pytorch☆86May 9, 2019Updated 6 years ago
- ONNX deployment of darknet (YOLOv3/YOLOv4)☆26Mar 17, 2023Updated 3 years ago
- The repository to contain codes and models for paper "Two-stream Flow-guided Convolutional Attention Networks for Action Recognition".☆22Feb 4, 2018Updated 8 years ago
- Code for the DataPipes article☆15Jun 14, 2022Updated 3 years ago