Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.
☆276 Feb 8, 2026 Updated last month
Alternatives and similar repositories for pytorchforaudio
Users who are interested in pytorchforaudio are comparing it to the repositories listed below.
- Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning" ☆1,319 Feb 8, 2026 Updated last month
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel. ☆38 Jan 13, 2022 Updated 4 years ago
- Acoustic Scene Classification using transfer learning on VGGish pre-trained model ☆11 Jan 3, 2018 Updated 8 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials. ☆180 Updated this week
- Pytorch code for "Rethinking CNN Models for Audio Classification" ☆129 Mar 25, 2021 Updated 4 years ago
- Code and slides for the "Generating Sound with Neural Network" series on The Sound of AI Youtube channel. ☆180 Apr 26, 2021 Updated 4 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer". ☆1,426 May 21, 2023 Updated 2 years ago
- Audio preprocessing framework for Deep Learning audio applications ☆129 Jan 14, 2023 Updated 3 years ago
- Resources for the Generative Music AI Course on The Sound of AI YouTube channel. ☆220 Jan 24, 2026 Updated last month
- ☆17 Jul 28, 2022 Updated 3 years ago
- ☆111 Jul 12, 2020 Updated 5 years ago
- TensorRT In Docker ☆11 Dec 7, 2024 Updated last year
- An automatic sample identification (ASID) system using a contrastively trained GNN encoder. ☆13 Sep 21, 2025 Updated 5 months ago
- Sound classifier tutorials/examples in PyTorch ☆65 May 5, 2022 Updated 3 years ago
- ☆21 Mar 8, 2020 Updated 6 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model ☆26 Apr 26, 2023 Updated 2 years ago
- ☆12 Jun 9, 2025 Updated 9 months ago
- [WACV2023] This is the official PyTorch implementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt… ☆12 Feb 24, 2023 Updated 3 years ago
- Deep Semi-Supervised Learning with Holistic methods for audio classification. ☆11 Dec 14, 2024 Updated last year
- The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a vari… ☆580 Dec 17, 2025 Updated 2 months ago
- Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals. ☆28 Sep 13, 2025 Updated 5 months ago
- ☆25 Jul 25, 2024 Updated last year
- Pytorch Implementation of wavegan model to generate audio ☆174 Oct 7, 2020 Updated 5 years ago
- ☆48 Aug 30, 2024 Updated last year
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch ☆392 Jun 16, 2021 Updated 4 years ago
- Reproducible research code for the experiments presented in our article "Kara1k: a karaoke dataset for cover song identification and sing… ☆10 Jan 9, 2018 Updated 8 years ago
- Classifies percussion audio samples with a CNN-LSTM, written in python and pytorch. Also exports to Drumkv1 (lv2 plugin) ☆14 Aug 20, 2020 Updated 5 years ago
- ☆14 Jul 15, 2022 Updated 3 years ago
- Code for YouTube series: Deep Learning for Audio Classification ☆585 Feb 6, 2023 Updated 3 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer". ☆414 Aug 14, 2022 Updated 3 years ago
- Implementation of the multi-time-scale convolution layer used in the paper Multi-Time-Scale Convolution for Emotion Recognition from Spee… ☆11 Oct 22, 2019 Updated 6 years ago
- Exercise for the textbook Data Structures and Algorithm Analysis in C++ ☆15 Jun 4, 2021 Updated 4 years ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs). ☆11 May 24, 2024 Updated last year
- ☆1,674 Jul 25, 2024 Updated last year
- MIDI, WAV domain music emotion recognition [ISMIR 2021] ☆88 Oct 29, 2021 Updated 4 years ago
- Python library for downloading, loading & working with sound datasets ☆350 Sep 23, 2025 Updated 5 months ago
- Pixel VQ-VAEs for Improved Pixel Art Representation ☆17 Feb 11, 2023 Updated 3 years ago
- cross-modal model between audio(MFCC) and text(KoBERT) ☆12 Jan 14, 2021 Updated 5 years ago
- A web app for annotating Freesound loops, and the tools to analyse the dataset created. ☆20 Jul 6, 2023 Updated 2 years ago