Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available on GPU.
☆20Nov 30, 2020Updated 5 years ago
Alternatives and similar repositories for nnAudio2
Users that are interested in nnAudio2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Sep 18, 2020Updated 5 years ago
- Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/☆15Oct 15, 2018Updated 7 years ago
- Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.☆11Jun 7, 2022Updated 3 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆24Aug 3, 2023Updated 2 years ago
- PyTorch implementation of the NSGT/sliCQT☆17Nov 10, 2023Updated 2 years ago
- ☆20May 13, 2019Updated 6 years ago
- Audio processing by using pytorch 1D convolution network☆1,120Dec 7, 2025Updated 3 months ago
- Audio samples for the paper 'Phase-aware music super-resolution using generative adversarial networks'☆14May 15, 2020Updated 5 years ago
- ☆14Apr 18, 2019Updated 6 years ago
- Template that combines PyTorch Lightning and Hydra☆15Aug 15, 2023Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Oct 13, 2023Updated 2 years ago
- Keras framework for speech enhancement using relativistic GANs☆52Jun 24, 2020Updated 5 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- Tutorial covering Open Source tools for Source Separation.☆15Nov 12, 2021Updated 4 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- ☆21Jul 15, 2024Updated last year
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year
- PyTorch Implementation of SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, a…☆22Feb 20, 2019Updated 7 years ago
- Vocal melody extraction using patch-based CNN☆32Feb 5, 2018Updated 8 years ago
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- Active noise controller (ANC) design: a practical primer☆13Jan 8, 2026Updated 2 months ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Mar 19, 2021Updated 5 years ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- ☆54Jun 3, 2020Updated 5 years ago
- Asteroid's filterbanks☆88Jan 12, 2025Updated last year
- Code for CVSSP submission to DCASE 2021 Task 6☆36Nov 22, 2022Updated 3 years ago
- BAD-VAE: A VAE framework for unsupervised disentanglement of sequential data☆12May 25, 2022Updated 3 years ago
- Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.☆14Jul 9, 2023Updated 2 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Aug 20, 2024Updated last year
- ☆75Jan 6, 2020Updated 6 years ago
- Equal Loudness Filter☆11Mar 4, 2019Updated 7 years ago
- Guqin performance analysis☆12Aug 31, 2020Updated 5 years ago
- Complete implementation of MusicNet in Pytorch☆12Apr 15, 2020Updated 5 years ago
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆51Jun 12, 2025Updated 9 months ago
- SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019☆18Feb 20, 2019Updated 7 years ago
- Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"☆13Feb 22, 2022Updated 4 years ago
- Learnable STRF, from Riad et al. 2021 JASA☆13Aug 21, 2021Updated 4 years ago