Audio processing by using pytorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available on GPU.
☆21Nov 30, 2020Updated 5 years ago
Alternatives and similar repositories for nnAudio2
Users that are interested in nnAudio2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆14Sep 18, 2020Updated 5 years ago
- Python version of http://www.ee.columbia.edu/ln/rosa/matlab/gammatonegram/☆15Oct 15, 2018Updated 7 years ago
- Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.☆11Jun 7, 2022Updated 4 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆24Aug 3, 2023Updated 2 years ago
- PyTorch implementation of the NSGT/sliCQT☆17Nov 10, 2023Updated 2 years ago
- ☆20May 13, 2019Updated 7 years ago
- Audio processing by using pytorch 1D convolution network☆1,127May 21, 2026Updated last month
- Audio samples for the paper 'Phase-aware music super-resolution using generative adversarial networks'☆14May 15, 2020Updated 6 years ago
- ☆14Apr 18, 2019Updated 7 years ago
- Template that combines PyTorch Lightning and Hydra☆16Aug 15, 2023Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆43Oct 13, 2023Updated 2 years ago
- Keras framework for speech enhancement using relativistic GANs☆52Jun 24, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- Tutorial covering Open Source tools for Source Separation.☆15Nov 12, 2021Updated 4 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- ☆21Jul 15, 2024Updated last year
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year
- PyTorch Implementation of SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, a…☆21Feb 20, 2019Updated 7 years ago
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- Active noise controller (ANC) design: a practical primer☆14Jan 8, 2026Updated 5 months ago
- Vocal melody extraction using patch-based CNN☆32Feb 5, 2018Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Mar 19, 2021Updated 5 years ago
- Speaker overlap-aware Neural Diarization☆12Feb 13, 2023Updated 3 years ago
- Code for CVSSP submission to DCASE 2021 Task 6☆36Nov 22, 2022Updated 3 years ago
- Asteroid's filterbanks☆90Jan 12, 2025Updated last year
- BAD-VAE: A VAE framework for unsupervised disentanglement of sequential data☆12May 25, 2022Updated 4 years ago
- ☆55Jun 3, 2020Updated 6 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Aug 20, 2024Updated last year
- ☆75Jan 6, 2020Updated 6 years ago
- Implementation of 1D, 2D, and 3D FFT convolutions in PyTorch. Much faster than direct convolutions for large kernel sizes.☆15May 18, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Equal Loudness Filter☆11Mar 4, 2019Updated 7 years ago
- Guqin performance analysis☆12Aug 31, 2020Updated 5 years ago
- Complete implementation of MusicNet in Pytorch☆12Apr 15, 2020Updated 6 years ago
- spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io☆51Jun 12, 2025Updated last year
- SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019☆18Feb 20, 2019Updated 7 years ago
- Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"☆13Feb 22, 2022Updated 4 years ago
- Learnable STRF, from Riad et al. 2021 JASA☆13Aug 21, 2021Updated 4 years ago