Utils and data sets for audio and PyTorch
☆86Dec 30, 2021Updated 4 years ago
Alternatives and similar repositories for audtorch
Users that are interested in audtorch are comparing it to the libraries listed below
Sorting:
- A test bed for updates and new features | pytorch/audio☆171May 17, 2020Updated 5 years ago
- Handling audio files in Python☆39Feb 12, 2026Updated 2 weeks ago
- A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks.☆23Feb 23, 2026Updated last week
- Supplemental material for the paper "Towards Automatically Correcting Tapped Beat Annotations for Music Recordings"☆20May 6, 2021Updated 4 years ago
- ☆18May 15, 2021Updated 4 years ago
- Format to store media files and annotations☆12Feb 23, 2026Updated last week
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 10 years ago
- Python wrapper for Sinsy☆53Oct 9, 2023Updated 2 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorch☆88Jul 25, 2024Updated last year
- Paderbox: A collection of utilities for audio / speech processing☆43Jul 21, 2025Updated 7 months ago
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Aug 20, 2024Updated last year
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆80Jul 1, 2022Updated 3 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- PyProf2: PyTorch Profiling tool☆82Jun 25, 2020Updated 5 years ago
- Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020☆16Oct 20, 2020Updated 5 years ago
- SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours☆28May 23, 2025Updated 9 months ago
- Automatic speech recognition using neural networks☆18Nov 21, 2020Updated 5 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Apr 8, 2024Updated last year
- This repository is for an implementation of the accepted paper "Sketching the Expression: Flexible Rendering of Expressive Piano Performa…☆22Dec 15, 2022Updated 3 years ago
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆21Jul 4, 2021Updated 4 years ago
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- "Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks", ICASSP 2022☆106Nov 7, 2025Updated 3 months ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- C++ library with accompanying toolkit for music information retrieval☆16Apr 21, 2022Updated 3 years ago
- Read and write HTK and HTS files from python.☆20Mar 17, 2015Updated 10 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Wave-U-Net for automatic (drum) mixing☆38Mar 24, 2023Updated 2 years ago
- Source code repository for the SMC paper "Musical Tempo and Key Estimation using Convolutional Neural Networks with Directional Filters".☆34Mar 24, 2023Updated 2 years ago
- A pytorch implementation of FFTNet.☆37Aug 31, 2018Updated 7 years ago
- ☆54Jun 3, 2020Updated 5 years ago
- ☆231Feb 9, 2020Updated 6 years ago
- Audio processing by using pytorch 1D convolution network☆1,117Dec 7, 2025Updated 2 months ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- Room acoustic simulator with a SOFA file loader.☆23Sep 27, 2024Updated last year
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆38Dec 16, 2024Updated last year
- Resources for "Simple Speech Representation Learning from Perceptual Data".