PyTorch Dataset for Speech and Music audio
☆80Jul 12, 2024Updated last year
Alternatives and similar repositories for AudioLoader
Users that are interested in AudioLoader are comparing it to the libraries listed below
Sorting:
- ☆23Aug 30, 2022Updated 3 years ago
- Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features☆84May 3, 2023Updated 2 years ago
- Tools for Analyzing Popularity and Semantic Diversity of a Playlist Dataset☆10Jun 17, 2024Updated last year
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Apr 11, 2022Updated 3 years ago
- ☆180Oct 24, 2023Updated 2 years ago
- source code for the paper publised in IJCNN 2020 "The Impact of Audio Input Representations on Neural Network based Music Transcription"☆13Apr 9, 2020Updated 5 years ago
- ☆17Jan 20, 2025Updated last year
- Evaluation kit for the HEAR Benchmark☆62Feb 12, 2026Updated 2 weeks ago
- source code of "End-to-end Music Remastering System Using Self-supervised and Adversarial Training"☆47Sep 7, 2023Updated 2 years ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆87Nov 13, 2022Updated 3 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Feb 9, 2021Updated 5 years ago
- ☆17Oct 16, 2018Updated 7 years ago
- experiments about AudioSet☆43Jul 22, 2023Updated 2 years ago
- Audio processing by using pytorch 1D convolution network☆1,117Dec 7, 2025Updated 2 months ago
- Official Implementation of Jointist☆37Jul 26, 2023Updated 2 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- ☆24Mar 24, 2023Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- ☆20Aug 26, 2022Updated 3 years ago
- Frechet Audio Distance evaluation in PyTorch☆36Jun 9, 2023Updated 2 years ago
- A lightweight library for Frechet Audio Distance calculation.☆309Feb 11, 2026Updated 2 weeks ago
- Supplementary code for the experiments described in the 2021 ISMIR submission: Leveraging Hierarchical Structures for Few Shot Musical In…☆41Aug 12, 2022Updated 3 years ago
- Python library for downloading, loading & working with sound datasets☆350Sep 23, 2025Updated 5 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- Deep Performer: Score-to-audio music performance synthesis☆44Jun 26, 2023Updated 2 years ago
- ☆132Jan 6, 2023Updated 3 years ago
- ☆13Jun 2, 2022Updated 3 years ago
- MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage☆49Jun 24, 2025Updated 8 months ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation☆13Aug 2, 2023Updated 2 years ago
- Audio transformations library for PyTorch☆236Apr 19, 2022Updated 3 years ago
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43May 24, 2022Updated 3 years ago
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- ☆251Feb 14, 2024Updated 2 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Source code for the paper 'Audio Captioning Transformer'☆57Jan 18, 2022Updated 4 years ago
- Dataset and baseline for the first Audiocaption task☆79Jul 25, 2024Updated last year