Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.org/abs/2104.11587)
☆47Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for ESResNeXt-fbsp
Users that are interested in ESResNeXt-fbsp are comparing it to the libraries listed below
Sorting:
- Source code for models described in the paper "ESResNet: Environmental Sound Classification Based on Visual Domain Models" (https://arxiv…☆34Jul 6, 2023Updated 2 years ago
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)☆862Sep 30, 2021Updated 4 years ago
- Learning differentiable temporal resolution on time-series data.☆37Nov 12, 2022Updated 3 years ago
- Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion.☆11Jun 22, 2020Updated 5 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆417Aug 14, 2022Updated 3 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆48Dec 9, 2022Updated 3 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆25Jul 11, 2023Updated 2 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆31Feb 4, 2024Updated 2 years ago
- Official github page of Oceanship Dataset☆50Jun 11, 2024Updated last year
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- ☆13Sep 8, 2020Updated 5 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Apr 26, 2023Updated 2 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Jul 13, 2023Updated 2 years ago
- PyTorch Dataset for Speech and Music audio☆80Jul 12, 2024Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- Official implementation of the paper "An Investigation of Preprocessing Filters and Deep Learning Methods for Vessel Type Classification …☆30Apr 2, 2024Updated last year
- keras_multi_target_signal_recognition Underwater single channel acoustic multiple targets recognition using ResNet, DenseNet, and Complex…☆36Apr 1, 2022Updated 3 years ago
- ☆12Jun 1, 2024Updated last year
- パソリを使って電子マネーの明細をOFX形式に変換する☆16Dec 25, 2021Updated 4 years ago
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆77Nov 11, 2021Updated 4 years ago
- ☆14May 31, 2023Updated 2 years ago
- A clone of the official Blender repository☆18Jun 21, 2023Updated 2 years ago
- Open source code for the paper 'Music Source Separation with Generative Flow'☆26Nov 18, 2022Updated 3 years ago
- Code release for "Time Series Anomaly Detection by Cumulative Radon Features"☆12Feb 8, 2022Updated 4 years ago
- System that ranked 2nd in DCASE 2023 Challenge Task 5: Few-shot Bioacoustic Event Detection☆12Sep 5, 2024Updated last year
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated 2 years ago
- Official repository: Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrain…☆43Jul 19, 2023Updated 2 years ago
- ☆38Jul 5, 2024Updated last year
- We propose a novel approach for reconstructing human expressiveness in piano performance with a multi-layer bi-directional Transformer. (…☆20May 16, 2024Updated last year
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Feb 15, 2026Updated last month
- A library built for easier audio self-supervised training, downstream tasks evaluation☆136Sep 25, 2025Updated 5 months ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- A perceptual weighting filter loss for DNN training in speech enhancement☆24Apr 30, 2022Updated 3 years ago
- Repo for the BBCAVS10k distribution☆10Nov 27, 2024Updated last year