Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.org/abs/2104.11587)
☆47Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for ESResNeXt-fbsp
Users that are interested in ESResNeXt-fbsp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for models described in the paper "ESResNet: Environmental Sound Classification Based on Visual Domain Models" (https://arxiv…☆34Jul 6, 2023Updated 2 years ago
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)☆864Sep 30, 2021Updated 4 years ago
- Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion.☆11Jun 22, 2020Updated 5 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆419Aug 14, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆26Jul 11, 2023Updated 2 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆33Feb 4, 2024Updated 2 years ago
- Official github page of Oceanship Dataset☆53Jun 11, 2024Updated last year
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- ☆13Sep 8, 2020Updated 5 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Apr 26, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆150Jul 13, 2023Updated 2 years ago
- PyTorch Dataset for Speech and Music audio☆79Jul 12, 2024Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- keras_multi_target_signal_recognition Underwater single channel acoustic multiple targets recognition using ResNet, DenseNet, and Complex…☆37Apr 1, 2022Updated 4 years ago
- Official implementation of the paper "An Investigation of Preprocessing Filters and Deep Learning Methods for Vessel Type Classification …☆31Apr 2, 2024Updated 2 years ago
- ☆12Jun 1, 2024Updated last year
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆76Nov 11, 2021Updated 4 years ago
- A clone of the official Blender repository☆18Jun 21, 2023Updated 2 years ago
- Open source code for the paper 'Music Source Separation with Generative Flow'☆26Nov 18, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code release for "Time Series Anomaly Detection by Cumulative Radon Features"☆12Feb 8, 2022Updated 4 years ago
- System that ranked 2nd in DCASE 2023 Challenge Task 5: Few-shot Bioacoustic Event Detection☆12Sep 5, 2024Updated last year
- Audio Super Resolution in Python3 with Tensorflow 1.5.0 (ref. https://kuleshov.github.io/audio-super-res/)☆12Jul 10, 2018Updated 7 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated 2 years ago
- Official repository: Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrain…☆43Jul 19, 2023Updated 2 years ago
- ☆37Jul 5, 2024Updated last year
- Handling audio files in Python☆39Apr 15, 2026Updated 2 weeks ago
- We propose a novel approach for reconstructing human expressiveness in piano performance with a multi-layer bi-directional Transformer. (…☆21May 16, 2024Updated last year
- Contrastive Language-Audio Pretraining☆15May 18, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Feb 15, 2026Updated 2 months ago
- A perceptual weighting filter loss for DNN training in speech enhancement☆24Apr 30, 2022Updated 3 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- Repo for the BBCAVS10k distribution☆10Nov 27, 2024Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation☆138Sep 25, 2025Updated 7 months ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆12Nov 26, 2019Updated 6 years ago