Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.org/abs/2104.11587)
☆46Jun 29, 2021Updated 4 years ago
Alternatives and similar repositories for ESResNeXt-fbsp
Users that are interested in ESResNeXt-fbsp are comparing it to the libraries listed below.
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)☆869Sep 30, 2021Updated 4 years ago
- Learning differentiable temporal resolution on time-series data.☆36Nov 12, 2022Updated 3 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆423Aug 14, 2022Updated 3 years ago
- Enumerate expressions with n variables without repetition☆16Jul 11, 2023Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆49Dec 9, 2022Updated 3 years ago
- Official PyTorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆26Jul 11, 2023Updated 2 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆33Feb 4, 2024Updated 2 years ago
- Official GitHub page of Oceanship Dataset☆59Jun 11, 2024Updated last year
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- ☆13Sep 8, 2020Updated 5 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Apr 26, 2023Updated 3 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆150Jul 13, 2023Updated 2 years ago
- PyTorch Dataset for Speech and Music audio☆79Jul 12, 2024Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- Official implementation of the paper "An Investigation of Preprocessing Filters and Deep Learning Methods for Vessel Type Classification …☆31Apr 2, 2024Updated 2 years ago
- ☆12Jun 1, 2024Updated 2 years ago
- Code for Unconditional Audio Generation with GAN and Cycle Regularization☆76Nov 11, 2021Updated 4 years ago
- ☆14May 31, 2023Updated 3 years ago
- Convert electronic money statements to OFX format using PaSoRi☆16Dec 25, 2021Updated 4 years ago
- Open source code for the paper 'Music Source Separation with Generative Flow'☆26Nov 18, 2022Updated 3 years ago
- Code release for "Time Series Anomaly Detection by Cumulative Radon Features"☆12Feb 8, 2022Updated 4 years ago
- System that ranked 2nd in DCASE 2023 Challenge Task 5: Few-shot Bioacoustic Event Detection☆12Sep 5, 2024Updated last year
- Audio Super Resolution in Python3 with Tensorflow 1.5.0 (ref. https://kuleshov.github.io/audio-super-res/)☆12Jul 10, 2018Updated 7 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated 2 years ago
- C# generative music framework☆15Sep 14, 2023Updated 2 years ago
- Official repository: Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrain…☆43Jul 19, 2023Updated 2 years ago
- Handling audio files in Python☆39May 20, 2026Updated 2 weeks ago
- We propose a novel approach for reconstructing human expressiveness in piano performance with a multi-layer bi-directional Transformer. (…☆21May 16, 2024Updated 2 years ago
- Contrastive Language-Audio Pretraining☆15May 18, 2021Updated 5 years ago
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Feb 15, 2026Updated 3 months ago
- A perceptual weighting filter loss for DNN training in speech enhancement☆24Apr 30, 2022Updated 4 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆12Nov 26, 2019Updated 6 years ago
- Repo for the BBCAVS10k distribution☆10Nov 27, 2024Updated last year
- ☆12Nov 5, 2019Updated 6 years ago
- This repo hosts the code and models of "Masked Autoencoders that Listen".☆664Apr 5, 2024Updated 2 years ago