AndreyGuzhov/ESResNeXt-fbsp

Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.org/abs/2104.11587)

☆46

Alternatives and similar repositories for ESResNeXt-fbsp

Users interested in ESResNeXt-fbsp are comparing it to the repositories listed below.

AndreyGuzhov / AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
☆872 · Sep 30, 2021 · Updated 4 years ago
haoheliu / diffres-python
Learning differentiable temporal resolution on time-series data.
☆36 · Nov 12, 2022 · Updated 3 years ago
haoheliu / DCASE_2022_Task_5
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28 · Jul 6, 2022 · Updated 4 years ago
adrianbarahona / conditional_wavegan_knocking_sounds
Keras implementation of conditional waveGAN. Application to knocking sound effects with emotion.
☆10 · Jun 22, 2020 · Updated 6 years ago
YuanGongND / ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
☆426 · Aug 14, 2022 · Updated 3 years ago
CPJKU / EfficientLEAF
Official implementation of EfficientLEAF, a learnable audio frontend.
☆50 · Dec 9, 2022 · Updated 3 years ago
MaigoAkisame / enumerate-expressions
Enumerate expressions with n variables without repetition
☆16 · Jul 11, 2023 · Updated 3 years ago
swagshaw / ASC-CL
Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14 · Jul 19, 2022 · Updated 4 years ago
pritamqu / CrissCross
[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity
☆26 · Jul 11, 2023 · Updated 3 years ago
biboamy / music-repro
☆17 · Nov 7, 2023 · Updated 2 years ago
ilyassmoummad / scl_icbhi2017
PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)
☆33 · Feb 4, 2024 · Updated 2 years ago
nttcslab / composing-general-audio-repr
Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model
☆26 · Apr 26, 2023 · Updated 3 years ago
YuanGongND / psla
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
☆150 · Jul 13, 2023 · Updated 3 years ago
QinggangSUN / keras_multiple_target_recognition
keras_multi_target_signal_recognition Underwater single channel acoustic multiple targets recognition using ResNet, DenseNet, and Complex…
☆37 · Apr 1, 2022 · Updated 4 years ago
Anynoumsiccv9970 / G2P-DDM
☆14 · May 31, 2023 · Updated 3 years ago
diggerdu / AudioMamba
☆12 · Jun 1, 2024 · Updated 2 years ago
ilyassmoummad / dcase23_task5_scl
System that ranked 2nd in DCASE 2023 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆12 · Sep 5, 2024 · Updated last year
mohaimenz / acdnet
Official repository: Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrain…
☆43 · Jul 19, 2023 · Updated 3 years ago
Bai-YT / ConsistencyTTA
ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
☆39 · Nov 20, 2024 · Updated last year
team178 / team178.github.io
The 2nd Law Enforcers' Website
☆14 · Jan 28, 2024 · Updated 2 years ago
tangjjbetsy / RHEPP-Transformer
We propose a novel approach for reconstructing human expressiveness in piano performance with a multi-layer bi-directional Transformer. (…
☆21 · May 16, 2024 · Updated 2 years ago
ShaheenPerveen / speech-emotion-recognition-iemocap
Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …
☆41 · Mar 7, 2024 · Updated 2 years ago
lucidrains / CLAP
Contrastive Language-Audio Pretraining
☆15 · May 18, 2021 · Updated 5 years ago
Sytronik / deep-griffinlim-iteration
PyTorch implementation for Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971)
☆39 · Oct 12, 2019 · Updated 6 years ago
hearbenchmark / hear2021-submitted-models
Open-source audio embedding models, submitted to the HEAR 2021 challenge
☆11 · Feb 15, 2026 · Updated 5 months ago
ChandlerGuan / Transkimmer
Code for ACL2022 publication Transkimmer: Transformer Learns to Layer-wise Skim
☆22 · Aug 21, 2022 · Updated 3 years ago
edufonseca / uclser20
Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.
☆93 · Dec 22, 2022 · Updated 3 years ago
ljuvela / multiscale-GAN
Code for ICASSP 2019 paper
☆18 · Oct 29, 2018 · Updated 7 years ago
zhengzx-nlp / MGNMT
☆15 · Oct 19, 2021 · Updated 4 years ago
EricLee8 / SPACE
The official codes for our paper at COLING 2022: Semantic-Preserving Adversarial Code Comprehension
☆12 · Oct 23, 2022 · Updated 3 years ago
bbc / dsrp_bbcavs10k_distribution
Repo for the BBCAVS10k distribution
☆10 · Nov 27, 2024 · Updated last year
haoheliu / ontology-aware-audio-tagging
☆14 · Nov 22, 2022 · Updated 3 years ago
ZET-Speech / ZET-Speech-Demo
ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)
☆10 · Mar 9, 2024 · Updated 2 years ago
MGitHubL / TMac
☆14 · Feb 26, 2024 · Updated 2 years ago
AHruler / Exploring-AAD
Exploring possible methods for Audio Anomaly Detection - on machine sounds (MIMII dataset)
☆19 · Sep 12, 2025 · Updated 10 months ago
ciaua / unagan
Code for Unconditional Audio Generation with GAN and Cycle Regularization
☆76 · Nov 11, 2021 · Updated 4 years ago
Neclow / SERAB
SERAB: a multi-lingual benchmark for speech emotion recognition
☆28 · Dec 16, 2022 · Updated 3 years ago
hvy / chainer-faster-rcnn
☆10 · Apr 22, 2016 · Updated 10 years ago
sainathadapa / dcase2019-task5-urban-sound-tagging
1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging
☆30 · Mar 19, 2021 · Updated 5 years ago