Alibaba-MIIL/AudioClassfication

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Alibaba-MIIL/AudioClassfication)

Alibaba-MIIL / AudioClassfication

☆90

Alternatives and similar repositories for AudioClassfication

Users that are interested in AudioClassfication are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fschmid56 / EfficientAT
View on GitHub
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353Nov 20, 2024Updated last year
RetroCirce / HTS-Audio-Transformer
View on GitHub
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
☆502Sep 18, 2025Updated 10 months ago
DataSenseiAryan / GoogleSpeechCommandLowFootprint
View on GitHub
This repository contains the Code for SOTA model on Google Speech Command V2 dataset.
☆16Sep 28, 2023Updated 2 years ago
haoheliu / diffres-python
View on GitHub
Learning differentiable temporal resolution on time-series data.
☆36Nov 12, 2022Updated 3 years ago
cdezapasquale / transfomer-audio-classification
View on GitHub
small experimentation about positional encoding
☆20Feb 9, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AI-Research-BD / Keyword-MLP
View on GitHub
Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.
☆15Nov 5, 2022Updated 3 years ago
nttcslab / m2d
View on GitHub
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
☆162Feb 23, 2026Updated 4 months ago
AndreyGuzhov / ESResNet
View on GitHub
Source code for models described in the paper "ESResNet: Environmental Sound Classification Based on Visual Domain Models" (https://arxiv…
☆33Jul 6, 2023Updated 3 years ago
kkoutini / passt_hear21
View on GitHub
Inference code for PaSST, using the HEAR API.
☆35Jan 2, 2024Updated 2 years ago
JNAIC / PIPMN
View on GitHub
PIPMN
☆22Oct 10, 2024Updated last year
alireza-nasiri / SoundCLR
View on GitHub
Implementation for "SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification," in pytorch.
☆29Jan 18, 2024Updated 2 years ago
YuanGongND / ast
View on GitHub
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
☆1,464May 21, 2023Updated 3 years ago
vincenzodentamaro / aucoresnet
View on GitHub
AUCO ResNet: an end-to-end network for Covid-19 pre-screening from cough and breath
☆13Mar 18, 2022Updated 4 years ago
RicherMans / CED
View on GitHub
Source code for Consistent ensemble distillation for audio tagging
☆75Mar 20, 2026Updated 4 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
denfed / wave-spec-fusion
View on GitHub
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…
☆16Aug 9, 2021Updated 4 years ago
jfainberg / sincnet_adapt
View on GitHub
Raw waveform adaptation with SincNet
☆12Mar 19, 2024Updated 2 years ago
karolpiczak / ESC-50
View on GitHub
ESC-50: Dataset for Environmental Sound Classification
☆1,850Mar 20, 2024Updated 2 years ago
roman-vygon / triplet_loss_kws
View on GitHub
Learning Efficient Representations for Keyword Spotting with Triplet Loss
☆115Sep 14, 2022Updated 3 years ago
jamesdeep / VitalSign
View on GitHub
☆15Jul 24, 2021Updated 4 years ago
Hadryan / TFNet-for-Environmental-Sound-Classification
View on GitHub
Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (…
☆31Dec 19, 2019Updated 6 years ago
WangHelin1997 / DCASE-2020-Task1A-Code
View on GitHub
A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.
☆20Dec 12, 2020Updated 5 years ago
vinceasvp / meta-sc
View on GitHub
☆11May 30, 2023Updated 3 years ago
Sara-Ahmed / ASiT
View on GitHub
ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
☆30Mar 10, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
kkoutini / PaSST
View on GitHub
Efficient Training of Audio Transformers with Patchout
☆386Jan 12, 2024Updated 2 years ago
AmirmohammadRostami / KeywordsSpotting-EfficientNet-A0
View on GitHub
EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting
☆23Jun 16, 2022Updated 4 years ago
nttcslab / eval-audio-repr
View on GitHub
EVAR ~ Evaluation package for Audio Representations
☆81Feb 19, 2026Updated 5 months ago
YuanGongND / psla
View on GitHub
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
☆150Jul 13, 2023Updated 3 years ago
WangHelin1997 / MaskSpec
View on GitHub
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
☆51Dec 17, 2024Updated last year
wangyu / rethink-audio-fsl
View on GitHub
Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)
☆43May 24, 2022Updated 4 years ago
leibniz-future-lab / SelfDistill-SER
View on GitHub
☆18Apr 28, 2023Updated 3 years ago
yeyupiaoling / AudioClassification-Pytorch
View on GitHub
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a vari…
☆599Dec 17, 2025Updated 7 months ago
YuanGongND / vocalsound
View on GitHub
Dataset and baseline code for the VocalSound dataset (ICASSP2022).
☆165Nov 12, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mispchallenge / misp2021_baseline
View on GitHub
☆29Jun 15, 2022Updated 4 years ago
ictnlp / CMOT
View on GitHub
Code for ACL 2023 main conference paper "CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation"
☆17Oct 29, 2024Updated last year
moharamfatema / heartbeat-sounds
View on GitHub
Heart Sound Segmentation And Classification | Kaggle Competition
☆16Jan 25, 2023Updated 3 years ago
CPJKU / cpjku_dcase22
View on GitHub
☆19Jul 15, 2022Updated 4 years ago
Gvith / Heart-Sound-Classification
View on GitHub
☆21Sep 30, 2017Updated 8 years ago
Benjamin-Walker / heart-murmur-detection
View on GitHub
Dual Bayesian ResNet: A Deep Learning Approach to Heart Murmur Detection (Physionet Challenge 2022)
☆23Oct 1, 2025Updated 9 months ago
nttcslab / byol-a
View on GitHub
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
☆237Apr 26, 2023Updated 3 years ago