yeyupiaoling/AudioClassification-Pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yeyupiaoling/AudioClassification-Pytorch)

yeyupiaoling / AudioClassification-Pytorch

The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.

☆599

Alternatives and similar repositories for AudioClassification-Pytorch

Users that are interested in AudioClassification-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yeyupiaoling / AudioClassification-PaddlePaddle
View on GitHub
基于PaddlePaddle实现的音频分类，支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型，还有多种预处理方法
☆102Dec 17, 2025Updated 7 months ago
KingH12138 / Pytorch-AudioClassification-master
View on GitHub
A python code based on pytorch applied to AudioClassification
☆48Jul 15, 2022Updated 4 years ago
yeyupiaoling / YeAudio
View on GitHub
Python的音频工具
☆16Dec 5, 2025Updated 7 months ago
yeyupiaoling / VoiceprintRecognition-Pytorch
View on GitHub
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud…
☆1,303Dec 17, 2025Updated 7 months ago
qiuqiangkong / audioset_tagging_cnn
View on GitHub
☆1,765Jul 25, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yeyupiaoling / AudioClassification-Tensorflow
View on GitHub
基于Tensorflow实现声音分类，博客地址：
☆107May 8, 2020Updated 6 years ago
ksanjeevan / crnn-audio-classification
View on GitHub
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
☆391Jun 16, 2021Updated 5 years ago
kadoufall / Urban-Sound-Classification-VS
View on GitHub
城市声音分类 Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN
☆97Apr 15, 2019Updated 7 years ago
yeyupiaoling / SpeechEmotionRecognition-Pytorch
View on GitHub
基于Pytorch实现的语音情感识别
☆269Dec 17, 2025Updated 7 months ago
karolpiczak / ESC-50
View on GitHub
ESC-50: Dataset for Environmental Sound Classification
☆1,850Mar 20, 2024Updated 2 years ago
yeyupiaoling / VoiceprintRecognition-PaddlePaddle
View on GitHub
本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型，同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法
☆318Dec 17, 2025Updated 7 months ago
Friedrich-M / Audio-signal-classification-and-identification
View on GitHub
基于梅尔频谱的信号分类和识别
☆23Mar 31, 2023Updated 3 years ago
kamalesh0406 / Audio-Classification
View on GitHub
Pytorch code for "Rethinking CNN Models for Audio Classification"
☆129Mar 25, 2021Updated 5 years ago
TaoRuijie / ECAPA-TDNN
View on GitHub
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
☆823Apr 11, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
Alibaba-MIIL / AudioClassfication
View on GitHub
☆90May 27, 2023Updated 3 years ago
YuanGongND / ast
View on GitHub
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
☆1,464May 21, 2023Updated 3 years ago
RetroCirce / HTS-Audio-Transformer
View on GitHub
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
☆502Sep 18, 2025Updated 10 months ago
haoheliu / diffres-python
View on GitHub
Learning differentiable temporal resolution on time-series data.
☆36Nov 12, 2022Updated 3 years ago
swagshaw / WildDESED
View on GitHub
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18Nov 19, 2024Updated last year
fschmid56 / EfficientAT
View on GitHub
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353Nov 20, 2024Updated last year
icon-lab / HST
View on GitHub
Official implementation of Hierarchical Spectrogram Transformers (HST)
☆20Oct 10, 2022Updated 3 years ago
THUsatlab / BERT-LID
View on GitHub
Leveraging BERT to Improve Spoken Language Identification
☆17Nov 22, 2022Updated 3 years ago
ZhuPengsen / Method-for-Splitting-the-DeepShip-Dataset
View on GitHub
Method for Splitting the DeepShip Dataset
☆70Nov 21, 2025Updated 8 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ilyassmoummad / scl_icbhi2017
View on GitHub
PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)
☆33Feb 4, 2024Updated 2 years ago
YuanX9 / UATR-CMoE
View on GitHub
The PyTorch code for "Unraveling Complex Data Diversity in Underwater Acoustic Target Recognition through Convolution-based Mixture of Ex…
☆33Mar 5, 2024Updated 2 years ago
qinxiaoyi / Simple-Attention-Module-based-Speaker-Verification-with-Iterative-Noisy-Label-Detection
View on GitHub
☆12Jun 14, 2022Updated 4 years ago
YuanGongND / psla
View on GitHub
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
☆150Jul 13, 2023Updated 3 years ago
akshaypunwatkar / Sound_classification_urbansound8k
View on GitHub
Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features.
☆13Apr 15, 2020Updated 6 years ago
yeyupiaoling / MASR
View on GitHub
Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。
☆727Jul 6, 2026Updated 2 weeks ago
geekysethi / audio_classification
View on GitHub
☆21Mar 8, 2020Updated 6 years ago
harritaylor / torchvggish
View on GitHub
Pytorch port of Google Research's VGGish model used for extracting audio features.
☆410Nov 3, 2021Updated 4 years ago
raymin0223 / patch-mix_contrastive_learning
View on GitHub
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)
☆76Mar 11, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
wenet-e2e / wespeaker
View on GitHub
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
☆1,359Jul 8, 2026Updated 2 weeks ago
SSTC-Challenge / SSTC2024_baseline_system
View on GitHub
☆12Jun 14, 2024Updated 2 years ago
Renovamen / Speech-Emotion-Recognition
View on GitHub
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别
☆1,310Mar 25, 2023Updated 3 years ago
musikalkemist / pytorchforaudio
View on GitHub
Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.
☆279Feb 8, 2026Updated 5 months ago
Anaesthesiaye / sound_event_detection_transformer
View on GitHub
code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)
☆46May 9, 2022Updated 4 years ago
modelscope / 3D-Speaker
View on GitHub
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
☆3,060Dec 8, 2025Updated 7 months ago
kaistmm / Audio-Mamba-AuM
View on GitHub
Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"
☆172Nov 24, 2024Updated last year