A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, along with a variety of preprocessing methods.
☆586 · Dec 17, 2025 · Updated 3 months ago
Alternatives and similar repositories for AudioClassification-Pytorch
Users interested in AudioClassification-Pytorch are comparing it to the libraries listed below.
- Audio classification implemented in PaddlePaddle, supporting EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, along with a variety of preprocessing methods ☆101 · Dec 17, 2025 · Updated 3 months ago
- Python code based on PyTorch for audio classification ☆48 · Jul 15, 2022 · Updated 3 years ago
- Audio tools for Python ☆16 · Dec 5, 2025 · Updated 3 months ago
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud… ☆1,249 · Dec 17, 2025 · Updated 3 months ago
- Sound classification implemented in TensorFlow; blog address: ☆107 · May 8, 2020 · Updated 5 years ago
- ☆1,680 · Jul 25, 2024 · Updated last year
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch ☆393 · Jun 16, 2021 · Updated 4 years ago
- Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN ☆97 · Apr 15, 2019 · Updated 6 years ago
- Speech emotion recognition implemented in PyTorch ☆261 · Dec 17, 2025 · Updated 3 months ago
- ESC-50: Dataset for Environmental Sound Classification ☆1,765 · Mar 20, 2024 · Updated 2 years ago
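- This project uses a variety of advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods such as MelSpectrogram, Spectrogram, MFCC, and Fbank ☆307 · Dec 17, 2025 · Updated 3 months ago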
- PyTorch code for "Rethinking CNN Models for Audio Classification" ☆129 · Mar 25, 2021 · Updated 4 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2) ☆794 · Apr 11, 2024 · Updated last year
- ☆88 · May 27, 2023 · Updated 2 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer". ☆1,438 · May 21, 2023 · Updated 2 years ago
- Signal classification and recognition based on Mel spectrograms ☆23 · Mar 31, 2023 · Updated 2 years ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection" ☆476 · Sep 18, 2025 · Updated 6 months ago
- Learning differentiable temporal resolution on time-series data. ☆37 · Nov 12, 2022 · Updated 3 years ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection ☆17 · Nov 19, 2024 · Updated last year
- Method for Splitting the DeepShip Dataset ☆61 · Nov 21, 2025 · Updated 4 months ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training … ☆334 · Nov 20, 2024 · Updated last year
- Official implementation of Hierarchical Spectrogram Transformers (HST) ☆20 · Oct 10, 2022 · Updated 3 years ago
- Leveraging BERT to Improve Spoken Language Identification ☆17 · Nov 22, 2022 · Updated 3 years ago
- The PyTorch code for "Unraveling Complex Data Diversity in Underwater Acoustic Target Recognition through Convolution-based Mixture of Ex… ☆31 · Mar 5, 2024 · Updated 2 years ago
- ☆12 · Jun 14, 2022 · Updated 3 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation". ☆149 · Jul 13, 2023 · Updated 2 years ago
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023) ☆31 · Feb 4, 2024 · Updated 2 years ago
- Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features. ☆13 · Apr 15, 2020 · Updated 5 years ago
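- A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition. It currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, as well as multiple data augmentation methods. ☆724 · Dec 17, 2025 · Updated 3 months ago
- PyTorch port of Google Research's VGGish model used for extracting audio features. ☆409 · Nov 3, 2021 · Updated 4 years ago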
- ☆21 · Mar 8, 2020 · Updated 6 years ago
- The official implementation of the paper "A spatio-temporal deep learning approach for underwater acoustic signals classification". In th… ☆31 · Apr 6, 2023 · Updated 2 years ago
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023) ☆72 · Mar 11, 2025 · Updated last year
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization ☆2,836 · Dec 8, 2025 · Updated 3 months ago
- ☆11 · Jun 14, 2024 · Updated last year
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit ☆1,237 · Updated this week
- Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) ☆1,285 · Mar 25, 2023 · Updated 2 years ago
- Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel. ☆276 · Feb 8, 2026 · Updated last month
- ☆21 · Apr 27, 2024 · Updated last year