A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.
☆578Dec 17, 2025Updated 2 months ago
Alternatives and similar repositories for AudioClassification-Pytorch
Users who are interested in AudioClassification-Pytorch are comparing it to the repositories listed below
- Audio classification implemented with PaddlePaddle, supporting EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods☆100Dec 17, 2025Updated 2 months ago
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud…☆1,239Dec 17, 2025Updated 2 months ago
- A python code based on pytorch applied to AudioClassification☆48Jul 15, 2022Updated 3 years ago
- Audio tools for Python☆16Dec 5, 2025Updated 2 months ago
- ☆1,668Jul 25, 2024Updated last year
- Speech emotion recognition implemented with Pytorch☆258Dec 17, 2025Updated 2 months ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆392Jun 16, 2021Updated 4 years ago
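- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods such as MelSpectrogram, Spectrogram, MFCC, and Fbank☆304Dec 17, 2025Updated 2 months ago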
- Sound classification implemented with Tensorflow; blog address:☆107May 8, 2020Updated 5 years ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆129Mar 25, 2021Updated 4 years ago
- Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN☆97Apr 15, 2019Updated 6 years ago
- ESC-50: Dataset for Environmental Sound Classification☆1,746Mar 20, 2024Updated last year
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆472Sep 18, 2025Updated 5 months ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆790Apr 11, 2024Updated last year
- Signal classification and recognition based on Mel spectrograms☆23Mar 31, 2023Updated 2 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,424May 21, 2023Updated 2 years ago
- ☆86May 27, 2023Updated 2 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆330Nov 20, 2024Updated last year
- Learning differentiable temporal resolution on time-series data.☆36Nov 12, 2022Updated 3 years ago
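- A streaming and non-streaming automatic speech recognition framework implemented in Pytorch, compatible with both online and offline recognition; it currently supports the Conformer, Squeezeformer, and DeepSpeech2 models and multiple data augmentation methods.☆722Dec 17, 2025Updated 2 months ago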
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆17Nov 19, 2024Updated last year
- Leveraging BERT to Improve Spoken Language Identification☆17Nov 22, 2022Updated 3 years ago
- Official implementation of Hierarchical Spectrogram Transformers (HST)☆19Oct 10, 2022Updated 3 years ago
- ☆21Mar 8, 2020Updated 5 years ago
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,789Dec 8, 2025Updated 2 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Jul 13, 2023Updated 2 years ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆1,218Feb 11, 2026Updated 2 weeks ago
- ☆14Jul 11, 2022Updated 3 years ago
- ☆20Apr 27, 2024Updated last year
- Method for Splitting the DeepShip Dataset☆58Nov 21, 2025Updated 3 months ago
- The PyTorch code for "Unraveling Complex Data Diversity in Underwater Acoustic Target Recognition through Convolution-based Mixture of Ex…☆31Mar 5, 2024Updated last year
- ☆12Jun 14, 2022Updated 3 years ago
- Reading list for research topics in Sound AI☆196Aug 8, 2024Updated last year
- The repo provides information about KeSpeech dataset.☆171Oct 13, 2022Updated 3 years ago
- ☆32Updated this week
- (WIP) long-form speech generation☆31Apr 2, 2025Updated 11 months ago
- Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP)☆1,278Mar 25, 2023Updated 2 years ago
- ☆11Jun 14, 2024Updated last year
- acnn for text-independent speaker recognition