The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
☆596Dec 17, 2025Updated 4 months ago
Alternatives and similar repositories for AudioClassification-Pytorch
Users that are interested in AudioClassification-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法☆102Dec 17, 2025Updated 4 months ago
- A python code based on pytorch applied to AudioClassification☆48Jul 15, 2022Updated 3 years ago
- Python的音频工具☆16Dec 5, 2025Updated 4 months ago
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud…☆1,269Dec 17, 2025Updated 4 months ago
- 基于Tensorflow实现声音分类,博客地址:☆106May 8, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆1,724Jul 25, 2024Updated last year
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆395Jun 16, 2021Updated 4 years ago
- 城市声音分类 Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN☆97Apr 15, 2019Updated 7 years ago
- 基于Pytorch实现的语音情感识别☆269Dec 17, 2025Updated 4 months ago
- ESC-50: Dataset for Environmental Sound Classification☆1,802Mar 20, 2024Updated 2 years ago
- 本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识 别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法☆312Dec 17, 2025Updated 4 months ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆129Mar 25, 2021Updated 5 years ago
- ☆88May 27, 2023Updated 2 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆805Apr 11, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,452May 21, 2023Updated 2 years ago
- 基于梅尔频谱的信号分类和识别☆23Mar 31, 2023Updated 3 years ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆489Sep 18, 2025Updated 7 months ago
- Learning differentiable temporal resolution on time-series data.☆37Nov 12, 2022Updated 3 years ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆18Nov 19, 2024Updated last year
- Method for Splitting the DeepShip Dataset☆65Nov 21, 2025Updated 5 months ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆340Nov 20, 2024Updated last year
- Official implementation of Hierarchical Spectrogram Transformers (HST)☆20Oct 10, 2022Updated 3 years ago
- Leveraging BERT to Improve Spoken Language Identification☆17Nov 22, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The PyTorch code for "Unraveling Complex Data Diversity in Underwater Acoustic Target Recognition through Convolution-based Mixture of Ex…☆32Mar 5, 2024Updated 2 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆150Jul 13, 2023Updated 2 years ago
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆33Feb 4, 2024Updated 2 years ago
- Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features.☆13Apr 15, 2020Updated 6 years ago
- Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。☆721Dec 17, 2025Updated 4 months ago
- ☆21Mar 8, 2020Updated 6 years ago
- Pytorch port of Google Research's VGGish model used for extracting audio features.☆410Nov 3, 2021Updated 4 years ago
- The official implementation of the paper "A spatio-temporal deep learning approach for underwater acoustic signals classification". In th…☆32Apr 6, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆73Mar 11, 2025Updated last year
- ☆11Jun 14, 2024Updated last year
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,918Dec 8, 2025Updated 4 months ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆1,281Apr 10, 2026Updated 3 weeks ago
- Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别☆1,305Mar 25, 2023Updated 3 years ago
- Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.☆277Feb 8, 2026Updated 2 months ago
- ☆21Apr 27, 2024Updated 2 years ago