The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a variety of preprocessing methods.
☆590Dec 17, 2025Updated 3 months ago
Alternatives and similar repositories for AudioClassification-Pytorch
Users that are interested in AudioClassification-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法☆102Dec 17, 2025Updated 3 months ago
- A python code based on pytorch applied to AudioClassification☆48Jul 15, 2022Updated 3 years ago
- Python的音频工具☆16Dec 5, 2025Updated 4 months ago
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud…☆1,257Dec 17, 2025Updated 3 months ago
- 基于Tensorflow实现声音分类,博客地址:☆107May 8, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆1,702Jul 25, 2024Updated last year
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆393Jun 16, 2021Updated 4 years ago
- 城市声音分类 Urban Sound Classification with TensorFlow Keras - MLP, RNN, CNN☆97Apr 15, 2019Updated 6 years ago
- 基于Pytorch实现的语音情感识别☆266Dec 17, 2025Updated 3 months ago
- ESC-50: Dataset for Environmental Sound Classification☆1,787Mar 20, 2024Updated 2 years ago
- 本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法☆308Dec 17, 2025Updated 3 months ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆129Mar 25, 2021Updated 5 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆800Apr 11, 2024Updated 2 years ago
- ☆88May 27, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,447May 21, 2023Updated 2 years ago
- 基于梅尔频谱的信号分类和识别☆23Mar 31, 2023Updated 3 years ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆481Sep 18, 2025Updated 6 months ago
- Learning differentiable temporal resolution on time-series data.☆37Nov 12, 2022Updated 3 years ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆17Nov 19, 2024Updated last year
- Method for Splitting the DeepShip Dataset☆62Nov 21, 2025Updated 4 months ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆338Nov 20, 2024Updated last year
- Official implementation of Hierarchical Spectrogram Transformers (HST)☆20Oct 10, 2022Updated 3 years ago
- Leveraging BERT to Improve Spoken Language Identification☆17Nov 22, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The PyTorch code for "Unraveling Complex Data Diversity in Underwater Acoustic Target Recognition through Convolution-based Mixture of Ex…☆31Mar 5, 2024Updated 2 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆150Jul 13, 2023Updated 2 years ago
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆32Feb 4, 2024Updated 2 years ago
- Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features.☆13Apr 15, 2020Updated 5 years ago
- Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。☆723Dec 17, 2025Updated 3 months ago
- ☆21Mar 8, 2020Updated 6 years ago
- Pytorch port of Google Research's VGGish model used for extracting audio features.☆410Nov 3, 2021Updated 4 years ago
- The official implementation of the paper "A spatio-temporal deep learning approach for underwater acoustic signals classification". In th…☆31Apr 6, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆73Mar 11, 2025Updated last year
- ☆11Jun 14, 2024Updated last year
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,872Dec 8, 2025Updated 4 months ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆1,257Mar 31, 2026Updated last week
- Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别☆1,296Mar 25, 2023Updated 3 years ago
- Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.☆276Feb 8, 2026Updated 2 months ago
- ☆21Apr 27, 2024Updated last year