KingH12138 / Pytorch-AudioClassification-master
Python code based on PyTorch for audio classification
☆48 Updated 3 years ago
Alternatives and similar repositories for Pytorch-AudioClassification-master
Users who are interested in Pytorch-AudioClassification-master are comparing it to the repositories listed below
- The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a vari… ☆562 Updated 7 months ago
- Speech emotion recognition implemented in PyTorch ☆247 Updated 7 months ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information ☆160 Updated 2 years ago
- Simple acoustic feature extraction using the Librosa audio processing library and the openSMILE toolkit. ☆213 Updated 5 years ago
- alaaNfissi / SigWavNet-Learning-Multiresolution-Signal-Wavelet-Network-for-Speech-Emotion-Recognition: This paper has been accepted for publication in IEEE Transactions on Affective Computing. ☆18 Updated 9 months ago
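- Python implementation of speaker recognition (voiceprint recognition) algorithms, including GMM (completed), GMM-UBM, ivector, and deep-learning-based voiceprint recognition (self-attention completed). ☆103 Updated 2 years ago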
- Deformable Speech Transformer (DST) ☆34 Updated last year
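- Signal classification and recognition based on Mel spectrograms ☆23 Updated 2 years ago
- Practice exercises in speech emotion recognition and speaker identification, built on the CASIA database ☆72 Updated 2 years ago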
- Speech Emotion Recognition ☆28 Updated 5 years ago
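- This project splits the RAVDESS dataset into 1 s short utterances and trains on them with openSMILE + CNN, aiming to classify each short utterance into one of four emotions: happy, sad, angry, and neutral. The final accuracy reaches about 76%. ☆61 Updated 4 years ago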
- ☆16 Updated 6 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E… ☆184 Updated last year
- The PyTorch code for "Unraveling Complex Data Diversity in Underwater Acoustic Target Recognition through Convolution-based Mixture of Ex… ☆29 Updated last year
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition ☆77 Updated last year
- Multimodal emotion recognition combining speech and text, with large-model fine-tuning ☆23 Updated 2 years ago
- ☆25 Updated last year
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud… ☆1,185 Updated 5 months ago
- Paper List ☆18 Updated 5 months ago
- A spectro-temporal fusion feature, STgram, with MobileFaceNet For more stable Anomalous Sound Detection ☆96 Updated 2 years ago
- Automatic Depression Detection: a GRU/ BiLSTM-based Model and An Emotional Audio-Textual Corpus ☆194 Updated 2 years ago
- Speech Emotion Classification with novel Parallel CNN-Transformer model built with PyTorch, plus thorough explanations of CNNs, Transform… ☆259 Updated 5 years ago
- Official implementation of the paper "An Investigation of Preprocessing Filters and Deep Learning Methods for Vessel Type Classification … ☆28 Updated last year
- ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore t… ☆514 Updated 7 months ago
- Speech signal processing experiment tutorial, with Python code ☆341 Updated 3 years ago
- ☆21 Updated 4 months ago
- ☆64 Updated 5 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2) ☆760 Updated last year
- Method for Splitting the DeepShip Dataset ☆50 Updated 2 weeks ago
- An unofficial train-test split for ShipsEar: An underwater vessel noise database ☆22 Updated last year