KingH12138 / Pytorch-AudioClassification-master
Python code based on PyTorch for audio classification
☆39Updated 2 years ago
Related projects
Alternatives and complementary repositories for Pytorch-AudioClassification-master
- The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a vari…☆419Updated last week
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆129Updated 11 months ago
- Deformable Speech Transformer (DST)☆27Updated 3 months ago
- Speech emotion recognition implemented in PyTorch☆134Updated 2 months ago
- Python implementations of speaker recognition (voiceprint recognition) algorithms, including GMM (completed), GMM-UBM, ivector, and deep-learning-based speaker recognition (self-attention completed).☆77Updated last year
- Simple acoustic feature extraction using the Librosa audio processing library and the openSMILE toolkit.☆182Updated 4 years ago
- Speech Emotion Recognition☆26Updated 4 years ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆66Updated 8 months ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆161Updated 6 months ago
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆22Updated 3 months ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆144Updated 3 years ago
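- Audio Split: speech endpoint detection and speech segmentation based on the double-threshold method☆127Updated 4 years ago
- This project cuts the RAVDESS dataset into 1 s short utterances and trains on them with openSMILE+CNN, aiming to classify the short utterances into four emotions: happy, sad, angry, and neutral. The final accuracy reaches about 76%.☆52Updated 3 years ago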
- The PyTorch code for "Unraveling Complex Data Diversity in Underwater Acoustic Target Recognition through Convolution-based Mixture of Ex…☆13Updated 8 months ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆610Updated 7 months ago
- SpeechFormer++ in PyTorch☆42Updated last year
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆35Updated 11 months ago
- Paper List☆18Updated last month
- A spectro-temporal fusion feature, STgram, with MobileFaceNet for more stable anomalous sound detection☆73Updated last year
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆51Updated 4 months ago
- Sound classification implemented in TensorFlow; blog address:☆95Updated 4 years ago
- Bachelor Thesis - Deep Learning-based Multi-modal Depression Estimation☆55Updated last year
- Multimodal emotion recognition combining speech and text, with large-model fine-tuning☆13Updated last year
- Histogram Layer Time Delay Neural Networks For Passive Sonar Classification☆13Updated 7 months ago
- Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module☆58Updated 2 years ago
- ☆16Updated 2 weeks ago
- ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore t…☆402Updated this week
- ☆20Updated 3 weeks ago
- [EMNLP2023] Conversation Understanding using Relational Temporal Graph Neural Networks with Auxiliary Cross-Modality Interaction☆46Updated 4 months ago
- Speech enhancement☆15Updated 3 years ago