valiakon / MultimodalAnalysis_SpeakerDiarizationLinks
The project tries to solve a speaker diarization problem using audio features, face recognition and video feature extraction from face image, mouth tracking.
☆15Updated 7 years ago
Alternatives and similar repositories for MultimodalAnalysis_SpeakerDiarization
Users that are interested in MultimodalAnalysis_SpeakerDiarization are comparing it to the libraries listed below
Sorting:
- ☆42Updated 5 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆25Updated 4 years ago
- ☆112Updated 3 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Updated 4 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- Multi-modal Speech Emotion Recogniton on IEMOCAP dataset☆95Updated 2 years ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆40Updated 2 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Updated last year
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆187Updated last year
- ☆17Updated 4 years ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆53Updated last year
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Updated 4 years ago
- Acoustic feature extraction using Librosa library and openSMILE toolkit.使用Librosa音频处理库和openSMILE工具包,进行简单的声学特征提取☆216Updated 5 years ago
- Multilingual datasets with raw audio for speech emotion recognition☆30Updated 4 years ago
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆21Updated last year
- ☆157Updated 3 years ago
- ☆103Updated 4 years ago
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET☆27Updated 3 years ago
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆83Updated 3 years ago
- ☆49Updated 2 years ago
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Updated 2 years ago
- Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch☆42Updated last year
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Updated last year
- Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.150…☆83Updated 4 years ago
- Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch☆106Updated 5 years ago
- Time-domain synthetic speech detection net (TSSDNet), having the classic ResNet and Inception Net style structures (Res-TSSDNet and Inc-T…☆70Updated 4 years ago
- Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"☆56Updated 3 years ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆164Updated 2 years ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆50Updated 3 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆133Updated 3 years ago