The project tries to solve a speaker diarization problem using audio features, face recognition and video feature extraction from face image, mouth tracking.
☆15Feb 10, 2019Updated 7 years ago
Alternatives and similar repositories for MultimodalAnalysis_SpeakerDiarization
Users that are interested in MultimodalAnalysis_SpeakerDiarization are comparing it to the libraries listed below
Sorting:
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆33Aug 10, 2020Updated 5 years ago
- ☆11Sep 4, 2023Updated 2 years ago
- 这是一个自动抓取和展示GIS相关学术期刊最新文章的系统。系统会定期从设定的RSS源获取最新文章,并提供中英文双语展示。☆10Jan 14, 2025Updated last year
- Predicting Political Instability and Social Conflicts Using Multimodal Data☆10Jun 6, 2016Updated 9 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- ☆49Nov 24, 2022Updated 3 years ago
- ☆12May 19, 2019Updated 6 years ago
- Summaries of machine learning papers☆12Aug 19, 2022Updated 3 years ago
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago
- ☆15Sep 3, 2025Updated 6 months ago
- Speaker diarization and speech to text☆14Dec 17, 2020Updated 5 years ago
- Testing SSVEP with psychopy☆10Aug 29, 2023Updated 2 years ago
- ☆11Jul 21, 2023Updated 2 years ago
- 🗺️ To be able to discover, request and use aggregate imagery products based on landsat-8/9, Sentinel 2 and other sensors from within QG…☆18Feb 10, 2025Updated last year
- LinuxShell编程笔记☆15Aug 29, 2017Updated 8 years ago
- A Single-Channel Consumer-Grade EEG Device for Brain-Computer Interface: Enhancing Detection of SSVEP and Its Amplitude Modulation (IEEE …☆11Mar 20, 2020Updated 5 years ago
- Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"☆10Dec 19, 2021Updated 4 years ago
- ☆12May 27, 2019Updated 6 years ago
- Perform three types of feature extraction: STFT, MFCC and MelSpectrogram. Apply CNN/VGG with or without RNN architecture. Able to achieve…☆15Jun 28, 2020Updated 5 years ago
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Aug 27, 2023Updated 2 years ago
- This is the code for Coupled-translation Fusion Network.☆11Dec 2, 2021Updated 4 years ago
- ☆22Nov 29, 2024Updated last year
- ☆12Dec 14, 2023Updated 2 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆69Jul 8, 2024Updated last year
- Some Useful Tools Code☆16Feb 3, 2026Updated last month
- ☆17Mar 21, 2024Updated last year
- ☆14Oct 7, 2021Updated 4 years ago
- The updated version of TDAA model.☆14Jul 2, 2020Updated 5 years ago
- Image segmentation is the process of dividing an image into multiple parts. It is typically used to identify objects or other relevant in…☆17Jan 2, 2020Updated 6 years ago
- A Modern Configuration/Registry System designed for deeplearning, with some utils.☆18Dec 23, 2025Updated 2 months ago
- Tune-Mode ConvBN Blocks For Efficient Transfer Learning☆18Aug 1, 2023Updated 2 years ago
- ☆22Oct 10, 2024Updated last year
- SSVEP Brain Computer Interface - Example code for real-time detection of SSVEP using the Canonical Correlation Analysis (CCA) code in rea…☆17Jul 25, 2019Updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Jan 8, 2021Updated 5 years ago
- TDY-CNN for text-independent speaker verification☆19Nov 7, 2022Updated 3 years ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Sep 13, 2022Updated 3 years ago
- A curated list of awesome Voiceprint Recognition papers☆18Jul 9, 2021Updated 4 years ago
- ☆20Oct 23, 2022Updated 3 years ago