valiakon/MultimodalAnalysis_SpeakerDiarization

valiakon / MultimodalAnalysis_SpeakerDiarization

The project approaches speaker diarization using audio features, face recognition, and video features extracted from face images and mouth tracking.

☆16

Alternatives and similar repositories for MultimodalAnalysis_SpeakerDiarization

Users that are interested in MultimodalAnalysis_SpeakerDiarization are comparing it to the libraries listed below.

jagabandhumishra / W2V-E2E-Language-Diarization
☆11 · Sep 4, 2023 · Updated 2 years ago
konstantinklemmer / sxl
SXL: Spatially explicit learning of geographic processes with auxiliary tasks
☆15 · Nov 26, 2021 · Updated 4 years ago
kristinagligoric / confidence-driven-inference
☆17 · Jul 23, 2025 · Updated last year
WangYihang / LinuxShellScript
Linux shell programming notes
☆15 · Aug 29, 2017 · Updated 8 years ago
nishithbsk / ConflictPrediction
Predicting Political Instability and Social Conflicts Using Multimodal Data
☆10 · Jun 6, 2016 · Updated 10 years ago
NoobZ2 / Annotation
Electronic medical record annotation tool demo
☆13 · Jun 21, 2019 · Updated 7 years ago
vita-epfl / rock-pytorch
A PyTorch implementation of "Revisiting Multi-Task Learning with ROCK: a Deep Residual Auxiliary Block for Visual Detection"
☆14 · Jun 29, 2020 · Updated 6 years ago
shincling / TDAAv2
The updated version of TDAA model.
☆14 · Jul 2, 2020 · Updated 6 years ago
cmFighting / mnist_demo_torch1.6
MNIST dataset demo, developed with torch 1.6
☆12 · Aug 30, 2020 · Updated 5 years ago
terry-yip / speech-to-text
Speaker diarization and speech to text
☆14 · Dec 17, 2020 · Updated 5 years ago
tango4j / Python-Speaker-Diarization
Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
☆11 · Apr 6, 2020 · Updated 6 years ago
shkim816 / acnn_speaker_recog
acnn for text-independent speaker recognition
☆10 · Feb 8, 2022 · Updated 4 years ago
HaoFengyuan / EEND-IAAE
The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…
☆11 · Aug 27, 2023 · Updated 2 years ago
vishalshar / SpeakerDiarization_RNN_CNN_LSTM
Speaker Diarization is the problem of separating speakers in an audio recording. There could be any number of speakers and final result should stat…
☆64 · Jan 8, 2021 · Updated 5 years ago
GraphPKU / Open-World-KG
The official codes of Rethinking Knowledge Graph Evaluation Under the Open-World Assumption (NeurIPS 2022)
☆24 · Sep 20, 2022 · Updated 3 years ago
JiJiJiang / ASV-Anti-Spoofing-DADA
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
☆19 · Jul 17, 2026 · Updated last week
AnkushMalaker / pretrained-dcnn-attention-ser
Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"
☆10 · Dec 19, 2021 · Updated 4 years ago
david-yoon / attentive-modality-hopping-for-SER
TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20
☆34 · Aug 10, 2020 · Updated 5 years ago
yakovmon / Real-Time-Audio-Visual-Speech-Enhancement
☆13 · May 27, 2019 · Updated 7 years ago
khtee / text-classification-pytorch
PyTorch implementation of RNN, CNN, BiGRU and LSTM for text classification
☆10 · Apr 30, 2021 · Updated 5 years ago
yas-sim / openvino-real-time-noise-suppression-demo
Modified version of OpenVINO noise_suppression_demo. This version can handle real-time audio stream from microphone and output to headpho…
☆16 · Aug 5, 2021 · Updated 4 years ago
nexuslrf / Accel-Video-Pipe
AVPipe :-)
☆12 · Jul 16, 2021 · Updated 5 years ago
YingkunZhou / EdgeTransformerBench
edge/mobile transformer based Vision DNN inference benchmark
☆16 · Aug 29, 2025 · Updated 10 months ago
PHJhjpeng1992 / awesome-asv-antispoofing
This is a curated list of awesome ASV(Automatic Speaker Verification) Anti-Spoofing papers, libraries, datasets, and other resources.
☆22 · May 21, 2021 · Updated 5 years ago
WiraDKP / pytorch_speaker_embedding_for_diarization
Using speaker embedding for diarization in PyTorch
☆17 · Aug 29, 2020 · Updated 5 years ago
zcxu-eric / AVA-AVD
☆51 · Nov 24, 2022 · Updated 3 years ago
solmp / VideoMatting
Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python
☆12 · Mar 10, 2022 · Updated 4 years ago
guozixunnicolas / DENT_DDSP
☆24 · Jun 30, 2023 · Updated 3 years ago
skakouros / s3prl_attentive_correlation
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13 · Nov 18, 2022 · Updated 3 years ago
WiraDKP / pytorch_gru_speaker_diarization
Speaker Diarization using GRU in PyTorch
☆11 · Aug 29, 2020 · Updated 5 years ago
XiaoyuXU1 / Representational_Analysis_Tools
☆15 · May 23, 2025 · Updated last year
PreckLi / MIP-Editor
Official implementation of Cross-Modal Unlearning via Influential Neuron Path Editing in Multimodal Large Language Models
☆16 · Mar 21, 2026 · Updated 4 months ago
HackerMi / miracast
☆12 · Nov 6, 2015 · Updated 10 years ago
SpeechClub / CDER_Metric
CDER (Conversational Diarization Error Rate) Scoring Tool
☆22 · Sep 13, 2022 · Updated 3 years ago
mechanicalsea / sugar
Efficient Speech Processing Toolkit for Automatic Speaker Recognition
☆17 · Feb 8, 2023 · Updated 3 years ago
circle-hit / MuCDN
Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…
☆10 · Jul 21, 2023 · Updated 3 years ago
EdVince / ncnn-tnn-mnn-android-demo
Three-in-one ncnn & tnn & mnn Android Camera & Gallery project
☆14 · Jul 22, 2022 · Updated 4 years ago
DeclanHoare / matterbabble
Connect Discourse threads to Matterbridge
☆17 · Feb 21, 2019 · Updated 7 years ago
trinhtuanvubk / KWS-BCResnet
Keyword Spotting using BCResNet and Arcface Loss
☆13 · Jan 28, 2022 · Updated 4 years ago