zaocan666/DyViSE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zaocan666/DyViSE)

zaocan666 / DyViSE

Dynamic vision-guided speaker embedding for audio-visual speaker diarization

☆12

Alternatives and similar repositories for DyViSE

Users that are interested in DyViSE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yucongzh / online_speaker_diarization
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
mispchallenge / misp2022_baseline
View on GitHub
☆33Jun 26, 2023Updated 3 years ago
JaesungHuh / ca-subtitle
View on GitHub
Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"
☆21Nov 3, 2025Updated 8 months ago
zcxu-eric / AVA-AVD
View on GitHub
☆51Nov 24, 2022Updated 3 years ago
X-LANCE / MSDWILD
View on GitHub
[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
☆66Jan 24, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
qinxiaoyi / Simple-Attention-Module-based-Speaker-Verification-with-Iterative-Noisy-Label-Detection
View on GitHub
☆12Jun 14, 2022Updated 4 years ago
Miffyli / asv-cm-reinforce
View on GitHub
Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE
☆13Mar 31, 2021Updated 5 years ago
Wangtk311 / SafeEar-Inference-Test-Script
View on GitHub
SafeEar是由浙大和清华共同开发的一种深度伪声探测模型。这是我撰写的模型推理脚本。我不确定它是否正确，目前我还是初学者，如有问题请原谅我并指出，谢谢！
☆16May 16, 2025Updated last year
Tiago-Roxo / WASD
View on GitHub
☆20Updated this week
xiaoxiaomiao323 / MSA
View on GitHub
☆16Feb 19, 2026Updated 5 months ago
haoheliu / DCASE_2022_Task_5
View on GitHub
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28Jul 6, 2022Updated 4 years ago
dr-pato / SSGD
View on GitHub
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15Dec 22, 2022Updated 3 years ago
yzyouzhang / SASV_PR
View on GitHub
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
☆18Jun 24, 2022Updated 4 years ago
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Junhua-Liao / Light-ASD
View on GitHub
The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)
☆181Mar 23, 2025Updated last year
JaesungHuh / av-diarization
View on GitHub
Audio-visual diarization pipeline used for creating VoxConverse dataset
☆22Jun 6, 2025Updated last year
showlab / AVA-AVD
View on GitHub
☆22Nov 24, 2022Updated 3 years ago
DanielMengLiu / AudioVisualLip
View on GitHub
☆25Feb 20, 2024Updated 2 years ago
yzyouzhang / Empirical-Channel-CM
View on GitHub
Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …
☆19Feb 15, 2022Updated 4 years ago
lstappen / MuSe2020
View on GitHub
Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).
☆16Dec 8, 2022Updated 3 years ago
aleXiehta / Causal-SE
View on GitHub
Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"
☆28Feb 26, 2023Updated 3 years ago
lstappen / MuSe-Toolbox
View on GitHub
A Phyton toolbox to fuse multiple continuous emotion annotations from several raters and diarization them to classes!
☆14Oct 24, 2021Updated 4 years ago
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
SheldonTsui / PseudoBinaural_CVPR2021
View on GitHub
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
☆72Jul 8, 2021Updated 5 years ago
GeWanying / shap-anti-spoofing
View on GitHub
This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…
☆12Jan 24, 2024Updated 2 years ago
zhang-wy15 / Attack_practical_asv
View on GitHub
ICASSP 2021 accepted paper
☆20May 20, 2021Updated 5 years ago
wenet-e2e / wesignal
View on GitHub
Production first, nn-based on-device signal processing toolkit.
☆63May 30, 2023Updated 3 years ago
NARUTO-2024 / WavBench
View on GitHub
WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models
☆38Feb 13, 2026Updated 5 months ago
zhermin / LeetNode
View on GitHub
An Adaptive Learning Software for Professors and Students (NUS-Exclusive Presently)
☆18Apr 1, 2024Updated 2 years ago
dihardchallenge / dihard3_baseline
View on GitHub
☆30Jul 21, 2022Updated 4 years ago
ju-kl / severe-weather-prediction
View on GitHub
Prediction of severe weather events and their damage in the US using machine learning
☆10Aug 2, 2020Updated 5 years ago
gi0baro / poetry-bin
View on GitHub
Poetry binary builds
☆21May 27, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
york135 / CTC_CE_for_AST
View on GitHub
The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…
☆12Mar 25, 2025Updated last year
wngh1187 / ExU-Net
View on GitHub
Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments
☆28Jul 24, 2023Updated 3 years ago
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago
fzi-forschungszentrum-informatik / NNAD
View on GitHub
Neural Networks for Automated Driving
☆14Mar 30, 2021Updated 5 years ago
jksingh07 / Garbage-Detection
View on GitHub
This project was built during the competition of Smart India Hackathon 2020. In This I am using a Android device's Camera to detect Garba…
☆12Apr 5, 2023Updated 3 years ago
mlpc-ucsd / ConstellationNet
View on GitHub
(ICLR 2021) ConstellationNet: Attentional Constellation Nets for Few-Shot Learning
☆14Apr 4, 2022Updated 4 years ago
SheldonTsui / SepStereo_ECCV2020
View on GitHub
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72Oct 20, 2020Updated 5 years ago