clovaai/lookwhostalking

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/clovaai/lookwhostalking)

clovaai / lookwhostalking

Look Who’s Talking: Active Speaker Detection in the Wild

☆76

Alternatives and similar repositories for lookwhostalking

Users that are interested in lookwhostalking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tiago-Roxo / WASD
View on GitHub
☆20Updated this week
JaesungHuh / VoxSRC2022
View on GitHub
VoxSRC2022 workshop development kit
☆19Jul 21, 2022Updated 4 years ago
bytecell / slotminer
View on GitHub
Tool for slot extraction from text
☆15Oct 23, 2022Updated 3 years ago
sasv-challenge / SASVC2022_Baseline
View on GitHub
Baseline for the Spoofing-aware Speaker Verification Challenge 2022
☆68May 3, 2022Updated 4 years ago
naver-ai / PfLayer
View on GitHub
Learning Features with Parameter-Free Layers, ICLR 2022
☆84May 3, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
joonson / voxceleb_unsupervised
View on GitHub
Augmentation adversarial training for self-supervised speaker recognition
☆77Aug 15, 2021Updated 4 years ago
EGO4D / audio-visual
View on GitHub
☆69Sep 13, 2022Updated 3 years ago
zaemyung / sentsplit
View on GitHub
A flexible sentence segmentation library using CRF model and regex rules
☆32Apr 16, 2026Updated 3 months ago
joonson / voxsrc_2019
View on GitHub
VoxSRC Challenge
☆31Jun 11, 2019Updated 7 years ago
naver-ai / hype
View on GitHub
[ECCV 2024] Official PyTorch implementation of "HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts"
☆20Nov 22, 2024Updated last year
JaesungHuh / VoxMovies
View on GitHub
Evaluation script for VoxMovies dataset in PyTorch
☆23Jan 12, 2024Updated 2 years ago
naver-ai / augsub
View on GitHub
[CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"
☆46Mar 25, 2025Updated last year
passing2961 / PersonaChatGen
View on GitHub
🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"
☆13Mar 26, 2024Updated 2 years ago
tuanchien / asd
View on GitHub
Active Speaker Detection
☆19Jun 19, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
joonson / voxconverse
View on GitHub
Spot the conversation: speaker diarisation in the wild
☆171Jul 26, 2022Updated 4 years ago
a-nagrani / VoxSRC2020
View on GitHub
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020
☆43Jul 17, 2020Updated 6 years ago
Jiang-Yidi / TS-TalkNet
View on GitHub
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
☆61May 29, 2023Updated 3 years ago
zaemyung / streamlit-tutorial
View on GitHub
A simple tutorial script on Streamlit using the Iris Dataset
☆13Sep 13, 2023Updated 2 years ago
passing2961 / EmoNSMC
View on GitHub
Korean large emotion labeled dataset (EmoNSMC)
☆14Mar 5, 2020Updated 6 years ago
okankop / ASDNet
View on GitHub
Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset
☆73Jan 18, 2022Updated 4 years ago
passing2961 / Stark
View on GitHub
Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…
☆19Dec 27, 2024Updated last year
clovaai / voxceleb_trainer
View on GitHub
In defence of metric learning for speaker recognition
☆1,170Apr 22, 2026Updated 3 months ago
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
deepaudio / deepaudio-speaker
View on GitHub
neural network based speaker embedder
☆24Jan 7, 2023Updated 3 years ago
naver-ai / imagenet-annotation-tool
View on GitHub
☆17Jul 24, 2023Updated 3 years ago
naver-ai / seit
View on GitHub
[ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT
☆56Aug 12, 2024Updated last year
naver-ai / tobo
View on GitHub
[NeurIPS 2025] Official PyTorch implementation of "Token Bottleneck: One Token to Remember Dynamics"
☆32Feb 2, 2026Updated 5 months ago
Jungjee / RawNet
View on GitHub
Official repository for RawNet, RawNet2, and RawNet3
☆407Mar 21, 2024Updated 2 years ago
Sindhu-Hegde / multivsr
View on GitHub
Official code for the paper "Scaling Multilingual Visual Speech Recognition"
☆20Aug 15, 2025Updated 11 months ago
plnguyen2908 / UniTalk-ASD-code
View on GitHub
[Interspeech 2026] Revisiting Active Speaker Detection: An In-the-Wild Benchmark for Generalization and Robustness
☆22Jun 25, 2026Updated last month
JaesungHuh / av-diarization
View on GitHub
Audio-visual diarization pipeline used for creating VoxConverse dataset
☆22Jun 6, 2025Updated last year
facebookresearch / MMCSG
View on GitHub
This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …
☆41Mar 13, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
joonson / syncnet_python
View on GitHub
Out of time: automated lip sync in the wild
☆895Apr 17, 2026Updated 3 months ago
fuankarion / active-speakers-context
View on GitHub
Code for the Active Speakers in Context Paper (CVPR2020)
☆58May 19, 2021Updated 5 years ago
zaocan666 / DyViSE
View on GitHub
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆12Jul 5, 2022Updated 4 years ago
BUTSpeechFIT / mt-asr-data-prep
View on GitHub
☆25Feb 26, 2026Updated 5 months ago
naver-ai / negmerge
View on GitHub
[ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"
☆16Nov 25, 2025Updated 8 months ago
naver-ai / calm
View on GitHub
☆90Jun 8, 2022Updated 4 years ago
walkoncross / voxceleb2-download-zyf
View on GitHub
Tools for downloading VoxCeleb2 dataset
☆35Mar 16, 2024Updated 2 years ago