tuanchien/asd

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tuanchien/asd)

tuanchien / asd

Active Speaker Detection

☆19

Alternatives and similar repositories for asd

Users that are interested in asd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

okankop / ASDNet
View on GitHub
Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset
☆73Jan 18, 2022Updated 4 years ago
zcxu-eric / Ego4d_TalkNet_ASD
View on GitHub
☆21Feb 15, 2022Updated 4 years ago
EGO4D / audio-visual
View on GitHub
☆69Sep 13, 2022Updated 3 years ago
afourast / avobjects
View on GitHub
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
☆114Nov 16, 2020Updated 5 years ago
zcxu-eric / AVA-AVD
View on GitHub
☆51Nov 24, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dragnet-org / dragnet_data
View on GitHub
code and data used to build a training dataset for dragnet models
☆10Nov 29, 2020Updated 5 years ago
clovaai / lookwhostalking
View on GitHub
Look Who’s Talking: Active Speaker Detection in the Wild
☆76Aug 24, 2023Updated 2 years ago
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
chanzuckerberg / software-mention-extraction
View on GitHub
Software mention extraction and linking from scientific articles
☆14Sep 2, 2022Updated 3 years ago
MGH-LMIC / graynet_keras
View on GitHub
Pretrained parameters for CT deep learning models.
☆13Sep 24, 2019Updated 6 years ago
zexupan / MuSE
View on GitHub
☆42Nov 22, 2024Updated last year
walkoncross / voxceleb2-download-zyf
View on GitHub
Tools for downloading VoxCeleb2 dataset
☆35Mar 16, 2024Updated 2 years ago
FloretCat / CMRAN
View on GitHub
Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization， ACM MM 2020
☆33Nov 6, 2020Updated 5 years ago
WangHelin1997 / SpecAugment-plus
View on GitHub
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆34Jun 25, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
varanauskas / nexered
View on GitHub
Support Next.js redirects in Cloudflare Pages
☆18Nov 27, 2022Updated 3 years ago
dkuhman / biomech_gait_analysis
View on GitHub
This repository hosts scripts related to human biomechanical gait analysis
☆13May 16, 2020Updated 6 years ago
diku-dk / RenalVesselSeg
View on GitHub
☆10Jun 2, 2023Updated 3 years ago
xiaoxiaomiao323 / MSA
View on GitHub
☆16Feb 19, 2026Updated 5 months ago
TaoRuijie / TalkNet-ASD
View on GitHub
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
☆489Oct 23, 2023Updated 2 years ago
mavceleb / mavceleb_baseline
View on GitHub
☆11Nov 5, 2025Updated 8 months ago
afourast / deep_lip_reading
View on GitHub
Code and models for evaluating a state-of-the-art lip reading network
☆196Mar 24, 2023Updated 3 years ago
cuis15 / learning-to-collaborate
View on GitHub
☆11May 27, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Overcautious / ADENet
View on GitHub
Accepted by TMM 2022
☆19Aug 18, 2022Updated 3 years ago
zaocan666 / CollageNet
View on GitHub
code and demo of the ISMIR 2021 paper CollageNet
☆12Jul 12, 2021Updated 5 years ago
basavaraj-hampiholi / Age-Estimation--DEX-in-Pytorch
View on GitHub
Age Estimation: Implementation of DEX paper in Pytorch
☆10Jan 17, 2020Updated 6 years ago
luan78zaoha / kaldi-timit-sre-ivector
View on GitHub
Develop speaker recognition model based on i-vector using TIMIT database
☆16Jul 4, 2019Updated 7 years ago
hellbell / KeyPatchGan
View on GitHub
[ECCV 2018] Unsupervised Holistic Image Generation from Key Local Patches
☆12Jul 29, 2019Updated 6 years ago
TaoRuijie / AVCleanse
View on GitHub
ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆44Oct 31, 2022Updated 3 years ago
shincling / discreteSeparation
View on GitHub
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Oct 25, 2021Updated 4 years ago
YihengZhang-CV / MCL-Motion-Focused-Contrastive-Learning
View on GitHub
☆15Jan 11, 2022Updated 4 years ago
Queequeg92 / DualPathNet
View on GitHub
Dual Path Networks on cifar-10 and fashion-mnist datasets
☆18Aug 31, 2017Updated 8 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
RetroCirce / Auto-mask-Music-Generative-Model-via-EC2-VAE-Disentanglement
View on GitHub
Implementing EC2-VAE to the conditional generative model to generate music with controlling rhythm patterns
☆14Aug 13, 2020Updated 5 years ago
Jiang-Yidi / FlatTrajectoryDistillation_FTD
View on GitHub
The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)
☆18Mar 21, 2023Updated 3 years ago
DhilipSanjay / Human-Biomechanic-Analysis
View on GitHub
Deep Learning Models for the Early Detection of Parkinson’s Disease using the motor-based symptoms.
☆17Feb 19, 2022Updated 4 years ago
danmic / av-se
View on GitHub
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
☆222Apr 16, 2023Updated 3 years ago
ArmanAfrasiyabi / associative-alignment-fs
View on GitHub
Code for "Associative alignment for few-shot image classification"- ECCV'2020.
☆20Nov 23, 2020Updated 5 years ago
SRA2 / SPELL
View on GitHub
Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)
☆67Oct 29, 2023Updated 2 years ago
l3das / L3DAS21
View on GitHub
☆37Jun 22, 2022Updated 4 years ago