☆21Feb 15, 2022Updated 4 years ago
Alternatives and similar repositories for Ego4d_TalkNet_ASD
Users that are interested in Ego4d_TalkNet_ASD are comparing it to the libraries listed below
Sorting:
- ☆67Sep 13, 2022Updated 3 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- Code for the Active Speakers in Context Paper (CVPR2020)☆56May 19, 2021Updated 4 years ago
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'☆450Oct 23, 2023Updated 2 years ago
- Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset☆72Jan 18, 2022Updated 4 years ago
- ☆13May 9, 2022Updated 3 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset☆59Nov 23, 2020Updated 5 years ago
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆20Mar 17, 2025Updated 11 months ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆45May 25, 2023Updated 2 years ago
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods☆20Jan 12, 2023Updated 3 years ago
- ☆20Dec 29, 2024Updated last year
- ☆49Nov 24, 2022Updated 3 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆54Jan 29, 2024Updated 2 years ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Sep 19, 2025Updated 5 months ago
- This repository presents FSD dataset for song deepfake detection.☆25Aug 18, 2025Updated 6 months ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆68Oct 29, 2023Updated 2 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’☆21Dec 19, 2025Updated 2 months ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆49May 1, 2023Updated 2 years ago
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆167Mar 23, 2025Updated 11 months ago
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆16Jun 12, 2025Updated 8 months ago
- Code for the paper PermuteFormer☆41Oct 10, 2021Updated 4 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- ☆48Jun 26, 2025Updated 8 months ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- ☆13Oct 25, 2024Updated last year
- ☆11Nov 7, 2024Updated last year
- Generate visuals based on music rhythm using reaction-diffusion.☆10Nov 15, 2024Updated last year
- AI Deinterlacing functions for Vapoursynth☆17Nov 4, 2025Updated 4 months ago
- 为visinger SVS系统写的展示系统~本质仍然是个音乐播放器☆11Apr 18, 2023Updated 2 years ago
- ☆11Jan 12, 2023Updated 3 years ago
- This is our Final Year Project titled " Implementation of seam carving for image retargeting using CUDA enabled GPU"☆11Nov 16, 2024Updated last year