Disentangled Speech Embeddings using Cross-Modal Self-Supervision
☆166Apr 12, 2020Updated 5 years ago
Alternatives and similar repositories for syncnet_trainer
Users that are interested in syncnet_trainer are comparing it to the libraries listed below
Sorting:
- Out of time: automated lip sync in the wild☆873Jan 23, 2024Updated 2 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Aug 15, 2021Updated 4 years ago
- In defence of metric learning for speaker recognition☆1,165Mar 26, 2024Updated last year
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"☆423May 12, 2024Updated last year
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"☆115Nov 16, 2020Updated 5 years ago
- Utterance-level Aggregation For Speaker Recognition In The Wild☆372Mar 24, 2023Updated 2 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆74Apr 7, 2024Updated last year
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)☆30Feb 28, 2025Updated last year
- A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.☆243Feb 15, 2024Updated 2 years ago
- ☆42Nov 22, 2024Updated last year
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆209Dec 8, 2022Updated 3 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 9 months ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago
- PyTorch implementation of RPNSD☆60Jun 17, 2024Updated last year
- ☆21Apr 6, 2021Updated 4 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Apr 29, 2020Updated 5 years ago
- ☆104Jul 5, 2023Updated 2 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32May 16, 2019Updated 6 years ago
- SyncNet for Time Synchronization☆30Mar 13, 2023Updated 3 years ago
- Code for "Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose" (Arxiv 2020) and "Predicting Personalize…☆776Dec 15, 2023Updated 2 years ago
- Demo for 2022 Interspeech☆29Jun 14, 2022Updated 3 years ago
- ☆18Nov 22, 2024Updated last year
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆32Jul 14, 2019Updated 6 years ago
- ☆843Nov 19, 2025Updated 4 months ago
- A self-supervised learning framework for audio-visual speech☆976Dec 7, 2023Updated 2 years ago
- Tensorflow implementation of x-vector topology on top of Kaldi recipe☆119Nov 5, 2019Updated 6 years ago
- Audio-Visual Speech Separation with Cross-Modal Consistency☆247Jul 25, 2023Updated 2 years ago
- ☆64Jun 28, 2023Updated 2 years ago
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024)☆16Mar 17, 2025Updated last year
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆215Aug 8, 2023Updated 2 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- Official github repo for paper "What comprises a good talking-head video generation?: A Survey and Benchmark"☆91Dec 8, 2022Updated 3 years ago
- MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]☆296Jul 7, 2024Updated last year
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆88Apr 23, 2019Updated 6 years ago
- ☆17Aug 27, 2025Updated 6 months ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- ☆429Nov 1, 2023Updated 2 years ago