☆27Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for cncvs_data_collector
Users that are interested in cncvs_data_collector are comparing it to the libraries listed below
Sorting:
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated last year
- ☆15Oct 10, 2023Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- [BMVC'24] G3FA: Geometry-guided GAN for Face Animation☆20Mar 14, 2025Updated 11 months ago
- ☆15Oct 28, 2019Updated 6 years ago
- ☆21Mar 4, 2024Updated 2 years ago
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions…☆22Dec 12, 2024Updated last year
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Apr 27, 2024Updated last year
- livekit agent plugins☆38Feb 19, 2026Updated 2 weeks ago
- ☆21Dec 9, 2023Updated 2 years ago
- Voice conversion with just linear regression.☆35Sep 25, 2025Updated 5 months ago
- Speech-Driven Expression Blendshape Based on Single-Layer Self-attention Network (AIWIN 2022)☆78Oct 21, 2022Updated 3 years ago
- FaceFormer Emo: Speech-Driven 3D Facial Animation with Emotion Embedding☆27Jul 15, 2023Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- [IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast☆21Oct 25, 2023Updated 2 years ago
- ☆102Nov 26, 2025Updated 3 months ago
- A novel apporach for personalized speech-driven 3D facial animation☆59Apr 26, 2024Updated last year
- ☆55Dec 20, 2023Updated 2 years ago
- SyncNet for Time Synchronization☆30Mar 13, 2023Updated 2 years ago
- ☆24Oct 8, 2021Updated 4 years ago
- ☆28Oct 1, 2023Updated 2 years ago
- ☆30Jun 30, 2025Updated 8 months ago
- Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.☆37Jun 3, 2025Updated 9 months ago
- ☆68Jul 16, 2023Updated 2 years ago
- [NeurIPS 2024] This is the official repo of the paper "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Li…☆134Feb 9, 2025Updated last year
- 这是一个在wav2lip,使用wav2lip、gfpgan、yolov5等模型用RT加速的超快推理!经测试在2070显卡上可达到0.03秒每帧实现实时推理。☆31Sep 23, 2025Updated 5 months ago
- ☆33Nov 28, 2023Updated 2 years ago
- Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 202…☆74Feb 20, 2024Updated 2 years ago
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?☆131Dec 11, 2024Updated last year
- MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]☆292Jul 7, 2024Updated last year
- 基于MuseTalk的数字人代码。☆35Sep 14, 2024Updated last year
- ☆33Feb 22, 2025Updated last year
- Official code for ICLR25 "TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction"☆46Apr 22, 2025Updated 10 months ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆73Apr 7, 2024Updated last year
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- [3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar☆83May 18, 2024Updated last year
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Nov 3, 2022Updated 3 years ago
- [INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase…☆191Nov 5, 2024Updated last year