sectum1919 / cncvs_data_collectorView external linksLinks
☆27Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for cncvs_data_collector
Users that are interested in cncvs_data_collector are comparing it to the libraries listed below
Sorting:
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated last year
- ☆15Oct 10, 2023Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- [BMVC'24] G3FA: Geometry-guided GAN for Face Animation☆20Mar 14, 2025Updated 10 months ago
- livekit agent plugins☆36Updated this week
- ☆15Oct 28, 2019Updated 6 years ago
- ☆20Mar 4, 2024Updated last year
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions…☆22Dec 12, 2024Updated last year
- ☆21Dec 9, 2023Updated 2 years ago
- Voice conversion with just linear regression.☆33Sep 25, 2025Updated 4 months ago
- Speech-Driven Expression Blendshape Based on Single-Layer Self-attention Network (AIWIN 2022)☆78Oct 21, 2022Updated 3 years ago
- FaceFormer Emo: Speech-Driven 3D Facial Animation with Emotion Embedding☆27Jul 15, 2023Updated 2 years ago
- ☆100Nov 26, 2025Updated 2 months ago
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ☆55Dec 20, 2023Updated 2 years ago
- A novel apporach for personalized speech-driven 3D facial animation☆57Apr 26, 2024Updated last year
- SyncNet for Time Synchronization☆30Mar 13, 2023Updated 2 years ago
- ☆29Jun 30, 2025Updated 7 months ago
- ☆24Oct 8, 2021Updated 4 years ago
- ☆28Oct 1, 2023Updated 2 years ago
- Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.☆37Jun 3, 2025Updated 8 months ago
- ☆68Jul 16, 2023Updated 2 years ago
- [NeurIPS 2024] This is the official repo of the paper "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Li…☆135Feb 9, 2025Updated last year
- 这是一个在wav2lip,使用wav2lip、gfpgan、yolov5等模型用RT加速的超快推理!经测试在2070显卡上可达到0.03秒每帧实现实时推理。☆31Sep 23, 2025Updated 4 months ago
- ☆33Nov 28, 2023Updated 2 years ago
- Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 202…☆73Feb 20, 2024Updated last year
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?☆130Dec 11, 2024Updated last year
- MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]☆290Jul 7, 2024Updated last year
- 基于MuseTalk的数字人代码。☆35Sep 14, 2024Updated last year
- ☆33Feb 22, 2025Updated 11 months ago
- Official code for ICLR25 "TEASER: Token Enhanced Spatial Modeling for Expressions Reconstruction"☆44Apr 22, 2025Updated 9 months ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆73Apr 7, 2024Updated last year
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Nov 3, 2022Updated 3 years ago
- [3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar☆83May 18, 2024Updated last year
- [INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase…☆190Nov 5, 2024Updated last year
- A modified version of vid2vid for Speech2Video, Text2Video Paper☆36Jun 4, 2023Updated 2 years ago
- DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer☆166Mar 31, 2024Updated last year