☆27Jun 27, 2023Updated 2 years ago
Alternatives and similar repositories for cncvs_data_collector
Users who are interested in cncvs_data_collector are comparing it to the libraries listed below.
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated last year
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Apr 27, 2024Updated last year
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language (AAAI 2025)☆23Mar 17, 2025Updated last year
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- livekit agent plugins☆40Feb 19, 2026Updated last month
- FaceFormer Emo: Speech-Driven 3D Facial Animation with Emotion Embedding☆27Jul 15, 2023Updated 2 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Visual Speech Recognition☆20Dec 24, 2024Updated last year
- ☆15Oct 10, 2023Updated 2 years ago
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- Speech-Driven Expression Blendshape Based on Single-Layer Self-attention Network (AIWIN 2022)☆78Oct 21, 2022Updated 3 years ago
- ☆10Feb 17, 2023Updated 3 years ago
- [BMVC'24] G3FA: Geometry-guided GAN for Face Animation☆20Mar 14, 2025Updated last year
- ☆21Dec 9, 2023Updated 2 years ago
- Voice conversion with just linear regression.☆37Sep 25, 2025Updated 6 months ago
- ☆10Nov 19, 2023Updated 2 years ago
- ☆102Nov 26, 2025Updated 4 months ago
- ☆28Oct 1, 2023Updated 2 years ago
- [IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast☆22Oct 25, 2023Updated 2 years ago
- ☆21Mar 4, 2024Updated 2 years ago
- Tsinghua University SPMI Lab array processing toolkit☆18Nov 23, 2016Updated 9 years ago
- SyncNet for Time Synchronization☆30Mar 13, 2023Updated 3 years ago
- Prompting Large Language Models with Audio for General-Purpose Speech Summarization☆20May 14, 2025Updated 10 months ago
- An implementation of http://openaccess.thecvf.com/content_CVPRW_2019/papers/Sight%20and%20Sound/Konstantinos_Vougioukas_End-to-End_Speech…☆18Mar 19, 2020Updated 6 years ago
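- Ultra-fast wav2lip-based inference that uses the wav2lip, gfpgan, and yolov5 models with RT acceleration. Tested on a 2070 GPU, it reaches 0.03 seconds per frame, enabling real-time inference.☆31Sep 23, 2025Updated 6 months ago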
- ☆15Oct 28, 2019Updated 6 years ago
- The official code for paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions…☆22Dec 12, 2024Updated last year
- CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?☆131Dec 11, 2024Updated last year
- Auto-AVSR: Lip-Reading Sentences Project☆409Jan 8, 2025Updated last year
- ☆24Oct 8, 2021Updated 4 years ago
- ☆24Jul 15, 2024Updated last year
- MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]☆296Jul 7, 2024Updated last year
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"☆15Oct 25, 2024Updated last year
- Digital human code based on MuseTalk.☆35Sep 14, 2024Updated last year
- A dataset collected from synchronized ad-hoc microphone arrays☆19Apr 24, 2023Updated 2 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- [3DV'24] GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar☆83May 18, 2024Updated last year
- This project is a real-time Wav2Lip implementation that I am actively optimizing to enhance the precision and performance of audio-to-lip…☆11Dec 6, 2023Updated 2 years ago