cainky / Lipsync
GUI to sync video mouth movements to match audio, utilizing wav2lip-hq. Completed as part of a technical interview.
☆11Updated 6 months ago
Related projects
Alternatives and complementary repositories for Lipsync
- optimized wav2lip☆19Updated 10 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆64Updated 4 months ago
- Full version of wav2lip-onnx including face alignment and face enhancement and more...☆70Updated last month
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆49Updated 8 months ago
- Wav2Lip UHQ Improvement with ControlNet 1.1☆73Updated last year
- This project fixes the Wav2Lip project so that it can run on Python 3.9. Wav2Lip is a project that can be used to lip-sync videos to audi…☆18Updated last year
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation☆22Updated last month
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆31Updated 2 years ago
- Talking head animation☆27Updated 11 months ago
- Simple and fast wav2lip using new 256x256 resolution trained onnx-converted model for inference. Easy installation☆27Updated last month
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆35Updated 9 months ago
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆14Updated last year
- DINet-based inference service for video streams and video files☆13Updated last year
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆78Updated 10 months ago
- Updated fork of wav2lip-hq allowing for the use of current ESRGAN models☆49Updated 6 months ago
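- Ultra-fast wav2lip inference with RT acceleration, using models such as wav2lip, gfpgan, and yolov5. Tested on a 2070 GPU, it reaches 0.03 seconds per frame, enabling real-time inference.☆26Updated last year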
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆64Updated last year
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆55Updated 3 weeks ago