☆13May 11, 2024Updated 2 years ago
Alternatives and similar repositories for difftalk_preprocess
Users that are interested in difftalk_preprocess are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] Official PyTorch implementation of ”Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction“.☆20Apr 14, 2025Updated last year
- [CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"☆472Jul 15, 2024Updated last year
- InstantID for StableDiffusion 1.5.☆11Jul 6, 2024Updated last year
- A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers.☆20Feb 15, 2024Updated 2 years ago
- The source code for "A Simple Graph Contrastive Learning Framework for Short Text Classification"☆13Aug 14, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]☆302Jul 7, 2024Updated last year
- This repository is an offical PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated 2 years ago
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated last year
- This is a computer graphics course for entry-level learner☆12Oct 23, 2024Updated last year
- DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer☆168Mar 31, 2024Updated 2 years ago
- Aligns faces to the canonical face in both videos and images☆17Apr 11, 2022Updated 4 years ago
- Transformer-based Label Set Generation for Multi-modal Multi-label Emotion Detection☆14Dec 16, 2021Updated 4 years ago
- ☆64Mar 26, 2024Updated 2 years ago
- ☆14Jul 17, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for "Predicting Personalized Head Movement from Short Video and Speech Signal" (TMM)☆16Mar 31, 2023Updated 3 years ago
- [BMVC'24] G3FA: Geometry-guided GAN for Face Animation☆20Mar 14, 2025Updated last year
- It means to reciprocate the motion from video to human face and looks like the real man talking video. In this module you will find the …☆13Oct 28, 2020Updated 5 years ago
- MultiboxBot is a bot for multiboxing on WoW with up to 25 accounts using DLL injection, hooking and sockets.☆17May 5, 2026Updated 2 weeks ago
- Awesome Resources about MegEngine☆16Mar 2, 2023Updated 3 years ago
- This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆55Mar 18, 2024Updated 2 years ago
- the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"☆428May 12, 2024Updated 2 years ago
- ☆15Feb 22, 2025Updated last year
- ☆139Apr 24, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆21Oct 4, 2025Updated 7 months ago
- ☆10Feb 12, 2026Updated 3 months ago
- ☆72Jun 4, 2023Updated 2 years ago
- 这是一个在wav2lip,使用wav2lip、gfpgan、yolov5等模型用RT加速的超快推理!经测试在2070显卡上可达到0.03秒每帧实现实时推理。☆31Sep 23, 2025Updated 7 months ago
- Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appear…☆125Oct 28, 2025Updated 6 months ago
- [CVPR 2025] High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model☆60Jun 4, 2025Updated 11 months ago
- Official repository of EasyGaze3D: Towards Effective and Flexible 3D Gaze Estimation from a Single RGB Camera☆10Aug 3, 2023Updated 2 years ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆18Oct 11, 2024Updated last year
- Runway Inpainting based on Stable Diffusion☆29Oct 18, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Face Parsing via SegNeXt, trained on CelebAMask-HQ☆21Dec 21, 2023Updated 2 years ago
- Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing☆18Sep 24, 2022Updated 3 years ago
- This repository contains the models and training scripts used in the papers: "Quantizing Spiking Neural Networks with Integers" (ICONS 20…☆13Oct 20, 2020Updated 5 years ago
- A simple semi-automatic labelling tool for semantic segmention masks using SAM as support.☆15Apr 17, 2024Updated 2 years ago
- HairNet: Hairstyle Transfer with Pose Changes☆18Jul 20, 2022Updated 3 years ago
- 中文短文本数据集,用于短文本分类研究,涉及情感分类、多分类等,发布的中文公开短文本数据集☆19Aug 16, 2024Updated last year
- Code and data for the CVPR24 paper "EFHQ: Multi-purpose ExtremePose-Face-HQ dataset" [CVPR'24]☆29Jul 23, 2024Updated last year