yxdydgithub/difftalk_preprocess

☆13

Alternatives and similar repositories for difftalk_preprocess

Users who are interested in difftalk_preprocess are comparing it to the repositories listed below.

sstzal / DiffTalk
[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"
☆472 · Jul 15, 2024 · Updated 2 years ago
Ganesh-tamang / LivePortrait_video
☆14 · Jul 17, 2024 · Updated 2 years ago
TheDenk / InstantID-SD1.5
InstantID for StableDiffusion 1.5.
☆11 · Jul 6, 2024 · Updated 2 years ago
KEAML-JLU / SimSTC
The source code for "A Simple Graph Contrastive Learning Framework for Short Text Classification"
☆13 · Aug 14, 2025 · Updated 11 months ago
MingtaoGuo / Relightable-Portrait-Animation
[CVPR 2025] High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model
☆60 · Jun 4, 2025 · Updated last year
CyberAgentAILab / regularized-bon
Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
☆14 · Apr 4, 2025 · Updated last year
uniBruce / Mead
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]
☆306 · Jul 7, 2024 · Updated 2 years ago
MontaEllis / SD-GAN
This repository is an official PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.
☆13 · Mar 18, 2024 · Updated 2 years ago
haoningwu3639 / SimpleSDM-Video
A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers.
☆20 · Feb 15, 2024 · Updated 2 years ago
theEricMa / DiffSpeaker
DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer
☆166 · Mar 31, 2024 · Updated 2 years ago
vvvwo / CG_Lesson_Beginner
This is a computer graphics course for entry-level learners
☆12 · Oct 23, 2024 · Updated last year
zhutengjie / Ref-MC2-Code
[NeurIPS 2024] Official PyTorch implementation of "Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction".
☆22 · Apr 14, 2025 · Updated last year
PeterFanFan / Emospeaker_code
☆64 · Mar 26, 2024 · Updated 2 years ago
DinoMan / face-processor
Aligns faces to the canonical face in both videos and images
☆17 · Apr 11, 2022 · Updated 4 years ago
Likon69 / CopilotBuddy
Public WotLK 3.3.5a bot in C#/WPF. API surface ported from Honorbuddy, retargeted at build 12340 and custom servers. │ Botbases, navig…
☆15 · Jul 13, 2026 · Updated last week
MANLP-suda / MMESGN
Transformer-based Label Set Generation for Multi-modal Multi-label Emotion Detection
☆14 · Dec 16, 2021 · Updated 4 years ago
yiranran / Predict-Personalized-Head-Movement-TMM
Code for "Predicting Personalized Head Movement from Short Video and Speech Signal" (TMM)
☆16 · Mar 31, 2023 · Updated 3 years ago
MegEngine / awesome-megengine
Awesome Resources about MegEngine
☆16 · Mar 2, 2023 · Updated 3 years ago
ayushgupta9198 / Avatarify
It means to reciprocate the motion from video to human face and looks like the real man talking video. In this module you will find the …
☆13 · Oct 28, 2020 · Updated 5 years ago
langzizhixin / wav2lip-576x576
This is a project about talking faces. We use 576X576 sized facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…
☆56 · Mar 18, 2024 · Updated 2 years ago
yl4467 / singer
☆15 · Feb 22, 2025 · Updated last year
MRzzm / HDTF
The dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
☆429 · May 12, 2024 · Updated 2 years ago
SerenaTetart / MultiboxBot
MultiboxBot is a bot for multiboxing on WoW with up to 40 accounts using DLL injection, hooking and sockets.
☆18 · Jul 8, 2026 · Updated last week
cvlab-kaist / Talk3D
☆139 · Apr 24, 2024 · Updated 2 years ago
FudanCVL / AVI-Bench
[ICML'26] Toward Human-like Audio-Visual Intelligence of Omni-MLLMs
☆15 · Jun 20, 2026 · Updated last month
ebonyfaye / ema
☆10 · Feb 12, 2026 · Updated 5 months ago
dfki-av / G3FA
[BMVC'24] G3FA: Geometry-guided GAN for Face Animation
☆20 · Mar 14, 2025 · Updated last year
oneCodeSuperman / wav2lip_hq_trt
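Ultra-fast inference built on wav2lip, using wav2lip, GFPGAN, YOLOv5 and other models with RT (presumably TensorRT) acceleration. Tested on a 2070 GPU, it reaches 0.03 seconds per frame, achieving real-time inference.
☆31 · Sep 23, 2025 · Updated 9 months ago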
ShunyuYao / DFA-NeRF
☆72 · Jun 4, 2023 · Updated 3 years ago
google / tim-gan
☆11 · Dec 11, 2020 · Updated 5 years ago
andrerochow / fsrt
Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appear…
☆125 · Oct 28, 2025 · Updated 8 months ago
Likekekeke / EasyGaze3D
Official repository of EasyGaze3D: Towards Effective and Flexible 3D Gaze Estimation from a Single RGB Camera
☆10 · Aug 3, 2023 · Updated 2 years ago
GeWu-Lab / Stepping-Stones
The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024
☆18 · Oct 11, 2024 · Updated last year
Intelligent-Microsystems-Lab / QuantizedSNNs
This repository contains the models and training scripts used in the papers: "Quantizing Spiking Neural Networks with Integers" (ICONS 20…
☆13 · Oct 20, 2020 · Updated 5 years ago
AiArt-Gao / FaceParsing-SegNeXt
Face Parsing via SegNeXt, trained on CelebAMask-HQ
☆21 · Dec 21, 2023 · Updated 2 years ago
MANLP-suda / HHMPN
Multi-modal Multi-label Emotion Recognition with Heterogeneous Hierarchical Message Passing
☆18 · Sep 24, 2022 · Updated 3 years ago
faraday / runway-stable-diffusion-inpainting
Runway Inpainting based on Stable Diffusion
☆31 · Oct 18, 2022 · Updated 3 years ago
ZPdesu / HairNet
HairNet: Hairstyle Transfer with Pose Changes
☆18 · Jul 20, 2022 · Updated 4 years ago
GATECH-EIC / FracTrain
[NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…
☆10 · Feb 13, 2022 · Updated 4 years ago