Amirrezahmi/AudioVisual-Fusion-Suite

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Amirrezahmi/AudioVisual-Fusion-Suite)

Amirrezahmi / AudioVisual-Fusion-Suite

In this project, we transferred the target from the first video to the second one. Additionally, we altered the characteristics of the source audio to match those of the target audio. We then blended these two projects into a single project.

☆22

Alternatives and similar repositories for AudioVisual-Fusion-Suite

Users that are interested in AudioVisual-Fusion-Suite are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Amirrezahmi / Image-Decoding
View on GitHub
Unveil hidden messages within images using Minesweeper-inspired decoding. Left-click to reveal clues, right-click to flag suspected mines…
☆36Jul 23, 2023Updated 3 years ago
Amirrezahmi / SelfTalker
View on GitHub
Engage in conversation with your virtual self using AI techniques like NLP, voice cloning, and computer vision. Get accurate answers with…
☆86Jul 23, 2023Updated 3 years ago
Amirrezahmi / Zozo-Assistant
View on GitHub
Zozo Assistant is a voice-activated chatbot that performs tasks based on user commands. It utilizes speech recognition, NLP, and ML to pr…
☆62Aug 10, 2023Updated 2 years ago
Amirrezahmi / CS-Course-Chronicles
View on GitHub
CS Course Chronicles is a GitHub repository that documents my academic progress in computer science courses during my university studies.…
☆13Aug 8, 2023Updated 2 years ago
Amirrezahmi / data-collector
View on GitHub
Data Collector is an Android app that simplifies data collection and management. Easily enter questions and answers, maintain a dataset, …
☆10Jul 12, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Amirrezahmi / Hashtag-Analyzer
View on GitHub
This project analyzes tweets, extracting insights on a specific hashtag. It finds common words in hashtag-containing tweets and lists acc…
☆11May 28, 2023Updated 3 years ago
Amirrezahmi / Mathematica-Wolfram-notebooks
View on GitHub
This repository houses five notebooks containing Mathematica Wolfram commands along with their detailed descriptions in Persian. Explore …
☆14Jul 4, 2023Updated 3 years ago
Amirrezahmi / IG-Curiosity-Analyzer
View on GitHub
Analyze the curiosity of your Instagram followers by calculating their engagement with your stories. Identify 100% curious followers and …
☆15Jul 14, 2023Updated 3 years ago
Amirrezahmi / bank-management
View on GitHub
This repository contains code for a Bank Account Management System implemented in Python. The system provides functionalities for creatin…
☆18Jun 22, 2023Updated 3 years ago
osmr / propainter
View on GitHub
Streaming ProPainter
☆15Sep 18, 2024Updated last year
peterwisu / lip-synthesis
View on GitHub
Audio-Visual Lip Synthesis via Intermediate Landmark Representation
☆19May 16, 2023Updated 3 years ago
Inferencer / SickFace
View on GitHub
Vid Driven Portrait Animation 🤢😷
☆18Jul 7, 2024Updated 2 years ago
chuckkay / QueueItUp-addon
View on GitHub
The 1 file drop in Add On - for Queueing Jobs in Facefusion
☆17Aug 2, 2025Updated 11 months ago
ykk648 / face_power
View on GitHub
Face_lib separate from AI_Power
☆27Nov 10, 2025Updated 8 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
sowwnn / KFusion-Dual-Domain-for-Speech-to-Landmarks
View on GitHub
KAN-based Fusion of Dual Domain for Audio-Driven Landmarks Generation of the model can help you generate an sequence of facial lanmarks f…
☆32Oct 28, 2025Updated 9 months ago
kealiu / ComfyUI-Zero123-Porting
View on GitHub
ComfyUI Node for Zero-1-to-3: Zero-shot One Image to 3D Object
☆22Aug 22, 2024Updated last year
natlamir / DINet-UI
View on GitHub
Windows Forms user interface for making lip sync videos with DINet and OpenFace
☆26Oct 14, 2023Updated 2 years ago
vincentmvdm / for-me-page
View on GitHub
Arc Boost that removes bad tweets using AI.
☆89May 24, 2023Updated 3 years ago
chrisgoringe / cg-noise
View on GitHub
☆26Aug 7, 2024Updated last year
SOELexicon / ComfyUI-LexTools
View on GitHub
Bunch of custom nodes; Segformer - allows you to segment images by specifying the segment in an array
☆32Mar 28, 2025Updated last year
zyj-2000 / THQA
View on GitHub
Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"
☆39Jul 23, 2025Updated last year
ZonglinL / ConsecutiveBrownianBridge
View on GitHub
[ACM MM 2024] Frame Interpolation with Consecutive Brownian Bridge Diffusion Model
☆37Feb 22, 2025Updated last year
jinny960812 / SyncTalkFace
View on GitHub
SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory
☆33Nov 3, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
processone / xmpp
View on GitHub
Erlang/Elixir XMPP parsing and serialization library on top of Fast XML
☆150Jul 21, 2026Updated last week
langzizhixin / IP_LAP_GFPGAN
View on GitHub
☆33Feb 8, 2025Updated last year
JarodMica / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆36May 17, 2025Updated last year
RisingEntropy / LPFInISR
View on GitHub
Official repository for paper: Exploring the Low-Pass Filtering Behavior in Image Super-Resolution
☆41Jul 11, 2024Updated 2 years ago
CVI-SZU / GazeFlow
View on GitHub
This is the official implement of GazeFlow.
☆49Feb 12, 2022Updated 4 years ago
instant-high / wav2lip-onnx-256
View on GitHub
Simple and fast wav2lip using new 256x256 resolution trained onnx-converted model for inference. Easy installation
☆47Oct 13, 2024Updated last year
jeffy5 / comfyui-faceless-node
View on GitHub
Next generation face toolkit for ComfyUI.
☆61Apr 3, 2026Updated 3 months ago
SJTU-Lucy / EmoFace
View on GitHub
☆58Jul 9, 2025Updated last year
HallowSiddharth / VoiceCraftAI
View on GitHub
VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.
☆71Oct 8, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
rlgnswk / NeRFFaceSpeech_Code
View on GitHub
One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024
☆65Oct 24, 2024Updated last year
liutaocode / DiffDub
View on GitHub
[ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder
☆70Jul 21, 2024Updated 2 years ago
community / events
View on GitHub
☆216Mar 15, 2024Updated 2 years ago
pawansharmaaaa / Lip_Wise
View on GitHub
Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.
☆79Jun 19, 2025Updated last year
CVMI-Lab / Speech2Lip
View on GitHub
[ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
☆76Mar 28, 2024Updated 2 years ago
zzj1111 / Preprocessed-CMLR-Dataset-For-Wav2Lip
View on GitHub
Considering the original Wav2Lip was trained on LSR2 and didn't have good performance on Chinese. I preprocessed CMLR Dataset and would t…
☆63Sep 23, 2023Updated 2 years ago
ICTMCG / CSCS
View on GitHub
[ACM TOG, 2024] Identity-Preserving Face Swapping via Dual Surrogate Generative Models
☆71Jan 9, 2025Updated last year