Amirrezahmi / AudioVisual-Fusion-Suite
In this project, we transferred the target from the first video to the second one. Additionally, we altered the characteristics of the source audio to match those of the target audio. We then blended these two projects into a single project.
☆22 Updated 2 years ago
Alternatives and similar repositories for AudioVisual-Fusion-Suite
Users that are interested in AudioVisual-Fusion-Suite are comparing it to the libraries listed below
- This project analyzes tweets, extracting insights on a specific hashtag. It finds common words in hashtag-containing tweets and lists acc… ☆11 Updated 2 years ago
- Persian/Farsi text to speech (TTS) training using coqui tts ☆195 Updated 11 months ago
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024 ☆65 Updated last year
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder ☆68 Updated last year
- This repository houses five notebooks containing Mathematica Wolfram commands along with their detailed descriptions in Persian. Explore … ☆14 Updated 2 years ago
- A collection of inspiring lists, repos, datasets, models, tools and more for Persian language speech to text (stt) and text to speech (tts)… ☆86 Updated last year
- Vid Driven Portrait Animation 🤢😷 ☆18 Updated last year
- This is a HeadSwap project, not only a face swap ☆34 Updated 3 years ago
- CS Course Chronicles is a GitHub repository that documents my academic progress in computer science courses during my university studies.… ☆13 Updated 2 years ago
- Data Collector is an Android app that simplifies data collection and management. Easily enter questions and answers, maintain a dataset, … ☆10 Updated 2 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription. ☆34 Updated 10 months ago
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video ☆76 Updated last year
- KAN-based Fusion of Dual Domain for Audio-Driven Landmarks Generation. The model can help you generate a sequence of facial landmarks f… ☆30 Updated 3 months ago
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation ☆180 Updated last year
- Windows Forms user interface for making lip sync videos with DINet and OpenFace ☆25 Updated 2 years ago
- A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods ☆146 Updated 2 years ago
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning ☆81 Updated 2 years ago
- [WACV 2024] "CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer" ☆129 Updated last year
- ☆64 Updated last year
- Unofficial implementation of Few-Shot Head Swapping in the Wild ☆45 Updated 2 years ago
- Persian text-to-speech Streamlit interface ☆45 Updated last year
- Implementation of Megaportrait ☆44 Updated last year
- [ACM TOG, 2024] Identity-Preserving Face Swapping via Dual Surrogate Generative Models ☆67 Updated last year
- [IJCV 2024] Code for ReliTalk ☆126 Updated 2 years ago
- ☆10 Updated 2 years ago
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation ☆18 Updated 2 years ago
- ManaTTS is the largest open Persian speech dataset with 114+ hours of transcribed audio. Includes data collection pipeline and tools. Sui… ☆48 Updated 6 months ago
- Preprocessing Scripts for Talking Face Generation ☆92 Updated last year
- PLPR utilizes YOLOv5 and custom models for high-accuracy Persian license plate recognition, featuring real-time processing and an intuiti… ☆446 Updated last year
- PyTorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation". ☆214 Updated last year