Amirrezahmi / AudioVisual-Fusion-SuiteLinks
In this project, we transferred the target from the first video to the second one. Additionally, we altered the characteristics of the source audio to match those of the target audio. We then blended these two projects into a single project.
☆22Updated 2 years ago
Alternatives and similar repositories for AudioVisual-Fusion-Suite
Users that are interested in AudioVisual-Fusion-Suite are comparing it to the libraries listed below
Sorting:
- This project analyzes tweets, extracting insights on a specific hashtag. It finds common words in hashtag-containing tweets and lists acc…☆11Updated 2 years ago
- This repository houses five notebooks containing Mathematica Wolfram commands along with their detailed descriptions in Persian. Explore …☆14Updated 2 years ago
- CS Course Chronicles is a GitHub repository that documents my academic progress in computer science courses during my university studies.…☆13Updated 2 years ago
- Data Collector is an Android app that simplifies data collection and management. Easily enter questions and answers, maintain a dataset, …☆10Updated 2 years ago
- Persian/Farsi text to speech(TTS) training using coqui tts☆196Updated 10 months ago
- A collection of inspiring lists, repos, datasets, models, tools and more for Persian language speech to text(stt) and text to speech(tts)…☆87Updated last year
- unofficial implementation of Few-Shot Head Swapping in the Wild☆45Updated 2 years ago
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆65Updated last year
- Persian ASR dataset☆42Updated 2 years ago
- [ACM TOG, 2024] Identity-Preserving Face Swapping via Dual Surrogate Generative Models☆65Updated last year
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆18Updated 2 years ago
- Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appear…☆120Updated 2 months ago
- GFPGAN 1024☆49Updated 6 months ago
- Repository containing codebase for "FaceOff: A Video-to-Video Face Swapping Network" accepted at WACV 2023☆31Updated 2 years ago
- Unofficial implementation of the paper: StyleSwap: Style-Based Generator Empowers Robust Face Swapping☆52Updated 3 years ago
- PLPR utilizes YOLOv5 and custom models for high-accuracy Persian license plate recognition, featuring real-time processing and an intuiti…☆446Updated last year
- Persian OCR dateset☆80Updated 2 years ago
- [ICLR 2024] DAEFR: Dual Associated Encoder for Face Restoration☆53Updated last year
- ☆36Updated 11 months ago
- Vid Driven Portrait Animation 🤢😷☆18Updated last year
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video☆75Updated last year
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆68Updated last year
- This repository contains code for a Bank Account Management System implemented in Python. The system provides functionalities for creatin…☆18Updated 2 years ago
- Bert-Based persian spell-checker☆18Updated last year
- ☆13Updated 9 months ago
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation☆180Updated last year
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆35Updated 5 months ago
- textfx is a Python library for creating dynamic and visually engaging text effects and Loading Animation.☆18Updated last month
- A simple face restoration TensorRT deployment solution.☆86Updated last year
- PersianQuAD: The Native Question Answering Dataset for the Persian Language (Kazemi et al. IEEE ACCESS 2022)☆13Updated 2 years ago