de-id / video-diffusion-papers
This seminar focuses on recent developments in diffusion models, particularly video diffusion models. Topics include temporal and identity consistency, efficiency, and applications to human avatars.
☆14 Updated 7 months ago
Alternatives and similar repositories for video-diffusion-papers
Users who are interested in video-diffusion-papers are comparing it to the repositories listed below.
- Diffusion Models papers ☆20 Updated 2 years ago
- Use D-ID's live streaming API to stream a talking presenter ☆201 Updated 2 weeks ago
- Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation ☆486 Updated last year
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi… ☆74 Updated 11 months ago
- ☆14 Updated last year
- Official Python SDK for Deepgram. ☆322 Updated this week
- Avatar Generation For Characters and Game Assets Using Deep Fakes ☆221 Updated 10 months ago
- AvaChat - is a realtime AI chat demo with animated talking heads - it uses Large Language Models via api (OpenAI and Claude) as text inpu… ☆103 Updated last month
- PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23) ☆370 Updated 5 months ago
- Talking head video AI generator ☆78 Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆133 Updated last year
- AWS Setup for setting up streaming for OpenAI endpoint ☆23 Updated 2 years ago
- ☆127 Updated last year
- FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021. ☆381 Updated 2 years ago
- ☆359 Updated 10 months ago
- Python client for Hume AI ☆123 Updated this week
- Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter) ☆260 Updated last year
- ☆226 Updated last year
- 📖 A curated list of resources dedicated to talking face. ☆1,504 Updated 6 months ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult… ☆38 Updated last year
- ☆832 Updated last year
- Out of time: automated lip sync in the wild ☆779 Updated last year
- code for paper "Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion" in the conference of IJCAI 2021 ☆345 Updated last year
- ☆76 Updated 2 months ago
- API playground for Deepgram built with Streamlit ☆22 Updated last year
- High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN ☆467 Updated last year
- Code for MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement ☆385 Updated 2 years ago
- The Amazon Transcribe Streaming SDK is an async Python SDK for converting audio into text via Amazon Transcribe. ☆174 Updated last month
- [CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior ☆581 Updated last year
- This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation" ☆376 Updated last year