de-id / video-diffusion-papers
This seminar will focus on the latest developments in the field of diffusion models, particularly video diffusion models. Topics will include aspects such as temporal and identity consistency, efficiency, and applications specifically in the realm of human avatars.
☆14Updated 5 months ago
Alternatives and similar repositories for video-diffusion-papers:
Users that are interested in video-diffusion-papers are comparing it to the libraries listed below
- Diffusion Models papers☆19Updated last year
- Use D-ID's live streaming API to stream a talking presenter☆191Updated last week
- [ECCV 2022] Official PyTorch implementation of the paper - Graph Neural Network for Cell Tracking in Microscopy Videos☆67Updated 2 years ago
- ☆117Updated 11 months ago
- ☆10Updated 7 months ago
- Work with OpenAI's streaming API at ease with Python generators☆121Updated 10 months ago
- Avatar Generation For Characters and Game Assets Using Deep Fakes☆217Updated 7 months ago
- 🔥🔥🔥 Set the world of 3D faces on fire with INFERNO 🔥🔥🔥☆221Updated 4 months ago
- [CVPR'23] Learning Neural Parametric Head Models☆257Updated 10 months ago
- Visual interpretability of image-based classification models by generative latent space disentanglement☆12Updated 9 months ago
- Alternative to Flawless AI's TrueSync. Make lips in video match provided audio using the power of Wav2Lip and GFPGAN.☆122Updated 9 months ago
- Summary of publicly available ressources such as code, datasets, and scientific papers for the FLAME 3D head model☆502Updated last week
- Nadir is a Python package designed to dynamically choose the best llm for your prompt by balancing complexity and cost and response time.…☆40Updated 2 months ago
- ☆242Updated 3 months ago
- A pytest plugin for running and analyzing LLM evaluation tests.☆120Updated 2 months ago
- This is a short example showing how to utilize Amazon SageMaker's real time endpoints with OpenAI's open source Whisper model for audio t…☆67Updated last year
- ☆351Updated 8 months ago
- Hands-on workshop for distributed training and hosting on SageMaker☆135Updated 2 weeks ago
- Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image…☆757Updated 4 months ago
- ☆21Updated last week
- The Amazon Transcribe Streaming SDK is an async Python SDK for converting audio into text via Amazon Transcribe.☆165Updated 2 months ago
- An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing …☆105Updated last year
- PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)☆363Updated 3 months ago
- Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation☆483Updated last year
- DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models☆271Updated last month
- API playground for Deepgram built with Streamlit☆22Updated last year
- High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN☆446Updated last year
- Evaluating the quality of a cell segmentation method without reference.☆10Updated last week
- ☆155Updated last year
- A question answering dataset in Modern Hebrew, containing 30,147 questions.☆22Updated 4 months ago