MetaGenAI / multimodal-transflowerLinks
multimodal probabilistic autoregressive models
☆19Updated 2 years ago
Alternatives and similar repositories for multimodal-transflower
Users that are interested in multimodal-transflower are comparing it to the libraries listed below
Sorting:
- multimodal transformer☆75Updated 3 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆92Updated 3 years ago
- The personal repository of the work: *DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer*.☆56Updated 4 years ago
- [ECCV2022] D2M-GAN for music generation from dance videos☆85Updated 3 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆122Updated 3 years ago
- The project page repo for Neural Dubber.☆30Updated 2 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆34Updated 8 months ago
- Talking Face Generation by Conditional Recurrent Adversarial Network☆61Updated 5 years ago
- General Prior for Anime - 1☆44Updated 2 years ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109Updated 3 years ago
- iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis☆53Updated 2 years ago
- The official implementation for ICMI 2020 Best Paper Award "Gesticulator: A framework for semantically-aware speech-driven gesture gener…☆127Updated 2 years ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆39Updated 2 years ago
- This repository contains the dataset used in paper "ChoreoMaster: Choreography -Oriented Music Driven Dance Synthesis".☆117Updated 4 years ago
- Code for "Animating Portrait Line Drawings from a Single Face Photo and a Speech Signal"☆57Updated 3 years ago
- The original weights of some Caffe models, ported to PyTorch.☆11Updated 3 years ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆110Updated 3 years ago
- Toward Spatially Unbiased Generative Models (ICCV 2021)☆90Updated 4 years ago
- Text-to-video generation.☆10Updated 3 years ago
- Demo for 2022 Interspeech☆29Updated 3 years ago
- Semantic image editing in realtime with a multi-parameter interface for StyleCLIP global directions☆13Updated 3 years ago
- This is the official implementation for IVA'20 Best Paper Award paper "Let's Face It: Probabilistic Multi-modal Interlocutor-aware Gener…☆16Updated 2 years ago
- ☆102Updated 3 weeks ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- An improved version of APB2Face: Real-Time Audio-Guided Multi-Face Reenactment☆84Updated 4 years ago
- ☆30Updated 3 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- Anim-400K: A dataset designed from the ground up for automated dubbing of video☆109Updated last year
- Looking up a generative latent vectors from (face) reference images.☆33Updated 6 years ago
- You Said That?: Synthesising Talking Faces from Audio☆70Updated 7 years ago