MetaGenAI / multimodal-transflowerLinks
multimodal probabilistic autoregressive models
☆19Updated last year
Alternatives and similar repositories for multimodal-transflower
Users that are interested in multimodal-transflower are comparing it to the libraries listed below
Sorting:
- multimodal transformer☆74Updated 3 years ago
- The personal repository of the work: *DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer*.☆55Updated 4 years ago
- The official implementation for ICMI 2020 Best Paper Award "Gesticulator: A framework for semantically-aware speech-driven gesture gener…☆127Updated 2 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆91Updated 3 years ago
- ☆101Updated last year
- General Prior for Anime - 1☆44Updated 2 years ago
- The project page repo for Neural Dubber.☆30Updated last year
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109Updated 3 years ago
- This is the official implementation for IVA '19 paper "Analyzing Input and Output Representations for Speech-Driven Gesture Generation".☆110Updated 2 years ago
- Demo for 2022 Interspeech☆29Updated 3 years ago
- This repository contains the gesture generation model from the paper "Moving Fast and Slow" (https://www.tandfonline.com/doi/full/10.1080…☆25Updated 2 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆122Updated 2 years ago
- This is the official implementation for IVA'20 Best Paper Award paper "Let's Face It: Probabilistic Multi-modal Interlocutor-aware Gener…☆16Updated 2 years ago
- This is an official PyTorch implementation of "Gesture2Vec: Clustering Gestures using Representation Learning Methods for Co-speech Gestu…☆26Updated last year
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆34Updated 5 months ago
- [ECCV2022] D2M-GAN for music generation from dance videos☆86Updated 3 years ago
- Finally, some decent sample sentences☆23Updated last year
- ☆30Updated 2 years ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆109Updated 3 years ago
- Demo for 2022 ICASSP☆64Updated 3 years ago
- This repository contains the dataset used in paper "ChoreoMaster: Choreography -Oriented Music Driven Dance Synthesis".☆116Updated 3 years ago
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆86Updated 5 years ago
- Talking with Hands☆92Updated 4 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023☆86Updated last year
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆39Updated last year
- Code & demo for the animation of still facial landmarks from an initial pose.☆15Updated 2 years ago
- Code for "Animating Portrait Line Drawings from a Single Face Photo and a Speech Signal"☆58Updated 3 years ago
- ☆24Updated 2 years ago