toxtli / AutomEditor
AutomEditor is an AI based video editor that helps video bloggers to remove bloopers automatically. It uses multimodal spatio-temporal blooper recognition and localization approaches. The models were trained in keras and integrate feature fusion techniques from face, body gestures (skelethon), emotions progression, and audio features
☆48Updated 5 years ago
Alternatives and similar repositories for AutomEditor:
Users that are interested in AutomEditor are comparing it to the libraries listed below
- Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.☆98Updated 2 years ago
- Final Project for Stanford Deep Generative Modeling Class CS236.☆14Updated 5 years ago
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 6 years ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆13Updated 4 years ago
- Learning Lip Sync of Obama from Speech Audio☆67Updated 4 years ago
- Turning films into structured data to unlock the vast wealth of emotional knowledge within.☆30Updated 2 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Real-Time Lip Sync for Live 2D Animation☆138Updated 5 years ago
- Pytorch implementation of Dance Dance Generation: Motion Transfer for Internet Videos☆44Updated 5 years ago
- Conditional lyrics generator -> pre-trained GPT2 model fine-tuned on lyrics with features dataset.☆40Updated 5 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated 2 years ago
- Algorithms for Intelligent Assessment of Human Personality Traits based on His Multimodal Data for ranking potential candidates to perfo…☆35Updated 4 months ago
- ☆45Updated 2 years ago
- A deep learning model to lip-sync a given video with any given audio. It uses GAN architecture to orchestrate loss reconstruction or trai…☆117Updated 2 years ago
- This is the repository containing the solution for FG-2020 ABAW Competition☆117Updated 11 months ago
- ☆24Updated 6 years ago
- Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"☆100Updated 4 years ago
- Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation☆60Updated 5 years ago
- Allows you to edit videos automatically☆44Updated 3 years ago
- A modified version of vid2vid for Speech2Video, Text2Video Paper☆35Updated last year
- A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication,…☆81Updated 3 years ago
- Convert Text to Audio + Video == Youtube Video !!☆38Updated 4 years ago
- Code to detect scenes and transitions in videos and compose a video to visualize the data.☆28Updated 6 years ago
- Automated Web Video Editing Tool☆72Updated 2 years ago
- A pipeline to read lips and generate speech for the read content, i.e Lip to Speech Synthesis.☆83Updated 3 years ago
- Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".☆10Updated 4 years ago
- Human Emotion Understanding using multimodal dataset.☆97Updated 4 years ago
- Audio driven video synthesis☆41Updated 2 years ago
- Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).☆16Updated 2 years ago
- Repository for th OMG Emotion Challenge☆89Updated 4 months ago