toxtli / AutomEditorLinks
AutomEditor is an AI based video editor that helps video bloggers to remove bloopers automatically. It uses multimodal spatio-temporal blooper recognition and localization approaches. The models were trained in keras and integrate feature fusion techniques from face, body gestures (skelethon), emotions progression, and audio features
☆47Updated 6 years ago
Alternatives and similar repositories for AutomEditor
Users that are interested in AutomEditor are comparing it to the libraries listed below
Sorting:
- Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.☆98Updated 2 years ago
- Final Project for Stanford Deep Generative Modeling Class CS236.☆14Updated 5 years ago
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 7 years ago
- Learning Lip Sync of Obama from Speech Audio☆66Updated 4 years ago
- A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication,…☆82Updated 4 years ago
- Real-Time Lip Sync for Live 2D Animation☆142Updated 5 years ago
- A curated list of awesome affective computing 🤖❤️ papers, software, open-source projects, and resources☆176Updated 5 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- A deep learning model to lip-sync a given video with any given audio. It uses GAN architecture to orchestrate loss reconstruction or trai…☆120Updated 2 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆172Updated 2 years ago
- AI-generated talking head video of fake people responding to your input question text.☆68Updated 4 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆107Updated last year
- Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).☆16Updated 2 years ago
- A modified version of vid2vid for Speech2Video, Text2Video Paper☆35Updated 2 years ago
- Speech to Facial Animation using GANs☆40Updated 3 years ago
- ☆24Updated 6 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆13Updated 2 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation☆64Updated 5 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆170Updated 4 years ago
- Code and models for evaluating a state-of-the-art lip reading network☆195Updated 2 years ago
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL☆176Updated 2 years ago
- Animate image in real time using First Order Motion Model for Image Animation☆58Updated last year
- Turning films into structured data to unlock the vast wealth of emotional knowledge within.☆30Updated 3 years ago
- An automatic movie trailer generator.☆41Updated 2 years ago
- Code to detect scenes and transitions in videos and compose a video to visualize the data.☆28Updated 6 years ago
- Converter Video to Anime. Based on "AnimeGAN" repository.☆38Updated 5 years ago
- Automated Lip reading from real-time videos in tensorflow in python☆164Updated 7 years ago
- A gui to help make a text to speech dataset.☆18Updated 2 years ago
- Automated Web Video Editing Tool☆73Updated 2 years ago