toxtli / AutomEditor
AutomEditor is an AI-based video editor that helps video bloggers remove bloopers automatically. It uses multimodal spatio-temporal approaches to blooper recognition and localization. The models were trained in Keras and integrate feature fusion techniques over face, body gesture (skeleton), emotion progression, and audio features.
☆45Updated 5 years ago
Related projects
Alternatives and complementary repositories for AutomEditor
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Updated last year
- Allows you to edit videos automatically☆43Updated 3 years ago
- Final Project for Stanford Deep Generative Modeling Class CS236.☆14Updated 4 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆50Updated 2 years ago
- ☆24Updated 5 years ago
- An automatic movie trailer generator.☆40Updated last year
- You Said That?: Synthesising Talking Faces from Audio☆69Updated 6 years ago
- Learning Lip Sync of Obama from Speech Audio☆67Updated 4 years ago
- A Python Library for Multimodal Analysis of Movies and Content-based Movie Recommendation☆28Updated 2 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- Real-time human emotion detection and analysis through voice and speech pattern processing☆23Updated 5 years ago
- ☆64Updated 3 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆103Updated 5 months ago
- AI-generated talking head video of fake people responding to your input question text.☆68Updated 3 years ago
- A "talking head" project capable of displaying emotions, created using Blender and Python☆19Updated 6 years ago
- This is a phonemic multilingual (Russian-English) Implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-S…☆51Updated 4 years ago
- Predicting various emotions in human speech signals by detecting the different speech components affected by human emotion.☆43Updated 3 months ago
- The library is useful for classifying the emotions present in any audio file (call/music/recordings) into three classes, namely positive, neg…☆31Updated 8 years ago
- Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.☆96Updated last year
- A modified version of vid2vid for Speech2Video, Text2Video Paper☆35Updated last year
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆167Updated 4 years ago
- A pipeline to read lips and generate speech for the read content, i.e., lip-to-speech synthesis.☆78Updated 2 years ago
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.☆10Updated last year
- Speech to Facial Animation using GANs☆41Updated 3 years ago
- Mirror of the VoxCeleb dataset, a large-scale speaker identification dataset☆68Updated 5 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art zero-shot multi-speaker functionality. A web API built with the…☆46Updated last year
- A classification model in Machine Learning capable of recognizing human facial emotions☆23Updated 6 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆58Updated 3 years ago
- Speech-conditioned face generation using Generative Adversarial Networks☆87Updated last year
- Gaze estimation from 2D image☆11Updated 8 months ago