toxtli / AutomEditor
AutomEditor is an AI-based video editor that helps video bloggers remove bloopers automatically. It uses multimodal spatio-temporal blooper recognition and localization approaches. The models were trained in Keras and integrate feature fusion techniques from face, body gestures (skeleton), emotion progression, and audio features.
☆47Updated 6 years ago
Alternatives and similar repositories for AutomEditor
Users who are interested in AutomEditor are comparing it to the libraries listed below.
- Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.☆98Updated 2 years ago
- Final Project for Stanford Deep Generative Modeling Class CS236.☆14Updated 5 years ago
- AI-generated talking head video of fake people responding to your input question text.☆67Updated 4 years ago
- Allows you to edit videos automatically☆46Updated 4 months ago
- ☆23Updated 4 years ago
- A deep learning model to lip-sync a given video with any given audio. It uses GAN architecture to orchestrate loss reconstruction or trai…☆122Updated 2 years ago
- Animate image in real time using First Order Motion Model for Image Animation☆59Updated last year
- Video chat apps with computer vision filters built on top of Streamlit☆50Updated 2 years ago
- Code to detect scenes and transitions in videos and compose a video to visualize the data.☆28Updated 6 years ago
- Real-time speech to text with specific language translation.☆47Updated 5 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆13Updated 2 years ago
- It is a simple python tool to extract key-frames from a video file using peak estimation from frame difference.☆187Updated 4 months ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆122Updated 2 years ago
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆15Updated 5 years ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆36Updated 2 years ago
- A software pipeline for creating realistic videos of people talking, using only images.☆38Updated 3 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆178Updated 2 years ago
- GANfolk are AI-generated renderings of fictional people. Each image in the collection was created by a pair of Generative Adversarial Net…☆35Updated 3 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆169Updated 5 years ago
- Speech to Facial Animation using GANs☆40Updated 3 years ago
- Real-Time Lip Sync for Live 2D Animation☆144Updated 5 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- Learning Lip Sync of Obama from Speech Audio☆66Updated 5 years ago
- Trained a deep learning model to automatically generate highlights from a sports video☆48Updated 7 years ago
- A curated list of awesome affective computing 🤖❤️ papers, software, open-source projects, and resources☆180Updated 5 years ago
- repo for active speaker detection for media videos.☆29Updated last year
- Turning films into structured data to unlock the vast wealth of emotional knowledge within.☆30Updated 3 years ago
- Semantically search through a database of videos (using generated summaries)☆69Updated 7 years ago
- You Said That?: Synthesising Talking Faces from Audio☆70Updated 7 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago