dsforza96 / visual-micLinks
Passive Recovery of Sound from Video
☆57Updated 5 years ago
Alternatives and similar repositories for visual-mic
Users that are interested in visual-mic are comparing it to the libraries listed below
Sorting:
- When sound hits an object, it causes small vibrations on the object’s surface. Here we show how, using only high-speed video of the objec…☆17Updated last year
- Tacotron2 Training Notebook for FakeYou.com☆167Updated 4 months ago
- Deep neural network trained to detect eye contact from facial image☆104Updated last year
- Code I used when filming Steve Mould’s heart rate! https://www.youtube.com/watch?v=BFZxlauizx0☆218Updated 3 years ago
- Tensorflow implementation of Learning-based Video Motion Magnification☆496Updated 7 years ago
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL☆178Updated 2 years ago
- Code for "HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields".☆954Updated last year
- Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.☆38Updated 4 years ago
- A streaming Speech to Text server using DeepSpeech☆16Updated 5 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆229Updated 3 years ago
- Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP☆242Updated 3 years ago
- Phase based video motion magnification☆145Updated 8 years ago
- Client-side air drawing tool☆189Updated 3 years ago
- This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech S…☆713Updated 2 years ago
- A Python application that does noise cancellation☆180Updated 3 years ago
- Framework to imitate writing styles using deep learning☆95Updated 3 years ago
- Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution☆165Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆360Updated 2 years ago
- ☆83Updated 2 years ago
- LoGAN: Generating Logos with a Generative Adversarial Neural Network Conditioned on color☆50Updated 7 years ago
- Residual Shuffle-Exchange☆78Updated 3 years ago
- Automated Lip reading from real-time videos in tensorflow in python☆162Updated 7 years ago
- ☆429Updated last year
- DeepPrivacy2 - A Toolbox for Realistic Image Anonymization☆363Updated 2 years ago
- AI Music Generation for the Real World☆252Updated 2 years ago
- ML models for Uberduck☆381Updated last year
- Symbolic Music Genre Transfer with CycleGAN☆279Updated 4 years ago
- This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…☆613Updated 7 months ago
- Feel electric current... with your finger.☆22Updated 5 years ago
- Symphony Generation with Permutation Invariant Language Model☆256Updated 3 years ago