dsforza96 / visual-mic
Passive Recovery of Sound from Video
☆42Updated 4 years ago
Alternatives and similar repositories for visual-mic:
Users that are interested in visual-mic are comparing it to the libraries listed below
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL☆173Updated 2 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆167Updated 2 years ago
- Client-side air drawing tool☆184Updated 3 years ago
- Remote heart rate detection through Eulerian magnification of face videos☆329Updated 2 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated 2 years ago
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆87Updated 2 years ago
- ☆40Updated 6 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- Automated Reproducible Acoustical Analysis☆149Updated 7 months ago
- Performant and accurate speech recognition built on Pytorch☆253Updated 2 years ago
- Automated Lip reading from real-time videos in tensorflow in python☆162Updated 7 years ago
- PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."☆562Updated last year
- Direction-of-Voice (DoV) Estimation for Intuitive Speech Interaction with Smart Devices Ecosystems☆34Updated 2 years ago
- This repository implements T. Oh, R. Jaroensri, C. Kim, M. Elgharib, F. Durand, W. Freeman, W. Matusik "Learning-based Video Motion Magni …☆40Updated 5 years ago
- Python implementation of EVM(Eulerian Video Magnification)☆235Updated 2 years ago
- A large-scale publicly-available visual-thermal-audio dataset designed to encourage research in the general areas of user authentication,…☆81Updated 3 years ago
- The Cone of Silence:☆152Updated 2 years ago
- Understanding emotions from audio files using neural networks and multiple datasets.☆418Updated last year
- A collection of Audio and Speech pre-trained models.☆187Updated 4 years ago
- PMEmo: A Dataset For Music Emotion Computing☆105Updated 11 months ago
- VoiceStressAnalysis - Detects stress in your voice☆21Updated 6 months ago
- ☆16Updated 3 months ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆199Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆147Updated 11 months ago
- Code behind the work "Single Cortical Neurons as Deep Artificial Neural Networks", published in Neuron 2021☆150Updated 3 years ago
- ☆64Updated 4 years ago
- Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)☆415Updated last year
- Eulerian Video Magnification☆24Updated 4 years ago
- Dataset and source code for "CEAP-360VR: A Continuous Physiological and Behavioral Emotion Annotation Dataset for 360° Videos" in IEEE Tr…☆23Updated last year
- ☆27Updated last year