dsforza96 / visual-micLinks
Passive Recovery of Sound from Video
☆49Updated 4 years ago
Alternatives and similar repositories for visual-mic
Users that are interested in visual-mic are comparing it to the libraries listed below
Sorting:
- ☆34Updated 11 years ago
- Tacotron2 Training Notebook for FakeYou.com☆164Updated 3 weeks ago
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL☆176Updated 2 years ago
- ☆63Updated 4 years ago
- Tooling for producing Italian model (public release available) for DeepSpeech and text corpus☆94Updated 3 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆170Updated 4 years ago
- Performant and accurate speech recognition built on Pytorch☆253Updated 3 years ago
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆436Updated 4 years ago
- The dataset of all music sheets and users on musescore.com (unmaintained/discontinued since Sep 30, 2021)☆259Updated 2 years ago
- This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech S…☆709Updated 2 years ago
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference.☆45Updated last year
- Pipeline of a keylogging attack using just an audio signal and unsupervised learning.☆149Updated 2 years ago
- Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible…☆42Updated 9 months ago
- LoGAN: Generating Logos with a Generative Adversarial Neural Network Conditioned on color☆50Updated 6 years ago
- ☆105Updated last year
- WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆255Updated 5 years ago
- An ultrasonic directional speaker (aka. Parametric Speaker)☆143Updated 5 years ago
- Short overview over the components used by Lime Scooters fleet☆72Updated 3 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆229Updated 3 years ago
- List of common sample rates☆52Updated 7 years ago
- Client-side air drawing tool☆186Updated 3 years ago
- text-to-speech alignment java software☆20Updated 5 years ago
- Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP☆240Updated 3 years ago
- Open-source, low-cost, in-situ turbidity sensor for river network monitoring (prototype 2.0)☆20Updated last year
- Code for making music videos using CLIP☆174Updated 4 years ago
- Briand's project to turn an ESP32 into a tor client "plug&play"☆38Updated 3 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- Yet another esp32 watch☆50Updated last year
- This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synth…☆954Updated 3 years ago
- Real Time Microphone Voice Changer Python 3.6+ App. Works with On-Line Games and VideoConferences!☆276Updated 5 years ago