dsforza96 / visual-micLinks
Passive Recovery of Sound from Video
☆47Updated 4 years ago
Alternatives and similar repositories for visual-mic
Users that are interested in visual-mic are comparing it to the libraries listed below
Sorting:
- DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.☆844Updated last year
- Tacotron2 Training Notebook for FakeYou.com☆164Updated 7 months ago
- Scripts to read out the Kinect v1 (xbox 360) scanner and combine multiple scans into meshes, pointclouds or voxels.☆24Updated 6 years ago
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆436Updated 4 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆170Updated 4 years ago
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆985Updated 7 months ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆837Updated last year
- Many seemingly static scenes contain subtle changes that are invisible to the naked human eye. However, it is possible to pull out these …☆12Updated 4 years ago
- Performant and accurate speech recognition built on Pytorch☆253Updated 3 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr…☆899Updated last year
- Mimic Recording Studio is a Docker-based application you can install to record voice samples, which can then be trained into a TTS voice …☆509Updated 2 years ago
- ⏩ Generating speech in a single forward pass without any attention!☆579Updated 10 months ago
- SpikeStream Visualisation Code as seen on https://spikestream.corticallabs.com and https://www.youtube.com/watch?v=9ksLuRoEq6A☆14Updated last year
- Audio super resolution using neural networks☆1,232Updated last year
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL☆175Updated 2 years ago
- Some quick BLOOM LLM examples☆257Updated 2 years ago
- This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech S…☆709Updated last year
- API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend☆335Updated 3 years ago
- Audio Denoising with Deep Network Priors☆162Updated 4 years ago
- 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.☆1,149Updated last year
- Check out more of my projects: https://calebolson.com☆22Updated 2 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.☆403Updated 3 years ago
- ☆142Updated 4 years ago
- Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)☆189Updated last year
- This is a small POC running on an ESP32, exploiting CVE-2022-42722 to crash Linux devices over the air.☆80Updated 2 years ago
- Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.☆765Updated 9 months ago
- Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code☆437Updated last year
- Pytorch Implementation of wavegan model to generate audio☆164Updated 4 years ago
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference.☆45Updated last year
- A GPT-J API to use with python3 to generate text, blogs, code, and more☆204Updated 2 years ago