Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)
☆58Oct 25, 2019Updated 6 years ago
Alternatives and similar repositories for Vision-Infused-Audio-Inpainter-VIAI
Users that are interested in Vision-Infused-Audio-Inpainter-VIAI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆72Oct 20, 2020Updated 5 years ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- Code for ECCV 2020 paper "Open-Edit: Open-domain Image Manipulation with Open-Vocabulary Instructions"☆55Aug 27, 2021Updated 4 years ago
- ☆29May 4, 2020Updated 5 years ago
- Code for NeurIPS 2019 paper "Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis"☆129Mar 10, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441☆11Oct 25, 2022Updated 3 years ago
- ☆21Nov 24, 2022Updated 3 years ago
- Utils for computer vision research.☆72Oct 12, 2018Updated 7 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆34Jul 29, 2019Updated 6 years ago
- This is the public repository for our accepted ICCV 2019 paper "Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild"☆69Nov 20, 2021Updated 4 years ago
- PyTorch implementation of "Quality Aware Network for Set to Set Recognition"☆11Jun 13, 2018Updated 7 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- Learning to Steer by Mimicking Features from Heterogeneous Auxiliary Networks (AAAI 2019, oral)☆54May 16, 2019Updated 6 years ago
- ☆19Jul 12, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Jul 20, 2022Updated 3 years ago
- Code for Switchable Whitening (ICCV2019)☆137Dec 18, 2019Updated 6 years ago
- Can Neural Networks reconstruct missing audio data? What about GANs?☆18Nov 6, 2019Updated 6 years ago
- A open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx☆15Dec 16, 2023Updated 2 years ago
- Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)☆142Sep 19, 2021Updated 4 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆71Jul 8, 2021Updated 4 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆29May 22, 2022Updated 3 years ago
- ☆15Jun 15, 2022Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆47Nov 4, 2020Updated 5 years ago
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)☆813May 11, 2021Updated 4 years ago
- ☆22Oct 12, 2023Updated 2 years ago
- A PyTorch implementation of the Modified Discrete Cosine Transform (MDCT) and its inverse for audio processing.☆32Dec 17, 2024Updated last year
- Pytorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"☆11Jul 10, 2019Updated 6 years ago
- [ECCV2022] Mind the Gap in Distilling StyleGANs☆29May 7, 2023Updated 2 years ago
- This repository contains the audio samples and the source code that accompany the paper: "MixCycle: Unsupervised Speech Separation via Cy…☆24Jan 10, 2023Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- [MICCAI2020] Code for paper : Deep Semi-supervised Knowledge Distillation for Overlapping Cervical Cell Instance Segmentation☆54Nov 18, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Simple sinc interpolation in PyTorch.☆15Jul 8, 2023Updated 2 years ago
- Accepted by TMM 2022☆19Aug 18, 2022Updated 3 years ago
- Co-Separating Sounds of Visual Objects (ICCV 2019)☆99Jul 25, 2023Updated 2 years ago
- Some Demo Code for the MPA Exercise.☆10Dec 4, 2017Updated 8 years ago
- Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).☆29Feb 15, 2022Updated 4 years ago
- ☆13Sep 17, 2021Updated 4 years ago
- Code for reproducing experiments in "Exploiting GAN Internal Capacity for High-Quality Reconstruction of Natural Images"☆16Nov 14, 2019Updated 6 years ago