Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)
☆58 · Oct 25, 2019 · Updated 6 years ago
Alternatives and similar repositories for Vision-Infused-Audio-Inpainter-VIAI
Users that are interested in Vision-Infused-Audio-Inpainter-VIAI are comparing it to the libraries listed below.
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020) ☆72 · Oct 20, 2020 · Updated 5 years ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition. ☆10 · Jul 15, 2020 · Updated 5 years ago
- The thesis template for PhD of CUHK. ☆21 · Jun 22, 2020 · Updated 5 years ago
- Code for ECCV 2020 paper "Open-Edit: Open-domain Image Manipulation with Open-Vocabulary Instructions" ☆55 · Aug 27, 2021 · Updated 4 years ago
- ☆29 · May 4, 2020 · Updated 6 years ago
- Code for NeurIPS 2019 paper "Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis" ☆129 · Mar 10, 2020 · Updated 6 years ago
- Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441 ☆12 · Oct 25, 2022 · Updated 3 years ago
- ☆21 · Nov 24, 2022 · Updated 3 years ago
- Utils for computer vision research. ☆72 · Oct 12, 2018 · Updated 7 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing" ☆34 · Jul 29, 2019 · Updated 6 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023 ☆12 · May 13, 2024 · Updated 2 years ago
- Learning to Steer by Mimicking Features from Heterogeneous Auxiliary Networks (AAAI 2019, oral) ☆54 · May 16, 2019 · Updated 7 years ago
- ☆19 · Jul 12, 2020 · Updated 5 years ago
- Code for Switchable Whitening (ICCV2019) ☆137 · Dec 18, 2019 · Updated 6 years ago
- An open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx ☆15 · Dec 16, 2023 · Updated 2 years ago
- PyTorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL) ☆142 · Sep 19, 2021 · Updated 4 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021) ☆72 · Jul 8, 2021 · Updated 4 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023) ☆33 · May 15, 2023 · Updated 3 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022. ☆30 · May 22, 2022 · Updated 4 years ago
- A context encoder for audio inpainting ☆26 · Mar 24, 2023 · Updated 3 years ago
- ☆15 · Jun 15, 2022 · Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder. ☆21 · Jul 21, 2021 · Updated 4 years ago
- Implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain ☆47 · Nov 4, 2020 · Updated 5 years ago
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019) ☆814 · May 11, 2021 · Updated 5 years ago
- PyTorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains" ☆11 · Jul 10, 2019 · Updated 6 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fast. ☆16 · Jun 17, 2022 · Updated 3 years ago
- Code for Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images (CVPR 2020) ☆493 · Aug 27, 2020 · Updated 5 years ago
- PyTorch code for Towards Backward-Compatible Representation Learning [CVPR 2020 Oral] ☆55 · Jul 12, 2021 · Updated 4 years ago
- [MICCAI2020] Code for paper: Deep Semi-supervised Knowledge Distillation for Overlapping Cervical Cell Instance Segmentation ☆54 · Nov 18, 2020 · Updated 5 years ago
- Simple sinc interpolation in PyTorch. ☆15 · Jul 8, 2023 · Updated 2 years ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023]. ☆20 · Sep 19, 2024 · Updated last year
- Accepted by TMM 2022 ☆19 · Aug 18, 2022 · Updated 3 years ago
- Co-Separating Sounds of Visual Objects (ICCV 2019) ☆98 · Jul 25, 2023 · Updated 2 years ago
- Some Demo Code for the MPA Exercise. ☆10 · Dec 4, 2017 · Updated 8 years ago
- Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022). ☆29 · Feb 15, 2022 · Updated 4 years ago
- ☆13 · Sep 17, 2021 · Updated 4 years ago
- Code for reproducing experiments in "Exploiting GAN Internal Capacity for High-Quality Reconstruction of Natural Images" ☆16 · Nov 14, 2019 · Updated 6 years ago
- VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer ☆35 · Mar 18, 2023 · Updated 3 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model ☆19 · Apr 16, 2024 · Updated 2 years ago