Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)
☆58Oct 25, 2019Updated 6 years ago
Alternatives and similar repositories for Vision-Infused-Audio-Inpainter-VIAI
Users that are interested in Vision-Infused-Audio-Inpainter-VIAI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆72Oct 20, 2020Updated 5 years ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- The thesis template for PhD of CUHK.☆21Jun 22, 2020Updated 5 years ago
- Code for ECCV 2020 paper "Open-Edit: Open-domain Image Manipulation with Open-Vocabulary Instructions"☆55Aug 27, 2021Updated 4 years ago
- ☆29May 4, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for NeurIPS 2019 paper "Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis"☆129Mar 10, 2020Updated 6 years ago
- Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441☆12Oct 25, 2022Updated 3 years ago
- Code related to my Bachelor's Thesis Project☆13Jun 17, 2016Updated 10 years ago
- Utils for computer vision research.☆72Oct 12, 2018Updated 7 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆34Jul 29, 2019Updated 6 years ago
- This is the public repository for our accepted ICCV 2019 paper "Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild"☆69Nov 20, 2021Updated 4 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated 2 years ago
- Learning to Steer by Mimicking Features from Heterogeneous Auxiliary Networks (AAAI 2019, oral)☆54May 16, 2019Updated 7 years ago
- ☆19Jul 12, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆22Jul 20, 2022Updated 3 years ago
- Can Neural Networks reconstruct missing audio data? What about GANs?☆18Nov 6, 2019Updated 6 years ago
- Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)☆142Sep 19, 2021Updated 4 years ago
- A open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx☆15Dec 16, 2023Updated 2 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆72Jul 8, 2021Updated 4 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆33May 15, 2023Updated 3 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆30May 22, 2022Updated 4 years ago
- A context encoder for audio inpainting☆26Mar 24, 2023Updated 3 years ago
- ☆15Jun 15, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆47Nov 4, 2020Updated 5 years ago
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)☆814May 11, 2021Updated 5 years ago
- A PyTorch implementation of the Modified Discrete Cosine Transform (MDCT) and its inverse for audio processing.☆33Dec 17, 2024Updated last year
- Pytorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"☆11Jul 10, 2019Updated 6 years ago
- This repository contains the audio samples and the source code that accompany the paper: "MixCycle: Unsupervised Speech Separation via Cy…☆24Jan 10, 2023Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 4 years ago
- Code for Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images (CVPR 2020)☆494Aug 27, 2020Updated 5 years ago
- Pytorch code for Towards Backward-Compatible Representation Learning [CVPR 2020 Oral]☆55Jul 12, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Simple sinc interpolation in PyTorch.☆15Jul 8, 2023Updated 2 years ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆20Sep 19, 2024Updated last year
- Co-Separating Sounds of Visual Objects (ICCV 2019)☆98Jul 25, 2023Updated 2 years ago
- Some Demo Code for the MPA Exercise.☆10Dec 4, 2017Updated 8 years ago
- Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).☆29Feb 15, 2022Updated 4 years ago
- Code for reproducing experiments in "Exploiting GAN Internal Capacity for High-Quality Reconstruction of Natural Images"☆16Nov 14, 2019Updated 6 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆50Sep 24, 2019Updated 6 years ago