Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)
☆58Oct 25, 2019Updated 6 years ago
Alternatives and similar repositories for Vision-Infused-Audio-Inpainter-VIAI
Users that are interested in Vision-Infused-Audio-Inpainter-VIAI are comparing it to the libraries listed below.
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆72Oct 20, 2020Updated 5 years ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- The thesis template for PhD of CUHK.☆21Jun 22, 2020Updated 5 years ago
- Code for ECCV 2020 paper "Open-Edit: Open-domain Image Manipulation with Open-Vocabulary Instructions"☆55Aug 27, 2021Updated 4 years ago
- ☆29May 4, 2020Updated 5 years ago
- Code for NeurIPS 2019 paper "Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis"☆129Mar 10, 2020Updated 6 years ago
- Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441☆11Oct 25, 2022Updated 3 years ago
- Code related to my Bachelor's Thesis Project☆13Jun 17, 2016Updated 9 years ago
- ☆21Nov 24, 2022Updated 3 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆34Jul 29, 2019Updated 6 years ago
- This is the public repository for our accepted ICCV 2019 paper "Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild"☆70Nov 20, 2021Updated 4 years ago
- PyTorch implementation of "Quality Aware Network for Set to Set Recognition"☆11Jun 13, 2018Updated 7 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- Learning to Steer by Mimicking Features from Heterogeneous Auxiliary Networks (AAAI 2019, oral)☆54May 16, 2019Updated 6 years ago
- ☆21Jul 20, 2022Updated 3 years ago
- ☆19Jul 12, 2020Updated 5 years ago
- ☆22Oct 12, 2023Updated 2 years ago
- An open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx☆15Dec 16, 2023Updated 2 years ago
- PyTorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)☆142Sep 19, 2021Updated 4 years ago
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆71Jul 8, 2021Updated 4 years ago
- ☆15Jun 15, 2022Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.☆21Jul 21, 2021Updated 4 years ago
- Implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆48Nov 4, 2020Updated 5 years ago
- Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)☆814May 11, 2021Updated 4 years ago
- PyTorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"☆11Jul 10, 2019Updated 6 years ago
- [ECCV2022] Mind the Gap in Distilling StyleGANs☆29May 7, 2023Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain for accurate and fast decoding.☆16Jun 17, 2022Updated 3 years ago
- Code for Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images (CVPR 2020)☆495Aug 27, 2020Updated 5 years ago
- [MICCAI2020] Code for paper: Deep Semi-supervised Knowledge Distillation for Overlapping Cervical Cell Instance Segmentation☆54Nov 18, 2020Updated 5 years ago
- Simple sinc interpolation in PyTorch.☆15Jul 8, 2023Updated 2 years ago
- Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].☆20Sep 19, 2024Updated last year
- Accepted by TMM 2022☆19Aug 18, 2022Updated 3 years ago
- Co-Separating Sounds of Visual Objects (ICCV 2019)☆99Jul 25, 2023Updated 2 years ago
- Some Demo Code for the MPA Exercise.☆10Dec 4, 2017Updated 8 years ago
- Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).☆29Feb 15, 2022Updated 4 years ago
- ☆13Sep 17, 2021Updated 4 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)☆50Sep 24, 2019Updated 6 years ago
- VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer☆35Mar 18, 2023Updated 3 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated last year