Hangz-nju-cuhk/Vision-Infused-Audio-Inpainter-VIAI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Hangz-nju-cuhk/Vision-Infused-Audio-Inpainter-VIAI)

Hangz-nju-cuhk / Vision-Infused-Audio-Inpainter-VIAI

Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)

☆58

Alternatives and similar repositories for Vision-Infused-Audio-Inpainter-VIAI

Users that are interested in Vision-Infused-Audio-Inpainter-VIAI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SheldonTsui / SepStereo_ECCV2020
View on GitHub
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72Oct 20, 2020Updated 5 years ago
xh-liu / Open-Edit
View on GitHub
Code for ECCV 2020 paper "Open-Edit: Open-domain Image Manipulation with Open-Vocabulary Instructions"
☆55Aug 27, 2021Updated 4 years ago
nperraud / gan_audio_inpainting
View on GitHub
☆29May 4, 2020Updated 6 years ago
xh-liu / CC-FPSE
View on GitHub
Code for NeurIPS 2019 paper "Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis"
☆129Mar 10, 2020Updated 6 years ago
sanuj / nuclei-net
View on GitHub
Code related to my Bachelor's Thesis Project
☆13Jun 17, 2016Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hellock / cvbase
View on GitHub
Utils for computer vision research.
☆72Oct 12, 2018Updated 7 years ago
xh-liu / CM-Erase-REG
View on GitHub
Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"
☆34Jul 29, 2019Updated 7 years ago
penincillin / DCT_ICCV-2019
View on GitHub
This is the public repository for our accepted ICCV 2019 paper "Delving Deep Into Hybrid Annotations for 3D Human Recovery in the Wild"
☆69Nov 20, 2021Updated 4 years ago
yxgeee / PyTorch-QAN
View on GitHub
PyTorch implementation of "Quality Aware Network for Set to Set Recognition"
☆11Jun 13, 2018Updated 8 years ago
JorisCos / VCTK-2Mix
View on GitHub
☆19Jul 12, 2020Updated 6 years ago
XingangPan / Switchable-Whitening
View on GitHub
Code for Switchable Whitening (ICCV2019)
☆137Dec 18, 2019Updated 6 years ago
RanaCM / DSU-AVO
View on GitHub
Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023
☆12May 13, 2024Updated 2 years ago
iamhectorotero / generative-audio-inpainting
View on GitHub
Can Neural Networks reconstruct missing audio data? What about GANs?
☆18Nov 6, 2019Updated 6 years ago
TencentARC / SFDA
View on GitHub
☆21Jul 20, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zhmiao / OpenCompoundDomainAdaptation-OCDA
View on GitHub
Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)
☆142Sep 19, 2021Updated 4 years ago
SheldonTsui / PseudoBinaural_CVPR2021
View on GitHub
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
☆72Jul 8, 2021Updated 5 years ago
TencentARC / FLM
View on GitHub
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
☆31May 15, 2023Updated 3 years ago
showlab / AVA-AVD
View on GitHub
☆22Nov 24, 2022Updated 3 years ago
andimarafioti / audioContextEncoder
View on GitHub
A context encoder for audio inpainting
☆26Mar 24, 2023Updated 3 years ago
TencentARC / DTN
View on GitHub
Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.
☆29May 22, 2022Updated 4 years ago
Hangz-nju-cuhk / Talking-Face-Generation-DAVS
View on GitHub
Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)
☆813May 11, 2021Updated 5 years ago
Andong-Li-speech / RTNet
View on GitHub
implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain
☆47Nov 4, 2020Updated 5 years ago
xcmyz / ConvTasNet4BasisMelGAN
View on GitHub
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21Jul 21, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
fmcarlucci / ADAGE
View on GitHub
Pytorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"
☆11Jul 10, 2019Updated 7 years ago
xuguodong03 / StyleKD
View on GitHub
[ECCV2022] Mind the Gap in Distilling StyleGANs
☆29May 7, 2023Updated 3 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
ertug / MixCycle
View on GitHub
Source code and audio samples for the paper "MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training"
☆25Jun 21, 2026Updated last month
Hangz-nju-cuhk / Rotate-and-Render
View on GitHub
Code for Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images (CVPR 2020)
☆494Aug 27, 2020Updated 5 years ago
Amandaynzhou / MMT-PSM
View on GitHub
[MICCAI2020] Code for paper : Deep Semi-supervised Knowledge Distillation for Overlapping Cervical Cell Instance Segmentation
☆54Nov 18, 2020Updated 5 years ago
YantaoShen / openBCT
View on GitHub
Pytorch code for Towards Backward-Compatible Representation Learning [CVPR 2020 Oral]
☆55Jul 12, 2021Updated 5 years ago
yoyolicoris / kazane
View on GitHub
Simple sinc interpolation in PyTorch.
☆15Jul 8, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rhgao / co-separation
View on GitHub
Co-Separating Sounds of Visual Objects (ICCV 2019)
☆98Jul 25, 2023Updated 3 years ago
eyaler / clip_biggan
View on GitHub
☆13Sep 17, 2021Updated 4 years ago
Overcautious / ADENet
View on GitHub
Accepted by TMM 2022
☆19Aug 18, 2022Updated 3 years ago
OpenNLPLab / MMVAE-AVS
View on GitHub
Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].
☆20Sep 19, 2024Updated last year
rhgao / Deep-MIML-Network
View on GitHub
Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018)
☆50Sep 24, 2019Updated 6 years ago
alvinliu0 / Visual-Sound-Localization-in-the-Wild
View on GitHub
Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).
☆29Feb 15, 2022Updated 4 years ago
JuanFMontesinos / VoViT
View on GitHub
VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer
☆35Mar 18, 2023Updated 3 years ago