snap-research / AVLink
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
☆14Updated 4 months ago
Alternatives and similar repositories for AVLink
Users that are interested in AVLink are comparing it to the libraries listed below
Sorting:
- ☆31Updated last month
- ☆16Updated 2 years ago
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆23Updated last year
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆37Updated last year
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆16Updated 10 months ago
- Pytorch implementation of Temporally Consistent Face Reenactment with 3D Geometric Guidance☆13Updated last week
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆57Updated 3 months ago
- ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer☆33Updated 4 months ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆39Updated last year
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆23Updated 2 months ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆33Updated 2 months ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆40Updated last year
- ☆11Updated 9 months ago
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis☆10Updated 7 months ago
- ☆22Updated 6 months ago
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated 10 months ago
- Code for Novel View Acoustic Synthesis paper☆47Updated last year
- DDS: Delta Denoising Score PyTorch implementation