jaeyeonkim99 / visageLinks
Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)
β31Updated last week
Alternatives and similar repositories for visage
Users that are interested in visage are comparing it to the libraries listed below
Sorting:
- This package aims at simplifying the download of the AudioCaps dataset.β36Updated last year
- π¦ Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)β59Updated 7 months ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"