kszpxxzmc / ViSAudioLinks
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
☆109Updated last month
Alternatives and similar repositories for ViSAudio
Users that are interested in ViSAudio are comparing it to the libraries listed below
Sorting:
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆64Updated last week
- ☆29Updated 10 months ago
- Animate Any Character in Any World☆86Updated last week
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆45Updated last year
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models☆295Updated 4 months ago
- ☆227Updated 6 months ago
- An official implementation of SwapAnyone.☆73Updated 10 months ago
- A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using …☆278Updated last month
- Scaling Zero-Shot Reference-to-Video Generation☆61Updated last month
- Music production for silent film clips.☆31Updated 8 months ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆39Updated last month
- ☆46Updated 2 months ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"☆222Updated 5 months ago
- Krea Realtime 14B. An open-source realtime AI video model.☆459Updated 2 months ago
- MTVCraft: An Open Veo3-style Audio-Video Generation Demo☆96Updated 3 months ago
- Official implementation of Progressive Detail Injection for Training-Free Semantic Binding in Text-to-Image Generation☆31Updated 5 months ago
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing☆137Updated this week
- A Unified Visual Generator with Interleaved OmniModal Context☆163Updated 2 weeks ago
- ☆77Updated 8 months ago
- ☆33Updated 2 months ago
- ☆171Updated 2 months ago
- https://little-misfit.github.io/GRAG-Image-Editing/☆116Updated last month
- DreamStyle: A Unified Framework for Video Stylization☆89Updated 2 weeks ago
- ☆83Updated this week
- 👋 Dataset and Benchmark code for EgoEdit☆104Updated last month
- VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning☆60Updated 2 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆210Updated 2 months ago
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a…☆417Updated last week
- Lynx: Towards High-Fidelity Personalized Video Generation☆305Updated 3 months ago
- Make self forcing endless. Add cache purging. Add prompt controllability.☆68Updated 4 months ago