Yusiissy/SonicVisionLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Yusiissy/SonicVisionLM)

Yusiissy / SonicVisionLM

☆75

Alternatives and similar repositories for SonicVisionLM

Users that are interested in SonicVisionLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ku-vai / TPoS
View on GitHub
This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)
☆25Dec 7, 2023Updated 2 years ago
ZYH-Lightyear / LVAS
View on GitHub
LVAS-Agent Code Base
☆21Apr 15, 2025Updated last year
zhuole1025 / SymMV
View on GitHub
[ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation
☆78Mar 29, 2024Updated 2 years ago
camenduru / AutoStudio-jupyter
View on GitHub
☆15Jun 25, 2024Updated 2 years ago
camenduru / Depth-Anything-jupyter
View on GitHub
☆11Feb 9, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
OpenNLPLab / MMVAE-AVS
View on GitHub
Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].
☆20Sep 19, 2024Updated last year
ghost-signal / myna
View on GitHub
Official repository of Myna: Masking-Based Contrastive Learning of Musical Representations
☆17Mar 31, 2025Updated last year
thuhcsi / SnakeGAN
View on GitHub
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Apr 25, 2023Updated 3 years ago
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
camenduru / CCSR-colab
View on GitHub
☆17Jan 10, 2024Updated 2 years ago
ariesssxu / vta-ldm
View on GitHub
☆61Jun 15, 2025Updated last year
SAGNIKMJR / ego-AV-spatial-correspondence
View on GitHub
[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'
☆14Jun 16, 2024Updated 2 years ago
luosiallen / Diff-Foley
View on GitHub
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
☆206May 29, 2024Updated 2 years ago
Ego4DSounds / Ego4DSounds
View on GitHub
Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence
☆21Jun 14, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
AgentCooper2002 / EDMSound
View on GitHub
Codebase and project page for EDMSound
☆35Nov 20, 2023Updated 2 years ago
yzxing87 / Seeing-and-Hearing
View on GitHub
[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
☆155Jul 6, 2024Updated 2 years ago
camenduru / InstantID-jupyter
View on GitHub
☆20Jan 22, 2024Updated 2 years ago
camenduru / resemble-enhance-colab
View on GitHub
☆22Jan 15, 2024Updated 2 years ago
camenduru / Mix-of-Show-colab
View on GitHub
☆13Dec 18, 2023Updated 2 years ago
kaist-ami / SoundBrush
View on GitHub
☆14Dec 8, 2025Updated 7 months ago
SJTMusicTeam / MusicGeneration
View on GitHub
☆10May 15, 2021Updated 5 years ago
OpenNLPLab / TAVGBench
View on GitHub
Demo page of TAVGBench: Benchmarking Text to Audible-Video Generation
☆15Apr 7, 2025Updated last year
camenduru / FreeInit-colab
View on GitHub
☆25Dec 21, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
v-iashin / Synchformer
View on GitHub
Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)
☆130Sep 15, 2025Updated 10 months ago
camenduru / audioldm-colab
View on GitHub
AudioLDM text to audio colab
☆18Nov 6, 2023Updated 2 years ago
vmeylan / youtube_video_and_transcript_downloader
View on GitHub
Fetches transcripts from YouTube videos, including private ones with granted access, and optionally downloads the videos. Does not suppor…
☆18Apr 17, 2024Updated 2 years ago
thuhcsi / DiffVar
View on GitHub
☆30Aug 12, 2023Updated 2 years ago
spkgyk / TDFNet
View on GitHub
Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023
☆14Mar 17, 2024Updated 2 years ago
yukara-ikemiya / friendly-stable-audio-tools
View on GitHub
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…
☆218Jul 25, 2024Updated 2 years ago
kuai-lab / soundini-official
View on GitHub
We are committing code.
☆44May 18, 2023Updated 3 years ago
guyyariv / TempoTokens
View on GitHub
[AAAI 2024] The official PyTorch implementation of "Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation"
☆131May 18, 2026Updated 2 months ago
lzhangbj / ASVA
View on GitHub
[ECCV 2024 Oral] Audio-Synchronized Visual Animation
☆60Mar 15, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
BriansIDP / AudioVisualLLM
View on GitHub
☆19May 19, 2024Updated 2 years ago
UnrealXinda / CPP-Fluid-Particles
View on GitHub
A wrapper project to interface CUDA implementation of SPH liquid simulation with UE4
☆12Jun 29, 2020Updated 6 years ago
camenduru / FluxMusic-jupyter
View on GitHub
☆18Sep 4, 2024Updated last year
OpenGVLab / LORIS
View on GitHub
[ICML2023] Long-Term Rhythmic Video Soundtracker
☆63Jul 28, 2025Updated last year
HUIZ-A / SVA
View on GitHub
☆20Apr 26, 2024Updated 2 years ago
open-mmlab / FoleyCrafter
View on GitHub
[IJCV 2026] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师，给你的无声视频添加生动而且同步的音效 😝
☆659Jun 15, 2026Updated last month
artaction / OP-Z_Controls_Ableton
View on GitHub
This contains an Ableton session template and instrument racks that allow you to plug and play 8 instrument tracks and effects in Ableton…
☆19May 14, 2019Updated 7 years ago