☆48Jul 10, 2024Updated last year
Alternatives and similar repositories for video-physics-sound-diffusion
Users that are interested in video-physics-sound-diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)☆33Feb 11, 2026Updated last month
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆93Dec 8, 2023Updated 2 years ago
- Solos: A Dataset for Audio-Visual Music Analysis☆24Feb 17, 2023Updated 3 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation☆124Jan 18, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MeshRIR: Dataset of room impulse responses on meshed grid points☆43Mar 13, 2026Updated 2 weeks ago
- ☆33Apr 10, 2023Updated 2 years ago
- ☆15Sep 24, 2022Updated 3 years ago
- ☆13Jul 14, 2024Updated last year
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- Geometric-Wave Acoustic dataset☆64Aug 14, 2022Updated 3 years ago
- This repository holds datasets of polyphonic drum patterns used in the creation of Electronic Dance Music.☆16Dec 19, 2016Updated 9 years ago
- A large-scale real-world audio-visual dataset for research on 3D scene understanding and echolocation.☆19Oct 21, 2025Updated 5 months ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆42Dec 23, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Sep 1, 2023Updated 2 years ago
- PyQt application for creating mass interaction models and compiling them into Faust dsp code, or ~gen objects (for Max/MSP)☆31Jan 26, 2021Updated 5 years ago
- Source Separation on Musical Instrument Sounds☆38Jan 4, 2022Updated 4 years ago
- ☆19Jan 30, 2023Updated 3 years ago
- PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "☆40Dec 15, 2020Updated 5 years ago
- [AAAI 2024] V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models☆28Dec 14, 2023Updated 2 years ago
- Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models☆201May 29, 2024Updated last year
- ☆16Sep 7, 2022Updated 3 years ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation☆46Sep 6, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆14Dec 21, 2024Updated last year
- ☆40Jul 19, 2022Updated 3 years ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- A Translation Task using TurboTransformers☆10Dec 17, 2020Updated 5 years ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 7 months ago
- Spatial Audio Generation☆117Mar 24, 2023Updated 3 years ago
- Another rubberband-wasm story but with ready-to-use AudioWorklet and WebWorker☆17Nov 14, 2022Updated 3 years ago
- [NeurIPS 2024] Code, Dataset, Samples for the VATT paper “ Tell What You Hear From What You See - Video to Audio Generation Through Text”☆36Jul 24, 2025Updated 8 months ago
- ☆18Jul 9, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆288Mar 20, 2024Updated 2 years ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆18Oct 11, 2024Updated last year
- [CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation☆452Jun 5, 2024Updated last year
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).☆19Apr 1, 2021Updated 4 years ago
- 为visinger SVS系统写的展示系统~本质仍然是个音乐播放器☆11Apr 18, 2023Updated 2 years ago
- Language-based navigation project☆22Feb 9, 2024Updated 2 years ago
- [NeurIPS'24 splotlight] Official Repo for AcoustiX used in Acoustic volume rendering for neural impulse response fields.☆37Dec 15, 2025Updated 3 months ago