Official implementation of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound". IEEE TASLP 2025.
☆17 · Feb 27, 2026 · Updated last month
Alternatives and similar repositories for video-foley
Users that are interested in video-foley are comparing it to the libraries listed below.
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos ☆25 · Oct 1, 2024 · Updated last year
- Music production for silent film clips. ☆32 · Apr 30, 2025 · Updated 10 months ago
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies". ☆93 · Dec 8, 2023 · Updated 2 years ago
- Stable-V2A: Synthesis of Synchronized Sound Effect with Temporal and Semantic Controls ☆18 · May 27, 2025 · Updated 10 months ago
- Our team is employed by a pet shop to develop a web-based grooming appointment system where customers can make appointment with the pet s… ☆13 · Oct 13, 2023 · Updated 2 years ago
- [ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video" ☆27 · Sep 9, 2025 · Updated 6 months ago
- ☆43 · Jan 13, 2025 · Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024) ☆43 · Jun 13, 2024 · Updated last year
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral) ☆33 · Feb 11, 2026 · Updated last month
- ☆11 · Jan 4, 2022 · Updated 4 years ago
- ☆16 · Jun 10, 2025 · Updated 9 months ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation ☆32 · Mar 8, 2024 · Updated 2 years ago
- [arXiv 2025] ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models ☆36 · Aug 26, 2025 · Updated 7 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models". ☆48 · Sep 11, 2024 · Updated last year
- Floorplan Recognition especially for complicated drawings ☆15 · Apr 3, 2024 · Updated last year
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation ☆61 · Jul 2, 2025 · Updated 8 months ago
- Customizable native Vue3 drag-n-drop ☆12 · Mar 9, 2024 · Updated 2 years ago
- Implementation of "Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning" (AAAI 2024) ☆17 · Dec 18, 2023 · Updated 2 years ago
- Tensorflow implementation of Shearlab, including a python wrapper of the Julia Shearlab APi ☆12 · Apr 22, 2021 · Updated 4 years ago
- Baseline to denoise + learn descriptors in N-HPatches ☆17 · Mar 14, 2019 · Updated 7 years ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network. ☆15 · Aug 22, 2023 · Updated 2 years ago
- Real time chat with socket.io and Node.js ☆12 · Oct 25, 2017 · Updated 8 years ago
- ☆36 · Jan 6, 2026 · Updated 2 months ago
- Audio-Visual Room Impulse Response Estimation ☆24 · Jul 22, 2024 · Updated last year
- Reduce requests to backend services by batching calls and caching records. ☆12 · Mar 8, 2023 · Updated 3 years ago
- This repository provides an easy way to train your models on the datasets of DCASE task 1. ☆20 · May 28, 2025 · Updated 10 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating … ☆96 · Jun 12, 2025 · Updated 9 months ago
- to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550 ☆14 · Nov 15, 2024 · Updated last year
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video" ☆44 · Dec 13, 2024 · Updated last year
- ☆17 · Jan 10, 2024 · Updated 2 years ago
- Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge ☆16 · Feb 10, 2023 · Updated 3 years ago
- [BMVC 2022] Information Theoretic Representation Distillation ☆19 · Oct 6, 2023 · Updated 2 years ago
- Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation ☆124 · Jan 18, 2023 · Updated 3 years ago
- Submission for task 2 "First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring" of the DCASE challenge 2023 (h… ☆18 · May 22, 2023 · Updated 2 years ago
- Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence ☆20 · Jun 14, 2024 · Updated last year
- Infinite Scroll Timeline for Vue 3 ☆13 · May 23, 2025 · Updated 10 months ago
- abc2midi is a program that converts an abc music notation file to a MIDI file. ☆45 · Jun 1, 2016 · Updated 9 years ago
- [AAAI 2024] Understanding the Role of the Projector in Knowledge Distillation ☆20 · Feb 13, 2024 · Updated 2 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions" ☆50 · Apr 7, 2025 · Updated 11 months ago