Official implementation of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound". IEEE TASLP 2025.
☆18Feb 27, 2026Updated 3 months ago
Alternatives and similar repositories for video-foley
Users that are interested in video-foley are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos☆26Oct 1, 2024Updated last year
- Music production for silent film clips.☆32Apr 30, 2025Updated last year
- Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".☆93Dec 8, 2023Updated 2 years ago
- Stable-V2A: Synthesis of Synchronized Sound Effect with Temporal and Semantic Controls☆18May 27, 2025Updated last year
- Our team is employed by a pet shop to develop a web-based grooming appointment system where customers can make appointment with the pet s…☆13Oct 13, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video"☆27Sep 9, 2025Updated 9 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆44Jun 13, 2024Updated 2 years ago
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)☆34Feb 11, 2026Updated 4 months ago
- ☆44Jan 13, 2025Updated last year
- ☆10Jan 4, 2022Updated 4 years ago
- ☆17Jun 10, 2025Updated last year
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆32Mar 8, 2024Updated 2 years ago
- [arXiv 2025] ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models☆38Aug 26, 2025Updated 9 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆49Sep 11, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Floorplan Recognition especially for complicated drawings☆16Apr 3, 2024Updated 2 years ago
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆65Jul 2, 2025Updated 11 months ago
- Customizable native Vue3 drag-n-drop☆12Mar 9, 2024Updated 2 years ago
- Implementation of "Factorized Diffusion Autoencoder for Unsupervised Disentangled Representation Learning" (AAAI 2024)☆17Dec 18, 2023Updated 2 years ago
- Tensorflow implementation of Shearlab, including a python wrapper of the Julia Shearlab APi☆12Apr 22, 2021Updated 5 years ago
- Baseline to denoise + learn descriptors in N-HPatches☆17Mar 14, 2019Updated 7 years ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆15Aug 22, 2023Updated 2 years ago
- Real time chat with socket.io and Node.js☆12Oct 25, 2017Updated 8 years ago
- Audio-Visual Room Impulse Response Estimation☆24Jul 22, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆37Jan 6, 2026Updated 5 months ago
- Reduce requests to backend services by batching calls and caching records.☆12Mar 8, 2023Updated 3 years ago
- This repository provides an easy way to train your models on the datasets of DCASE task 1.☆20May 28, 2025Updated last year
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆103Jun 12, 2025Updated last year
- to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550☆14Nov 15, 2024Updated last year
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆44Dec 13, 2024Updated last year
- ☆17Jan 10, 2024Updated 2 years ago
- Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge☆16Feb 10, 2023Updated 3 years ago
- [BMVC 2022] Information Theoretic Representation Distillation☆19Oct 6, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation☆125Jan 18, 2023Updated 3 years ago
- Submission for task 2 "First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring" of the DCASE challenge 2023 (h…☆18May 22, 2023Updated 3 years ago
- Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence☆21Jun 14, 2024Updated 2 years ago
- Infinite Scroll Timeline for Vue 3☆13May 23, 2025Updated last year
- [AAAI 2024] Understanding the Role of the Projector in Knowledge Distillation☆20Feb 13, 2024Updated 2 years ago
- A collection of open source libraries and documentation for building robot platforms on Formant APIs 🤖☆20Updated this week
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated last year