ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling
☆138Jun 11, 2026Updated 2 weeks ago
Alternatives and similar repositories for controlfoley
Users that are interested in controlfoley are comparing it to the libraries listed below.
- The official code for our paper StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset in IJCAI 2023.☆13Jul 17, 2024Updated last year
- ☆21Oct 4, 2025Updated 8 months ago
- Music Language Model Generation, Optimization, and Practice☆62Apr 20, 2026Updated 2 months ago
- Reddit Crawler API for collecting datasets from Reddit.☆11Dec 31, 2022Updated 3 years ago
- YOLOv8 safety helmet and workwear detection☆12Oct 13, 2023Updated 2 years ago
- The official repository of TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆29Nov 18, 2025Updated 7 months ago
- ☆103Mar 13, 2026Updated 3 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆25Mar 8, 2026Updated 3 months ago
- Official Implementation of GLAP - General Language Audio Pretraining☆73May 14, 2026Updated last month
- Python implementation of public-opinion analysis for trending Weibo events (web crawler)☆12May 5, 2022Updated 4 years ago
- ☆30Jun 19, 2025Updated last year
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆49Oct 23, 2025Updated 8 months ago
- Official website address of the Kuaifan Cloud (快帆云) proxy service☆10Nov 26, 2024Updated last year
- Official implementation of "JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery"☆36Aug 21, 2023Updated 2 years ago
- [ICLR 2026] Data Pipeline, Models, and Benchmark for Omni-Captioner.☆138Apr 7, 2026Updated 2 months ago
- [ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆140Sep 2, 2025Updated 9 months ago
- [SIGGRAPH 2026 / TOG] Official code of the paper "UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Pr…☆234May 15, 2026Updated last month
- ☆41May 12, 2026Updated last month
- This repository is a paper summary of the latest progress in cooperative/collaborative/multi-agent perception datasets in autonomous dri…☆51Aug 15, 2025Updated 10 months ago
- Love Notebook, a lightweight mini program for recording a couple's daily life.☆19Dec 28, 2023Updated 2 years ago
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Feb 15, 2026Updated 4 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆55Jun 6, 2025Updated last year
- ☆71Dec 30, 2025Updated 6 months ago
- ☆109Jun 17, 2026Updated last week
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆17Apr 22, 2025Updated last year
- Materials of public talks given by SJTU X-LANCE members☆14Dec 3, 2022Updated 3 years ago
- China Computer Design Competition, Artificial Intelligence Challenge, National Second Prize☆19Oct 22, 2022Updated 3 years ago
- Details of the datasets for Few-shot class-incremental audio classification☆10Dec 6, 2023Updated 2 years ago
- ☆19Apr 2, 2025Updated last year
- Simple tetris game written in Kotlin☆29Jun 14, 2023Updated 3 years ago
- Template for creating audio encoders compatible with X-ARES☆19Feb 11, 2026Updated 4 months ago
- Unofficial Split Mean Flow implementation from ByteDance☆70Aug 12, 2025Updated 10 months ago
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆21Jul 4, 2021Updated 4 years ago
- ☆23Nov 6, 2023Updated 2 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 7 months ago
- Author implementation of RiDDLE: Reversible and Diversified De-identification with Latent Encryptor (CVPR 2023)☆53Jul 3, 2025Updated 11 months ago
- ☆18Jun 21, 2021Updated 5 years ago
- Huazhong University of Science and Technology Computer Networks Lab 1: a simple HTTP server implemented with socket programming☆28Nov 2, 2019Updated 6 years ago