ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling
☆129Apr 22, 2026Updated last month
Alternatives and similar repositories for controlfoley
Users who are interested in controlfoley are comparing it to the libraries listed below.
- The official code for our paper StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset in IJCAI 2023.☆13Jul 17, 2024Updated last year
- ☆21Oct 4, 2025Updated 8 months ago
- Music Language Model Generation, Optimization, and Practice☆59Apr 20, 2026Updated last month
- Reddit Crawler API for collecting datasets from Reddit.☆11Dec 31, 2022Updated 3 years ago
- YOLOv8 safety helmet and workwear detection☆12Oct 13, 2023Updated 2 years ago
- The official repository of TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆29Nov 18, 2025Updated 6 months ago
- ☆101Mar 13, 2026Updated 2 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆25Mar 8, 2026Updated 3 months ago
- Official Implementation of GLAP - General Language Audio Pretraining☆73May 14, 2026Updated 3 weeks ago
- Python implementation of public-opinion analysis for trending Weibo events (web crawler)☆12May 5, 2022Updated 4 years ago
- ☆30Jun 19, 2025Updated 11 months ago
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆49Oct 23, 2025Updated 7 months ago
- Official website address of the Kuaifan Cloud proxy service☆10Nov 26, 2024Updated last year
- Official implementation of "JOTR: 3D Joint Contrastive Learning with Transformers for Occluded Human Mesh Recovery"☆36Aug 21, 2023Updated 2 years ago
- [ICLR 2026] Data Pipeline, Models, and Benchmark for Omni-Captioner.☆134Apr 7, 2026Updated 2 months ago
- ☆51Updated this week
- [ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆140Sep 2, 2025Updated 9 months ago
- [SIGGRAPH 2026 / TOG] Official code of the paper "UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Pr…☆225May 15, 2026Updated 3 weeks ago
- ☆41May 12, 2026Updated 3 weeks ago
- This repository is a paper summary of the latest progress in cooperative/collaborative/multi-agent perception datasets in autonomous dri…☆50Aug 15, 2025Updated 9 months ago
- Love Notebook, a lightweight mini program for recording couples' daily lives.☆19Dec 28, 2023Updated 2 years ago
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Feb 15, 2026Updated 3 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆55Jun 6, 2025Updated last year
- ☆70Dec 30, 2025Updated 5 months ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆17Apr 22, 2025Updated last year
- Materials of public talks given by SJTU X-LANCE members☆14Dec 3, 2022Updated 3 years ago
- China Computer Design Competition, Artificial Intelligence Challenge, National Second Prize☆19Oct 22, 2022Updated 3 years ago
- Details of the datasets for Few-shot class-incremental audio classification☆10Dec 6, 2023Updated 2 years ago
- ☆18Apr 2, 2025Updated last year
- Simple tetris game written in Kotlin☆27Jun 14, 2023Updated 2 years ago
- Template for creating audio encoders compatible with X-ARES☆19Feb 11, 2026Updated 3 months ago
- Unofficial Split Mean Flow implementation from ByteDance☆70Aug 12, 2025Updated 9 months ago
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆21Jul 4, 2021Updated 4 years ago
- ☆22Nov 6, 2023Updated 2 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 6 months ago
- Author implementation of RiDDLE: Reversible and Diversified De-identification with Latent Encryptor (CVPR 2023)☆52Jul 3, 2025Updated 11 months ago
- ☆17Jun 21, 2021Updated 4 years ago
- Huazhong University of Science and Technology Computer Networks Lab 1: a simple HTTP server implemented with socket programming☆27Nov 2, 2019Updated 6 years ago