Tencent-Hunyuan / HunyuanVideo-Foley
HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.
☆1,227Updated last month
Alternatives and similar repositories for HunyuanVideo-Foley
Users who are interested in HunyuanVideo-Foley are comparing it to the repositories listed below.
- This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation'☆2,333Updated 2 weeks ago
 - ☆307Updated 3 months ago
 - 🔥 An agile development and testing platform designed to empower small and medium-sized enterprises to build their own R&D systems, strea…☆2,037Updated this week
 - 53AI Hub is an open-source AI portal, which enables you to quickly build an operational-level AI portal to launch and operate AI agents, p…☆3,209Updated last week
 - An all-language artifact repository covering mainstream tools such as npm, Maven, PyPi, Docker, Gradle, SBT, Cocoapods, Swift, RPM, Debian, PHP, Go, Pub, Ivy, NuGet, Conda, Cargo, Conan, Yarn, GitLFS, Helm, and OHPM, …☆2,528Updated 2 weeks ago
 - [NeurIPS 2025] Native-resolution diffusion Transformer☆289Updated 2 weeks ago
 - Moxin is a family of fully open-source and reproducible LLMs☆613Updated 4 months ago
 - A fast gigapixel processing system☆2,007Updated 10 months ago
 - fount (aka 豊人(ほうと)/纺铽(fǎng tè | ㄈㄤˇ ㄊㄜˋ)) is an extensible framework for building and hosting AI character interactions. Built with pure …☆846Updated last week
 - 🔥 A unified system resource management platform designed for administrators, serving as the foundational module for the Angus applicatio…☆537Updated last week
 - AIDoctor: training a medical GPT model with the ChatGPT training pipeline; implementation of Pretraining, Supervised Finetuning, RLHF (Reward Mod…☆273Updated 7 months ago
 - [NeurIPS 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent☆678Updated last week
 - Repository of AudioGenie☆226Updated last week
 - SCoralDet and SCoralDet Dataset☆127Updated last month
 - TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based …☆803Updated last week
 - PixelHacker: Image Inpainting with Structural and Semantic Consistency☆451Updated 5 months ago
 - ☆929Updated 2 months ago
 - MGM-Omni: Scaling Omni LLMs to Personalized Long-Horizon Speech☆251Updated 3 weeks ago
 - csghub-server is the backend server for CSGHub, which helps users manage datasets and models, and also run Model Inference, Finetune and App…☆1,083Updated this week
 - 🧩 IMAGHarmony 🧩: Controllable image editing with consistent object quantity and layout. A structure-aware framework that ensures high f…☆223Updated 2 weeks ago
 - Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers power…☆3,678Updated this week
 - ☆354Updated this week
 - MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement☆430Updated this week
 - A high-performance IM server.☆3,658Updated this week
 - Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆153Updated 2 months ago
 - 🔥 JMock is a high-performance data generation and simulation component library implemented in Java.☆234Updated last week
 - UI-Venus is a native UI agent designed to perform precise GUI element grounding and effective navigation using only screenshots as input.☆494Updated 2 months ago
 - Let's use AI to Earn!☆6,346Updated last week
 - 🔥 OpenAPIDesigner is an open-source OpenAPI specification design tool that allows developers to design, write, and validate OpenAPI spec…☆479Updated last week
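 - Production-grade iOS network communication and architecture in practice. A high-performance low-level communication framework built on CocoaAsyncSocket that handles on the order of ten thousand messages per day and serves real enterprise customers. Distilled from years of IM development experience and validated in production (with sensitive data removed), it traces the full evolution from a single-TCP architecture to an enterprise-grade multiplexed architecture.☆748Updated 2 months ago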