Omni2Sound — Your Multimodal Audio Generation Codebase (CVPR 2026 Highlight)
☆110Apr 25, 2026Updated last week
Alternatives and similar repositories for Omni2Sound
Users that are interested in Omni2Sound are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆81Apr 23, 2026Updated 2 weeks ago
- ☆89Oct 6, 2023Updated 2 years ago
- a systematic benchmark for best AI marketing related tools☆102Apr 4, 2026Updated last month
- Curated list of awesome AI tools for developers, content creators and office workers. Free, practical and easy to use.☆345Apr 18, 2026Updated 2 weeks ago
- 高性能数字人桌面应用框架,开箱即用,集成了AI对话与动态壁纸,即使在较低性能的设备上也能流畅运行数字人☆181Dec 22, 2025Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Making ANY Software Skill-Native -- Auto-generate production-ready AI Agent Skills for Claude Code, OpenClaw, Codex, and more.☆386Apr 6, 2026Updated last month
- FileGram: Grounding Agent Personalization in File-System Behavioral Traces☆64Apr 12, 2026Updated 3 weeks ago
- revolutionary enterprise-grade proactive AI platform for business operations☆81Apr 17, 2026Updated 2 weeks ago
- Give your AI Agent a cloud-native life. Deploy once, converse everywhere.☆272Feb 5, 2026Updated 3 months ago
- 🤖 基于深度学习的AI量化投资系统 | Vision-Based Quantitative Trading System with Deep Learning☆130Feb 27, 2026Updated 2 months ago
- DMPO: Diffusion Model Policy Optimization☆60Feb 1, 2026Updated 3 months ago
- Skills Manager 是一个基于 Tauri 构建的Windows轻量托盘应用,用于集中管理本地 AI agent skills。☆48Apr 29, 2026Updated last week
- 输入一本中文小说,提取结构化世界数据,然后以玩家身份进入同一世界。你的每个选择都会改写剧情走向。☆179Mar 23, 2026Updated last month
- Claude Code skill for improving website AEO (AI Engine Optimization) and GEO (Generative Engine Optimization) scores — 16 foundational ch…☆930Apr 24, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 集成数据分析、流程图生成、浏览器自动化、视频内容总结、服务器监控、智能图表六大智能体,支持自然语言交互和智能路由。☆116Jan 30, 2026Updated 3 months ago
- 【Accepted by ACM MM'25 🎉🎉】MS-DETR: Towards Effective Video Moment Retrieval and Highlight Detection by Joint Motion-Semantic Learning☆194Sep 26, 2025Updated 7 months ago
- Qurio brings multi-provider models, custom agents, reusable skills, MCP servers, HTTP tools, retrieval, long-term memory, Deep Research, …☆44Apr 21, 2026Updated 2 weeks ago
- Halo-Theme-AirCloud是一个简洁轻量的Halo博客主题, 旨在将中心放在博文本身.☆13Jul 10, 2024Updated last year
- Agentic Generative Engine Optimizaiton☆373Feb 24, 2026Updated 2 months ago
- A minimal, opinionated protocol for initializing new projects with both human-readable and agent-oriented documentation.☆74Apr 28, 2026Updated last week
- Universal on-premise pathology AI platform: plug in any foundation model, dataset, or cancer task hassle free.☆40Mar 16, 2026Updated last month
- A production-ready, engineering-grade Field-Oriented Control (FOC) motor drive system designed for high-performance robotic actuators, fe…☆70Apr 13, 2026Updated 3 weeks ago
- [CVPR 2026] Physical Simulator In-the-Loop Video Generation☆113Mar 25, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- KinkoClaw☆197Apr 7, 2026Updated 3 weeks ago
- Daily Chinese tech digest from Karpathy’s 90 curated blogs, with AI ranking, link analysis, and a static web reader. | 基于 Karpathy 精选 90 …☆38Feb 19, 2026Updated 2 months ago
- TaGAT For Multi-modal Retinal Image Fusion☆51Jul 31, 2024Updated last year
- Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM ju…☆126Apr 30, 2026Updated last week
- **A unified toolkit for distilling and accelerating video generation models and world models.**☆62Mar 19, 2026Updated last month
- 云原生成熟度评估☆345Apr 27, 2026Updated last week
- Official Implementation of 'OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model'☆422Mar 31, 2026Updated last month
- 按任务复杂度推进 AI 编程流程,让关键决策可追踪、产出质量可验证,并把计划、审查和历史沉淀为项目资产☆87Apr 29, 2026Updated last week
- ☆79Apr 7, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- claude code simplified (~2000 Lines)☆342Apr 2, 2026Updated last month
- **⚔️ Alicization Town** is a decentralized, multi-agent pixel sandbox world powered by the **Model Context Protocol (MCP)**. **⚔️ Ali…☆108Apr 6, 2026Updated last month
- This is the official repository for the paper "MLLM-Fabric: Multimodal Large Language Model-Driven Robotic Framework for Fabric Sorting a…☆39Oct 28, 2025Updated 6 months ago
- Scaling Autonomous Research in Medical Image Segmentation☆335Apr 14, 2026Updated 3 weeks ago
- ☆41Mar 6, 2026Updated 2 months ago
- Perceive, Predict, Verify: Continual Pre-training for Multimodal Agentic Foundation Models☆81Apr 1, 2026Updated last month
- ☆84Apr 3, 2026Updated last month