JIA-Lab-research / DreamOmni2Links
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation''
☆2,328Updated 2 months ago
Alternatives and similar repositories for DreamOmni2
Users that are interested in DreamOmni2 are comparing it to the libraries listed below
Sorting:
- HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.☆1,296Updated 3 months ago
- Agent-ready RPA suite with out-of-the-box automation tools. Built for individuals and enterprises.☆5,462Updated this week
- A curated collection of my quantitative finance research projects. Explores sector rotation, multi-factor models, and AI-driven strategie…☆496Updated 3 weeks ago
- HunyuanVideo-1.5: A leading lightweight video generation model☆2,965Updated last week
- csghub-server is the backend server for CSGHub which helps user to manage datasets, modes, and also run Model Inference, Finetune and App…☆1,486Updated this week
- 53AI Hub is an open-source AI portal, which enables you to quickly build a operational-level AI portal to launch and operate AI agents, p…☆6,886Updated 3 weeks ago
- Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers power…☆4,108Updated this week
- Enterprise-grade, commercial-friendly agentic workflow platform for building next-generation SuperAgents.☆8,471Updated this week
- A high-performance IM server.☆4,067Updated last week
- Repository of AudioGenie☆234Updated 2 months ago
- 🔥 An agile development and testing platform designed to empower small and medium-sized enterprises to build their own R&D systems, strea…☆3,379Updated 2 weeks ago
- 全语言制品仓库,涵盖npm、Maven、PyPi、Docker、Gradle、SBT、Cocoapods、Swift、RPM、Debian、PHP、Go、Pub、Ivy、NuGet、Conda、Cargo、Conan、Yarn、GitLFS、Helm、OHPM等主流工具,涵…☆4,201Updated 2 weeks ago
- PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆2,805Updated last month
- Unified Multimodal Model for image generation/editing/understanding☆823Updated 4 months ago
- PageEyes Agent 是一个轻量级 UI Agent,通过自然语言指令驱动,无需编写脚本既可实现Web、Android平台的UI自动化任务。☆610Updated 2 weeks ago
- ☆308Updated 5 months ago
- Let's use AI to Earn!☆9,954Updated last week
- Tutorial for deep learning(AIGC)☆124Updated last month
- 🔥 A unified system resource management platform designed for administrators, serving as the foundational module for the Angus applicatio…☆1,072Updated this week
- Think Beyond Images☆549Updated 3 months ago
- 🔥 OpenAPIDesigner is an open-source OpenAPI specification design tool that allows developers to design, write, and validate OpenAPI spec…☆568Updated 2 months ago
- Official Repo For "BindWeave: Subject-Consistent Video Generation via Cross-Modal Integration"☆362Updated last month
- Moxin is a family of fully open-source and reproducible LLMs☆621Updated 6 months ago
- A fast gigapixel processing system☆2,008Updated last year
- 🔥 AngusInfra is a foundational framework for rapidly developing multi-tenant web applications, built on the Enterprise-level development…☆548Updated this week
- ☆183Updated 3 months ago
- Video generation from text&image, 1st-gen☆918Updated 7 months ago
- ✨ WithAnyone is capable of generating high-quality, controllable, and ID consistent images☆542Updated 3 weeks ago
- 🔥 JMock is a high-performance data generation and simulation component library implemented in Java.☆422Updated 2 months ago
- ⛲Imagination, Given Voice.✨☆875Updated this week