🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.
☆79May 2, 2026Updated 3 weeks ago
Alternatives and similar repositories for Unify-Agent
Users who are interested in Unify-Agent are comparing it to the repositories listed below.
- Tongji University Class of 2019 database course design project☆11Sep 11, 2021Updated 4 years ago
- [CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs…☆54Apr 10, 2026Updated last month
- [Preprint] Self-Adversarial One Step Generation via Condition Shifting☆52Apr 15, 2026Updated last month
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆47Jul 22, 2025Updated 10 months ago
- ☆14May 13, 2026Updated last week
- official github code for "SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing"☆125Apr 27, 2026Updated 3 weeks ago
- user-friendly Qt desktop application that enables seamless interaction with various AI language models using the Ollama backend.☆20Mar 22, 2025Updated last year
- (Siggraph Asia 2025) Code of "LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization"☆31Dec 29, 2025Updated 4 months ago
- The Official PyTorch implementation of EigenLoRAx: Recycling Adapters to Find Principal Subspaces for Resource-Efficient Adaptation and I…☆25Feb 4, 2026Updated 3 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 5 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆32Feb 10, 2026Updated 3 months ago
- Undergraduate course code from the University of Electronic Science and Technology of China.☆16Dec 31, 2023Updated 2 years ago
- Code for paper "PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation" (ECCV 2024)☆18Nov 18, 2024Updated last year
- E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker☆55Apr 16, 2026Updated last month
- ☆11Sep 19, 2025Updated 8 months ago
- ☆30Oct 8, 2025Updated 7 months ago
- DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning…☆30Sep 7, 2025Updated 8 months ago
- dataset, environment, and other resources for mrCAD paper☆24Sep 19, 2025Updated 8 months ago
- Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"☆63Mar 23, 2026Updated 2 months ago
- The code repository of UniRL☆52May 30, 2025Updated 11 months ago
- Mixture of Lora Experts☆11Apr 7, 2024Updated 2 years ago
- ☆14Jul 17, 2025Updated 10 months ago
- Implementation of the following estimation algorithms in Python: Kalman Filter, Extended Kalman Filter, Unscented Kalman Filter, Cubature Kal…☆11Dec 2, 2023Updated 2 years ago
- TESGNN: 3D Temporal Equivariant Scene Graph Neural Networks (published at TMLR)☆14Nov 2, 2025Updated 6 months ago
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 9 months ago
- [ACL 2026 Findings, ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆119Apr 8, 2026Updated last month
- ANDROID APP that can RECOGNIZE VLC LIVE AUDIO/VIDEO STREAMING (using free Android Developers Speech Recognition API) then TRANSLATE (usin…☆13May 5, 2024Updated 2 years ago
- ☆15Oct 24, 2024Updated last year
- Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation☆15Mar 28, 2026Updated last month
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated last year
- ☆14Jan 22, 2025Updated last year
- [ICML'26] VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆57May 18, 2026Updated last week
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆24Dec 10, 2025Updated 5 months ago
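- EasyTTS is a convenient tool for calling OpenAI's text-to-speech (TTS) functionality through third-party API services. EasyTTS lets users enter text and choose among different models, voices, and formats to generate audio files.☆10Nov 26, 2023Updated 2 years ago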
- (3DV 2026) Pytorch implementation of “InterPose: Learning to Generate Human-Object Interactions from Large-Scale Web Videos”☆26Mar 16, 2026Updated 2 months ago
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking☆38Jan 20, 2026Updated 4 months ago
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".☆12Nov 13, 2024Updated last year
- [NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning☆106Sep 19, 2025Updated 8 months ago