[ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing
☆25Jan 27, 2026Updated last month
Alternatives and similar repositories for DIM
Users that are interested in DIM are comparing it to the libraries listed below
Sorting:
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 2 years ago
- [CVPR 2025] DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles☆29May 13, 2025Updated 9 months ago
- [ICCV 2025] Official repository of DiffSim: Taming Diffusion Models for Evaluating Visual Similarity☆29Jul 14, 2025Updated 7 months ago
- [ICLR 2025] Official lmplementation of SPM-Diff: Incorporating Visual Correspondence into Diffusion Model for Virtual Try-On☆48Mar 3, 2025Updated 11 months ago
- [ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes☆20Feb 17, 2026Updated last week
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆28Feb 18, 2026Updated last week
- ☆33Nov 25, 2025Updated 3 months ago
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆110Jul 27, 2025Updated 7 months ago
- Simple Implementation of the CVPR 2024 Paper "JointSQ: Joint Sparsification-Quantization for Distributed Learning"☆11Dec 29, 2024Updated last year
- ☆22Dec 23, 2025Updated 2 months ago
- A powerful integration that combines Browserbase's Stagehand with Mastra for advanced web automation, scraping, and AI-powered web intera…☆33Feb 4, 2026Updated 3 weeks ago
- DragMesh: Interactive 3D Generation Made Easy☆20Dec 28, 2025Updated last month
- [NeurIPS 2025] CodeCrash: Exposing LLM Fragility to Misleading Natural Language in Code Reasoning☆16Jan 24, 2026Updated last month
- Tusk Drift Demo - Node.js Service☆58Jan 20, 2026Updated last month
- ☆17Aug 5, 2025Updated 6 months ago
- Community maintained hardware plugin for vLLM on AWS Neuron☆23Feb 11, 2026Updated 2 weeks ago
- Video to Text Translation + VTT Subtitle Generation + WebService☆13Feb 11, 2024Updated 2 years ago
- a viewer for for lancedb. including some actions like CRUD etc☆12Apr 27, 2025Updated 10 months ago
- Hybrid Deep-learning and Iterative Reconstruction Scheme for Medical Imaging Reconstruction☆11Sep 26, 2023Updated 2 years ago
- ☆11Oct 22, 2025Updated 4 months ago
- Exercises for the Dafny Tutorial☆14May 21, 2018Updated 7 years ago
- A simple and efficient .NET library for accessing Anthropic's Claude AI API. This community-provided library allows you to easily integra…☆10Oct 10, 2024Updated last year
- Blockscout Docker image☆10Apr 29, 2020Updated 5 years ago
- ☆16Dec 10, 2025Updated 2 months ago
- [ICCV 2025] Balanced Image Stylization with Style Matching Score☆67Sep 30, 2025Updated 5 months ago
- Codes for our WACV2017 paper: "On Geometric Features for Skeleton-Based Action Recognition Using Multilayer LSTM Networks"☆10Jul 7, 2020Updated 5 years ago
- Repository of GUI Action Narrator☆12Apr 8, 2025Updated 10 months ago
- ☆18Jan 8, 2026Updated last month
- [CVPR 2022] Sequential Voting with Relational Box Fields for Active Object Detection☆10Jun 19, 2022Updated 3 years ago
- Asynchronous Data-Driven SynapseCore Engine for Real-Time Predictive-Modeling and Scalable Data-Enrichment on a Microservices Platform.☆44Feb 8, 2026Updated 2 weeks ago
- LangBot Plugin Infra including plugin runtime, SDK and CLI tools.☆21Updated this week
- backend wrapper for memU☆35Updated this week
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 5 months ago
- The purpose of this command line tool is to support the conversion of an elementary stream file into a format that can be placed directly…☆11Oct 27, 2025Updated 4 months ago
- torchvision-based transforms that provide access to parameterization☆16Dec 4, 2025Updated 2 months ago
- ☆21Jan 8, 2026Updated last month
- Template repository for the Werewolf hackathon☆18Nov 9, 2024Updated last year
- The Deep Supervised Hashing for Image Retrieval on CIFAR10/MNIST/Fashion-MNIST☆12Nov 23, 2017Updated 8 years ago
- Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]☆26Updated this week