Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing
☆25Jan 13, 2026Updated 4 months ago
Alternatives and similar repositories for exploring-mmdit
Users that are interested in exploring-mmdit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Oct 30, 2024Updated last year
- Softimage like Vertex Color Ediotr☆12Jul 15, 2019Updated 6 years ago
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing☆28May 11, 2026Updated 2 weeks ago
- Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple …☆131Mar 2, 2026Updated 2 months ago
- StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation☆43Jun 6, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official repository of FlowAlign☆36May 18, 2026Updated last week
- [RSE 2025] RESTORE-DiT: Reliable satellite image time series reconstruction by multimodal sequential diffusion transformer☆58Jul 7, 2025Updated 10 months ago
- ☆25Dec 19, 2024Updated last year
- ICLR 2025 paper X-NeMo & Project X-Portrati2☆133Aug 7, 2025Updated 9 months ago
- [ICLR 2026] CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆30Nov 1, 2025Updated 6 months ago
- code for AAAI accepted paper Similarity Distribution based Membership Inference Attack on Person Re-Identification.☆11Sep 29, 2024Updated last year
- [ECCV 2024] Official Pytorch Implementation for "Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing"☆35Jun 16, 2025Updated 11 months ago
- Official repo for FaceShot: Bring Any Character into Life☆82Jun 30, 2025Updated 10 months ago
- (CVPR 2025) Scailing Down Text Encoders of Text-to-Image Diffusion Models☆52Sep 10, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios☆15Nov 19, 2024Updated last year
- JPEG-LM: LLMs as Image Generators with Canonical Codec Representations☆15Sep 29, 2024Updated last year
- Official PyTorch Implementation for Diffusion Hyperfeatures, NeurIPS 2023☆115Oct 21, 2024Updated last year
- ☆35Nov 5, 2024Updated last year
- ☆12Sep 8, 2023Updated 2 years ago
- A Python implementation of HMAC_DRBG (see, NiST SP 800-90A).☆14Nov 18, 2015Updated 10 years ago
- ☆11Jul 26, 2024Updated last year
- using rulsif for abrupt-change detection focusing on Environment, Usage, References, Introduction, Rulsif abrupt change detection.☆10Sep 3, 2025Updated 8 months ago
- MultiVariate Convolutional Neural Network☆10May 10, 2018Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆24Feb 26, 2025Updated last year
- ☆11Nov 29, 2024Updated last year
- ☆16Sep 11, 2025Updated 8 months ago
- Official implementation of "Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals" (CVPR 2026)☆37Feb 25, 2026Updated 3 months ago
- Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026 Highlight)☆57Feb 23, 2026Updated 3 months ago
- A distributed Key/Value storage, which uses client devices as a replica and stores each user data in a different partition☆16Aug 28, 2023Updated 2 years ago
- [VLM-Attack-Survey-2024] Paper list and projects for VLM attacks☆17Feb 12, 2025Updated last year
- Code of StyleCrafter on SDXL☆20Jun 25, 2024Updated last year
- A ComfyUI extension for StyleShot.☆16Apr 23, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Unofficial Implementation of E-LatentLPIPS(Ensembled-LatentLPIPS) of Diffusion2GAN☆42Jul 11, 2024Updated last year
- official code for "3D Question Answering via only 2D Vision-Language Models"☆23Mar 4, 2026Updated 2 months ago
- Tesal Stock Price Prediction Using Transformer☆34Feb 21, 2024Updated 2 years ago
- Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.☆37Jun 3, 2025Updated 11 months ago
- Official repository for GraphEQA☆25Sep 25, 2025Updated 8 months ago
- The official code of paper "Online Streaming Video Super-Resolution with Convolutional Look-Up Table".☆24Sep 8, 2024Updated last year
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year