I2-Multimedia-Lab/Magnet

Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function" [NeurIPS 2024]

☆31

Alternatives and similar repositories for Magnet

Users who are interested in Magnet are comparing it to the libraries listed below.

I2-Multimedia-Lab / PL2-Transformer
Official implementation of "Point Long-Term Locality-Aware Transformer for Point Cloud Video Understanding"
☆28 · Mar 24, 2026 · Updated 3 months ago
renjie3 / MemAttn
☆16 · Feb 23, 2025 · Updated last year
I2-Multimedia-Lab / Simba
[AAAI2026] An official repository of Simba: Towards High-Fidelity and Geometrically-Consistent Point Cloud Completion via Transformation …
☆16 · Nov 22, 2025 · Updated 8 months ago
hutaiHang / ToMe
[NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
☆86 · Feb 3, 2025 · Updated last year
MischaD / BeyondFID
A Python package to streamline evaluation of unconditional image generation models
☆17 · Apr 14, 2026 · Updated 3 months ago
RoyiRa / Linguistic-Binding-in-Diffusion-Models
☆82 · Nov 25, 2024 · Updated last year
I2-Multimedia-Lab / DSAT
[TMM 2024] The code of 'Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution'.
☆28 · Mar 24, 2024 · Updated 2 years ago
chunsanHong / MemBench_code
☆12 · Sep 30, 2024 · Updated last year
luping-liu / LongAlign
The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)
☆83 · Apr 23, 2025 · Updated last year
luping-liu / Detector-Guidance
The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG)
☆20 · Feb 7, 2024 · Updated 2 years ago
WangWenhao0716 / PDF-Embedding
[NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"
☆18 · Oct 1, 2024 · Updated last year
liuting20 / DARA
[ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
☆22 · Feb 26, 2025 · Updated last year
I2-Multimedia-Lab / PoLoPCAC
[Under Review] Efficient and Generic Point Model for Lossless Point Cloud Attribute Compression
☆40 · Apr 11, 2024 · Updated 2 years ago
ssyang2020 / ZeroSmooth
☆66 · Jun 4, 2024 · Updated 2 years ago
Ayews / M3Net
The implementation of 'M3Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection'.
☆12 · Apr 18, 2025 · Updated last year
feifeiobama / Awesome-Text-to-Video-Generation
A curated list of Text-to-Video Generation papers and BibTeX entries
☆21 · Feb 21, 2024 · Updated 2 years ago
YaNgZhAnG-V5 / attention_regulation
[ECCV24] Attention Regulation on T2I Diffusion Models
☆19 · Jul 8, 2024 · Updated 2 years ago
ZzZZCHS / WS-3DVG
[ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
☆14 · Oct 2, 2024 · Updated last year
I2-Multimedia-Lab / 360-video-experimental-platform
(TMM 2022) Quality Assessment for Omnidirectional Video: A Spatio-Temporal Distortion Modeling Approach
☆12 · Jun 16, 2021 · Updated 5 years ago
zyxElsa / MotionCrafter
Official implementation of the paper "MotionCrafter: One-Shot Motion Customization of Diffusion Models"
☆29 · Jan 4, 2024 · Updated 2 years ago
KwonGihyun / TweedieMix
Official source code of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)
☆62 · Jan 22, 2025 · Updated last year
jiangmengli / MetaMask
☆13 · Sep 16, 2022 · Updated 3 years ago
sunalbert / lucid.pytorch
A neural network visualization toolkit for PyTorch
☆14 · Feb 2, 2018 · Updated 8 years ago
clf28 / Detail-plus-plus
[IEEE TIP] Official implementation of Progressive Detail Injection for Training-Free Semantic Binding in Text-to-Image Generation
☆33 · Aug 3, 2025 · Updated 11 months ago
prescient-design / CBGM
Concept-based generative models
☆12 · Dec 13, 2024 · Updated last year
krafton-ai / Rare-to-Frequent
Rare-to-Frequent (R2F), ICLR'25, Spotlight
☆53 · Apr 23, 2025 · Updated last year
zhangxulu1996 / Compositional-Inversion
Compositional Inversion for Stable Diffusion Models (AAAI 2024)
☆37 · Feb 26, 2025 · Updated last year
CASIA-IVA-Lab / SC-Tune
Official code for the CVPR 2024 paper "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"
☆16 · Apr 22, 2024 · Updated 2 years ago
shiwt03 / MUSTER
A Multi-scale Transformer-based Decoder for Semantic Segmentation
☆21 · Aug 16, 2023 · Updated 2 years ago
V-Sense / colornet-estimating-colorfulness
ColorNet: A learning-based colorfulness estimator for natural images
☆18 · Sep 11, 2019 · Updated 6 years ago
louisYen / Gen4Gen
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
☆110 · Mar 27, 2026 · Updated 3 months ago
clement-bonnet / text-to-pose
Paper: "From Text to Pose to Image: Improving Diffusion Model Control and Quality"
☆60 · Nov 30, 2024 · Updated last year
rishubhpar / PreciseControl
This repo contains the code for the PreciseControl project [ECCV'24]
☆71 · Oct 6, 2024 · Updated last year
YangLing0818 / RealCompo
[NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models
☆121 · Nov 14, 2024 · Updated last year
yangzhao1230 / GraphTextRetrieval
Part of official implementation of "Natural language-informed learning of molecule graphs"
☆18 · Jul 17, 2023 · Updated 3 years ago
jiuntian / interactdiffusion
[CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".
☆126 · Jun 18, 2025 · Updated last year
I2-Multimedia-Lab / UGRAN
[TIP2025] The implementation of "Uncertainty Guided Refinement for Fine-grained Salient Object Detection"
☆18 · Apr 20, 2025 · Updated last year
UCSC-VLAA / MixCon3D
[CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
☆35 · Apr 21, 2024 · Updated 2 years ago
Amazingren / NTIRE2026_ESR
(CVPRW2026) Solution of the NTIRE 2026 Challenge on Efficient Super-Resolution
☆16 · Jun 28, 2026 · Updated 3 weeks ago