Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Function" [NeurIPS 2024]
☆30Dec 2, 2024Updated last year
Alternatives and similar repositories for Magnet
Users that are interested in Magnet are comparing it to the libraries listed below
Sorting:
- [AAAI 2024] Transformer-Based No-Reference Image Quality Assessment via Supervised Contrastive Learning☆14Feb 20, 2024Updated 2 years ago
- TMM 2024 The code of ' Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution '.☆28Mar 24, 2024Updated last year
- ☆10Sep 30, 2024Updated last year
- The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG)☆20Feb 7, 2024Updated 2 years ago
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- [Under Review] Efficient and Generic Point Model for Lossless Point Cloud Attribute Compression☆40Apr 11, 2024Updated last year
- The implementation of 'M3Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection'.☆12Apr 18, 2025Updated 11 months ago
- A curated list of Text-to-Video Generation papers and BibTeX entries☆21Feb 21, 2024Updated 2 years ago
- Official implementation of the paper "MotionCrafter: One-Shot Motion Customization of Diffusion Models"☆29Jan 4, 2024Updated 2 years ago
- Code for "Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding" (ICCV 2023)☆14Oct 2, 2024Updated last year
- ☆17Jul 23, 2024Updated last year
- (TMM 2022)Quality Assessment for Omnidirectional Video: A Spatio-Temporal Distortion Modeling Approach☆12Jun 16, 2021Updated 4 years ago
- [CVPR 2024] Code for "Improved Visual Grounding through Self-Consistent Explanations".☆27Mar 1, 2024Updated 2 years ago
- A Multi-scale Transformer-based Decoder for Semantic Segmentation☆20Aug 16, 2023Updated 2 years ago
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆64Jul 31, 2025Updated 7 months ago
- Exploring Content-Aware Strategies in Change Detection for Robust Bi-Temporal Image Analysis☆16Jul 13, 2025Updated 8 months ago
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆37Feb 26, 2025Updated last year
- ColorNet: A learning-based colorfulness estimator for natural images☆18Sep 11, 2019Updated 6 years ago
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing☆17Mar 16, 2025Updated last year
- This repo uses yolov5 to detect the door handle☆10Apr 20, 2021Updated 4 years ago
- Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"☆16Apr 22, 2024Updated last year
- (ACM MM 2022 Workshop APCCPA) IPDAE: Improved Patch-Based Deep Autoencoder for Lossy Point Cloud Geometry Compression.☆16Mar 15, 2024Updated 2 years ago
- This repo contains the code for PreciseControl project [ECCV'24]☆70Oct 6, 2024Updated last year
- Part of official implementation of "Natural language-informed learning of molecule graphs"☆18Jul 17, 2023Updated 2 years ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆36Apr 21, 2024Updated last year
- TACO: TFBS-Aware Cis-Regulatory Element Optimization☆21Aug 1, 2025Updated 7 months ago
- ☆15Mar 30, 2025Updated 11 months ago
- Website for the MIT/Harvard Computational Neuroscience Journal Club☆11Apr 7, 2025Updated 11 months ago
- Official implementation for the CVPR 2024 paper CAMEL☆20Jun 20, 2024Updated last year
- ☆44Mar 12, 2026Updated last week
- ☆19Apr 1, 2025Updated 11 months ago
- FreeCond: A Free Lunch for Input Conditions in Text-Guided Inpainting. FreeCond introduces a more generalized form💪 of the original inpa…☆15May 22, 2025Updated 9 months ago
- [TMM 2023] Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token☆14Mar 21, 2024Updated last year
- (ECCV 2024) Official implementation of Paper ''DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation''☆39Oct 24, 2024Updated last year
- 🌎NUAA 2018 网络安全 - 端口扫描☆11Jul 2, 2018Updated 7 years ago
- [ICLR ML4RS 2025] Official implementation for the paper "Tackling Few-Shot Segmentation in Remote Sensing via Inpainting Diffusion Model"☆14Feb 2, 2026Updated last month
- ☆83Nov 25, 2024Updated last year
- K. Chi, Y. Yuan, and Q. Wang*, “Trinity-Net: Gradient-Guided Swin Transformer-Based Remote Sensing Image Dehazing and Beyond,” IEEE Trans…☆11Jan 31, 2023Updated 3 years ago
- Official implementation of SimFlow☆27Dec 16, 2025Updated 3 months ago