endo-yuki-t / MAG
PyTorch implementation of "Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation" [The Visual Computer]
☆25Jan 7, 2025Updated last year
Alternatives and similar repositories for MAG
Users who are interested in MAG are comparing it to the repositories listed below
- ☆18Jan 19, 2026Updated 3 weeks ago
- ☆17Aug 8, 2024Updated last year
- ☆24Sep 12, 2023Updated 2 years ago
- Accompanying notebook and sources to "A Guide to Pseudolabelling: How to get a Kaggle medal with only one model" (Dec. 2020 PyData Boston…☆28Jul 5, 2022Updated 3 years ago
- ☆29Jun 10, 2024Updated last year
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆19Jan 24, 2026Updated 3 weeks ago
- HuggingFace diffusers' pipeline to run ZestGuide☆43Mar 19, 2024Updated last year
- ☆10Jul 4, 2024Updated last year
- ☆14Nov 23, 2024Updated last year
- ROS wrapper of Nvidia Contact-graspnet model.☆17Jul 3, 2023Updated 2 years ago
- Implementation for "L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors"☆44Jun 16, 2025Updated 8 months ago
- Official PyTorch implementation for "Where You Edit is What You Get: Text-Guided Image Editing with Region-Based Attention" (Pattern Reco…☆10Oct 1, 2024Updated last year
- Sciter Preact Go Starter☆10Oct 13, 2021Updated 4 years ago
- [IROS 2022] Transporters with Visual Foresight (TVF)☆11Jul 25, 2022Updated 3 years ago
- ☆10Sep 12, 2024Updated last year
- ☆12May 16, 2024Updated last year
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated last year
- ☆12Jun 30, 2023Updated 2 years ago
- Implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023).☆11Jul 19, 2023Updated 2 years ago
- ☆12Feb 2, 2024Updated 2 years ago
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation☆50Aug 1, 2024Updated last year
- ☆13May 27, 2025Updated 8 months ago
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)☆12Jan 17, 2023Updated 3 years ago
- An Official Implementation for the Paper 'Point Beyond Class: A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-…☆18Oct 20, 2022Updated 3 years ago
- [AAAI 2024] An official implementation of the paper "LINGO-Space: Language-Conditioned Incremental Grounding for Space"☆13Jul 1, 2024Updated last year
- The codes of our paper "EasyInv: Toward Fast and Better DDIM Inversion"☆14Jun 1, 2025Updated 8 months ago
- [NeurIPS 2023] Federated Learning with Bilateral Curation for Partially Class-Disjoint Data☆14Aug 1, 2025Updated 6 months ago
- Repository with environment and training scripts for paper "Cross-Environment-Cooperation Enables Zero-shot Multi-agent Cooperation"☆18Sep 12, 2025Updated 5 months ago
- ☆16Mar 27, 2024Updated last year
- Objective metrics for measuring visual texture similarity using STSIM features. Supervised by Thrasos Pappas.☆15Oct 4, 2023Updated 2 years ago
- Car-side app, mainly responsible for video recording and transmission.☆12Dec 8, 2017Updated 8 years ago
- Official code for 'One-Shot Object Localization in Medical Images based on Relative Position Regression'.☆12Sep 10, 2022Updated 3 years ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆53Sep 26, 2025Updated 4 months ago
- FreeDA: Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation (CVPR 2024)☆49Aug 28, 2024Updated last year
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation"☆18Jul 7, 2024Updated last year
- The official repository of EffiVED☆19Jun 5, 2024Updated last year
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Mar 31, 2022Updated 3 years ago
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆12Oct 11, 2024Updated last year
- SLSDeep: Skin Lesion Segmentation Based on Dilated Residual and Pyramid Pooling Networks☆14Jun 28, 2018Updated 7 years ago