hananshafi/llmblueprint

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hananshafi/llmblueprint)

hananshafi / llmblueprint

[ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"

☆85

Alternatives and similar repositories for llmblueprint

Users that are interested in llmblueprint are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HashmatShadab / MambaRobustness
View on GitHub
[CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"
☆26Jun 8, 2025Updated last year
fahadshamshad / deep-facial-privacy-prior
View on GitHub
[ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".
☆12Oct 11, 2024Updated last year
rohit901 / VANE-Bench
View on GitHub
[NAACL'25] Contains code and documentation for our VANE-Bench paper.
☆24Aug 19, 2025Updated 11 months ago
Razaimam45 / TTL-Test-Time-Low-Rank-Adaptation
View on GitHub
Official code repository of paper titled "Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Visio…
☆34May 11, 2025Updated last year
Muhammad-Huzaifaa / ObjectCompose
View on GitHub
[ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes 🚀🚀🚀
☆37Jan 21, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
akhtarvision / weather-regional
View on GitHub
☆11Oct 29, 2024Updated last year
ShahinaKK / LWI-VMS
View on GitHub
Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]
☆22Oct 27, 2024Updated last year
ShahinaKK / LG_SDG
View on GitHub
Language Grounded Single Source Domain Generalization in Medical Image Segmentation [ISBI2024]
☆33Oct 27, 2024Updated last year
mbzuai-oryx / CVRR-Evaluation-Suite
View on GitHub
[CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…
☆50Aug 23, 2024Updated last year
abdohelmy / D-3Former
View on GitHub
Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".
☆25Jul 10, 2023Updated 3 years ago
hananshafi / MTL-ViT
View on GitHub
A new multi-task learning framework using Vision Transformers
☆11Jun 19, 2024Updated 2 years ago
HashmatShadab / APR
View on GitHub
(BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" …
☆35Jan 8, 2023Updated 3 years ago
asif-hanif / vafa
View on GitHub
[MICCAI 2023] Official code repository of paper titled "Frequency Domain Adversarial Training for Robust Volumetric Medical Segmentation"…
☆52Nov 14, 2023Updated 2 years ago
jameelhassan / PromptAlign
View on GitHub
[NeurIPS 2023] Align Your Prompts: Test-Time Prompting with Distribution Alignment for Zero-Shot Generalization
☆107Feb 11, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
mbzuai-oryx / VideoGLaMM
View on GitHub
[CVPR 2025 🔥]A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
☆104Apr 14, 2025Updated last year
tsunghan-wu / SLD
View on GitHub
🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)
☆187Apr 9, 2024Updated 2 years ago
amandpkr / XM-GAN
View on GitHub
[MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Cla…
☆47Sep 28, 2023Updated 2 years ago
Muhammad-Ibraheem-Siddiqui / PerSense
View on GitHub
[BMVC 2025] Official Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"
☆31Dec 18, 2025Updated 7 months ago
HashmatShadab / Robustness-of-Volumetric-Medical-Segmentation-Models
View on GitHub
[BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
☆15Nov 1, 2024Updated last year
akhtarvision / cal-detr
View on GitHub
☆42Nov 9, 2023Updated 2 years ago
TonyLianLong / LLM-groundedDiffusion
View on GitHub
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusi…
☆483Sep 9, 2024Updated last year
aminebdj / 3D-OWIS
View on GitHub
[NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …
☆68Dec 3, 2023Updated 2 years ago
BioMedIA-MBZUAI / MedPromptX
View on GitHub
☆71Jul 2, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
mbzuai-oryx / Video-LLaVA
View on GitHub
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
☆264Aug 5, 2025Updated 11 months ago
muzairkhattak / transformers-transforming-vision
View on GitHub
Validating image classification benchmark results on ViTs and ResNets (v2)
☆13Nov 3, 2022Updated 3 years ago
fahadshamshad / Clip2Protect
View on GitHub
[CVPR 2023] Official repository of paper titled "CLIP2Protect: Protecting Facial Privacy using Text-Guided Makeup via Adversarial Latent …
☆105Mar 25, 2024Updated 2 years ago
YeLuoSuiYou / openstorypp
View on GitHub
We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.
☆18Aug 30, 2024Updated last year
mbzuai-oryx / ALM-Bench
View on GitHub
[CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…
☆47May 26, 2025Updated last year
HashmatShadab / Robust-LLaVA
View on GitHub
[ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models
☆29Oct 20, 2025Updated 9 months ago
OmkarThawakar / composed-video-retrieval
View on GitHub
Composed Video Retrieval
☆62May 2, 2024Updated 2 years ago
hananshafi / MedContext
View on GitHub
[MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"
☆14Nov 1, 2024Updated last year
amandpkr / GMNR
View on GitHub
(ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.
☆18Sep 28, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
muzairkhattak / ViFi-CLIP
View on GitHub
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
☆309Apr 3, 2024Updated 2 years ago
mmaaz60 / mdef_detr
View on GitHub
☆11May 9, 2023Updated 3 years ago
mbzuai-oryx / VideoMolmo
View on GitHub
Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing"
☆56Jul 5, 2025Updated last year
muzairkhattak / multimodal-prompt-learning
View on GitHub
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
☆819Jul 24, 2023Updated 3 years ago
AiMl-hub / UPLM
View on GitHub
Uncertainty-Guided Pseudo-Labelling with Model Averaging
☆11Mar 17, 2026Updated 4 months ago
mlpc-ucsd / TokenCompose
View on GitHub
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
☆137Dec 21, 2024Updated last year
mbzuai-oryx / BiMediX
View on GitHub
Bilingual Medical Mixture of Experts LLM
☆33Nov 23, 2024Updated last year