Arhosseini77 / Brand_Attention
Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis
☆26Updated last month
Related projects: ⓘ
- Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran☆11Updated 7 months ago
- [WACV2025] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆33Updated 3 weeks ago
- Official PyTorch Code and Models of "Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling", ICME 2024☆39Updated 6 months ago
- TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder☆20Updated last week
- ☆18Updated 6 months ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆17Updated 9 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆61Updated 4 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆30Updated 6 months ago
- ☆33Updated last week
- The codes of Siggraph Asia 2024 paper "Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation"☆25Updated 3 weeks ago
- Official PyTorch Implementation of "Scalable Autoregressive Image Generation with Mamba"☆95Updated 3 weeks ago
- Vico: Compositional Video Generation as Flow Equalization☆45Updated 2 months ago
- ☆58Updated 11 months ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆12Updated 8 months ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆30Updated last month
- Fast Sprite Decomposition from Animated Graphics [ECCV2024]☆25Updated 3 weeks ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆10Updated 3 weeks ago
- ☆15Updated last year
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆42Updated last week
- [ICLR 2024] Contextualized Diffusion Models for Text-Guided Image and Video Generation☆56Updated 3 months ago
- ☆38Updated 9 months ago
- ☆14Updated 8 months ago
- ☆65Updated this week
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆48Updated 2 weeks ago
- MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning☆52Updated last week
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆65Updated 4 months ago
- ☆17Updated 9 months ago
- Official implementation of "Perturbed-Attention Guidance"☆50Updated 2 months ago
- Diffusion base mining☆37Updated this week
- Official repo for the paper "Correcting Diffusion Generation through Resampling"☆27Updated 9 months ago