Arhosseini77 / Brand_Attention
Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis
☆29Updated 7 months ago
Alternatives and similar repositories for Brand_Attention:
Users that are interested in Brand_Attention are comparing it to the libraries listed below
- Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran☆11Updated last year
- [WACV2025 Oral] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆63Updated last month
- Deep Generative Models, University of Tehran, Dr.Tavassolipour☆14Updated last year
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆41Updated 6 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 8 months ago
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…☆55Updated 8 months ago
- Official PyTorch Code and Models of "Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling", ICME 2024☆49Updated 5 months ago
- ☆17Updated 2 years ago
- ☆23Updated 5 months ago
- LiVOS: Light Video Object Segmentation with Gated Linear Matching (CVPR 2025)☆28Updated 3 weeks ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆33Updated last year
- VimTS: A Unified Video and Image Text Spotter☆77Updated 4 months ago
- ☆64Updated 5 months ago
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆24Updated 2 months ago
- Masked Vision-Language Transformer in Fashion☆33Updated last year
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆35Updated 7 months ago
- Source code of the TextLap model, a LLM for text-2-layout generation.☆14Updated 5 months ago
- Official Code for GazeGNN: A Gaze-guided Graph Neural Network for Chest X-ray Classification [WACV 2024]☆15Updated last year
- More dimensions = More fun☆21Updated 8 months ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆25Updated 2 months ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆21Updated 2 months ago
- [NeurIPS 2024] Official implementation of "Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models."☆52Updated 3 months ago
- FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆14Updated 2 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆62Updated 11 months ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆84Updated last month
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆64Updated 6 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated last year
- Bilingual Medical Mixture of Experts LLM☆31Updated 4 months ago
- ☆13Updated 6 months ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆16Updated 8 months ago