Arhosseini77 / Brand_Attention
Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis
☆27Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for Brand_Attention
- Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran☆11Updated 9 months ago
- [WACV2025] SUM: Saliency Unification through Mamba for Visual Attention Modeling☆42Updated 2 months ago
- [NeurIPS 2024] Official implementation of "Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models."☆37Updated 3 weeks ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Updated 11 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆39Updated 3 months ago
- Official Code for GazeGNN: A Gaze-guided Graph Neural Network for Chest X-ray Classification [WACV 2024]☆15Updated last year
- Masked Vision-Language Transformer in Fashion☆33Updated last year
- The official Pytorch implementation of “BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation”☆28Updated last month
- [ICLR 2024] Official code for the paper "LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts"☆69Updated 6 months ago
- [ICML 2024] Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization☆16Updated 2 months ago
- (CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"☆69Updated 7 months ago
- Official PyTorch implementation of "λ-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space"☆44Updated 7 months ago
- ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆33Updated 4 months ago
- Interactive Video Generation via Masked-Diffusion☆70Updated 7 months ago
- [ECCV 2024🔥] The official code for the paper AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors.☆44Updated 4 months ago
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆89Updated 6 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆40Updated last month
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆36Updated last month
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆44Updated 2 months ago
- [ECCV 2024] - ScanTalk: 3D Talking Heads from Unregistered Scans☆26Updated 3 weeks ago
- The codes of Siggraph Asia 2024 paper "Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation"☆34Updated 2 months ago
- Official PyTorch implementation for the paper Generalizable Face Landmarking Guided by Conditional Face Warping (CVPR 2024).☆20Updated this week
- ☆16Updated 3 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆52Updated 2 months ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Updated last year
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆75Updated 7 months ago
- Code for the paper "Pix2Video: Video Editing using Image Diffusion"☆64Updated last year
- ☆40Updated 11 months ago
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models☆34Updated 7 months ago
- (WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, B…☆81Updated 2 months ago