[CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
☆17Oct 4, 2025Updated 8 months ago
Alternatives and similar repositories for LocalizationHeads
Users that are interested in LocalizationHeads are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [MICCAI 2024 Spotlight✨] Official Pytorch Code for Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning☆13Sep 4, 2024Updated last year
- [WACV 2025 ORAL] Official Pytorch Code for DragText: Rethinking Text Embedding in Point-based Image Editing☆14Jan 22, 2025Updated last year
- [ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…☆27Apr 3, 2025Updated last year
- ☆15Aug 28, 2024Updated last year
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"☆95Mar 9, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation☆24Jul 12, 2022Updated 3 years ago
- Implementation of various image-to-image translation models for photoacoustic imaging reconstruction.☆15Jan 15, 2026Updated 4 months ago
- [MICCAI 2024 Early Acceptance] Official Pytorch Code for Slice-Consistent 3D Volumetric Brain CT-to-MRI Translation with 2D Brownian Brid…☆63Jan 7, 2025Updated last year
- [CVPR 2024 Highlight✨] Official Pytorch Code for EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation☆94Sep 12, 2024Updated last year
- [NAACL 2025] Official Code Repository for the paper "Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval"☆22Jul 13, 2025Updated 10 months ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval☆14Sep 18, 2025Updated 8 months ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models☆111Feb 16, 2025Updated last year
- [CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation☆50Mar 27, 2025Updated last year
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models☆19Mar 25, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆14Apr 25, 2025Updated last year
- [AAAI 2024] SVDP: Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction☆33Apr 26, 2024Updated 2 years ago
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆75Aug 31, 2025Updated 9 months ago
- The Yahoo Finance Agent is an application that combines OpenAI's LLMs, the Yahoo Finance Python library, and LangChain's tools to provide…☆29Aug 10, 2024Updated last year
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆75Jan 13, 2026Updated 4 months ago
- Java web application backed by the Ethereum-Blockchain network. Powered by RESTful web services (JAX-RS && Spring Boot) , Docker, Kuberne…☆14Feb 19, 2019Updated 7 years ago
- [ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval☆26Feb 13, 2025Updated last year
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆41Oct 2, 2022Updated 3 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Oct 13, 2024Updated last year
- Rare-to-Frequent (R2F), ICLR'25, Spotlight☆53Apr 23, 2025Updated last year
- For the rlhf learning environment of Koreans☆25Sep 25, 2023Updated 2 years ago
- ☆11Jul 26, 2024Updated last year
- Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing☆25Jan 13, 2026Updated 4 months ago
- ☆17Sep 11, 2025Updated 9 months ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆94Jun 24, 2024Updated last year
- ☆12Aug 7, 2024Updated last year
- ☆67Jan 4, 2026Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 이론, 실무, 실전을 곁들인 인과추론☆27Aug 31, 2025Updated 9 months ago
- Object counting and instance segmentation with image-level supervision, in CVPR 2019☆12May 9, 2019Updated 7 years ago
- [AAAI 2025] Official Implementation of I-HallA v1.0☆16Feb 2, 2025Updated last year
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs☆188Dec 14, 2025Updated 5 months ago
- [NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM☆25Feb 10, 2026Updated 4 months ago
- Claude skill for finding ML research papers.☆213Apr 14, 2026Updated last month
- pdfChain: (experimental) blockchain for the masses☆16Feb 14, 2026Updated 3 months ago