[CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
☆17 · Oct 4, 2025 · Updated 6 months ago
Alternatives and similar repositories for LocalizationHeads
Users that are interested in LocalizationHeads are comparing it to the libraries listed below.
- [MICCAI 2024 Spotlight✨] Official Pytorch Code for Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning · ☆12 · Sep 4, 2024 · Updated last year
- [WACV 2025 ORAL] Official Pytorch Code for DragText: Rethinking Text Embedding in Point-based Image Editing · ☆14 · Jan 22, 2025 · Updated last year
- [ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm… · ☆26 · Apr 3, 2025 · Updated last year
- ☆15 · Aug 28, 2024 · Updated last year
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs" · ☆92 · Mar 9, 2025 · Updated last year
- Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation · ☆24 · Jul 12, 2022 · Updated 3 years ago
- Implementation of various image-to-image translation models for photoacoustic imaging reconstruction. · ☆15 · Jan 15, 2026 · Updated 3 months ago
- [CVPR 2024 Highlight✨] Official Pytorch Code for EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation · ☆93 · Sep 12, 2024 · Updated last year
- [MICCAI 2024 Early Acceptance] Official Pytorch Code for Slice-Consistent 3D Volumetric Brain CT-to-MRI Translation with 2D Brownian Brid… · ☆63 · Jan 7, 2025 · Updated last year
- [NAACL 2025] Official Code Repository for the paper "Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval" · ☆22 · Jul 13, 2025 · Updated 9 months ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval · ☆14 · Sep 18, 2025 · Updated 7 months ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models · ☆106 · Feb 16, 2025 · Updated last year
- [CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation · ☆49 · Mar 27, 2025 · Updated last year
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models · ☆19 · Mar 25, 2025 · Updated last year
- ☆14 · Apr 25, 2025 · Updated last year
- [AAAI 2024] SVDP: Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction · ☆33 · Apr 26, 2024 · Updated 2 years ago
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding · ☆70 · Aug 31, 2025 · Updated 8 months ago
- The Yahoo Finance Agent is an application that combines OpenAI's LLMs, the Yahoo Finance Python library, and LangChain's tools to provide… · ☆28 · Aug 10, 2024 · Updated last year
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models" · ☆74 · Jan 13, 2026 · Updated 3 months ago
- Java web application backed by the Ethereum-Blockchain network. Powered by RESTful web services (JAX-RS && Spring Boot) , Docker, Kuberne… · ☆14 · Feb 19, 2019 · Updated 7 years ago
- [ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval · ☆26 · Feb 13, 2025 · Updated last year
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022) · ☆41 · Oct 2, 2022 · Updated 3 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers. · ☆12 · Nov 7, 2024 · Updated last year
- Rare-to-Frequent (R2F), ICLR'25, Spotlight · ☆53 · Apr 23, 2025 · Updated last year
- ☆11 · Oct 13, 2024 · Updated last year
- For the rlhf learning environment of Koreans · ☆25 · Sep 25, 2023 · Updated 2 years ago
- [AAAI 2025] Official Implementation of I-HallA v1.0 · ☆13 · Feb 2, 2025 · Updated last year
- Claude skill for finding ML research papers. · ☆207 · Apr 14, 2026 · Updated 2 weeks ago
- ☆11 · Jul 26, 2024 · Updated last year
- ☆16 · Sep 11, 2025 · Updated 7 months ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation · ☆94 · Jun 24, 2024 · Updated last year
- ☆11 · Aug 7, 2024 · Updated last year
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs · ☆184 · Dec 14, 2025 · Updated 4 months ago
- ☆67 · Jan 4, 2026 · Updated 3 months ago
- Causal inference with theory, practice, and hands-on application · ☆27 · Aug 31, 2025 · Updated 8 months ago
- Object counting and instance segmentation with image-level supervision, in CVPR 2019 · ☆12 · May 9, 2019 · Updated 6 years ago
- pdfChain: (experimental) blockchain for the masses · ☆16 · Feb 14, 2026 · Updated 2 months ago
- Adaptive Multimodal Reasoning via Reinforcement Learning · ☆23 · Jan 11, 2026 · Updated 3 months ago
- Official Pytorch Implementation of Unsupervised Image Denoising With Frequency Domain Knowledge (BMVC2021 Oral Accepted Paper) · ☆24 · Mar 15, 2022 · Updated 4 years ago