[CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
☆17Oct 4, 2025Updated 6 months ago
Alternatives and similar repositories for LocalizationHeads
Users that are interested in LocalizationHeads are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [MICCAI 2024 Spotlight✨] Official Pytorch Code for Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning☆12Sep 4, 2024Updated last year
- [WACV 2025 ORAL] Official Pytorch Code for DragText: Rethinking Text Embedding in Point-based Image Editing☆14Jan 22, 2025Updated last year
- [ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segm…☆25Apr 3, 2025Updated last year
- ☆15Aug 28, 2024Updated last year
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"☆92Mar 9, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation☆24Jul 12, 2022Updated 3 years ago
- [MICCAI 2024 Early Acceptance] Official Pytorch Code for Slice-Consistent 3D Volumetric Brain CT-to-MRI Translation with 2D Brownian Brid…☆59Jan 7, 2025Updated last year
- [CVPR 2024 Highlight✨] Official Pytorch Code for EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation☆92Sep 12, 2024Updated last year
- Implementation of various image-to-image translation models for photoacoustic imaging reconstruction.☆14Jan 15, 2026Updated 2 months ago
- [NAACL 2025] Official Code Repository for the paper "Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval"☆21Jul 13, 2025Updated 8 months ago
- [WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval☆13Sep 18, 2025Updated 6 months ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models☆104Feb 16, 2025Updated last year
- [CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation☆50Mar 27, 2025Updated last year
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models☆19Mar 25, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆14Apr 25, 2025Updated 11 months ago
- [AAAI 2024] SVDP: Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction☆33Apr 26, 2024Updated last year
- The Yahoo Finance Agent is an application that combines OpenAI's LLMs, the Yahoo Finance Python library, and LangChain's tools to provide…☆27Aug 10, 2024Updated last year
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆71Jan 13, 2026Updated 2 months ago
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆67Aug 31, 2025Updated 7 months ago
- Java web application backed by the Ethereum-Blockchain network. Powered by RESTful web services (JAX-RS && Spring Boot) , Docker, Kuberne…☆14Feb 19, 2019Updated 7 years ago
- [ICLR 2025] TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval☆26Feb 13, 2025Updated last year
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆41Oct 2, 2022Updated 3 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Rare-to-Frequent (R2F), ICLR'25, Spotlight☆53Apr 23, 2025Updated 11 months ago
- ☆11Oct 13, 2024Updated last year
- [AAAI 2025] Official Implementation of I-HallA v1.0☆13Feb 2, 2025Updated last year
- For the rlhf learning environment of Koreans☆25Sep 25, 2023Updated 2 years ago
- ☆11Jul 26, 2024Updated last year
- 🦀 An immediate-mode Rust TUI framework with flexbox layout and Tailwind-style chaining API.☆81Updated this week
- ☆16Sep 11, 2025Updated 7 months ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆94Jun 24, 2024Updated last year
- ☆11Aug 7, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs☆183Dec 14, 2025Updated 3 months ago
- ☆66Jan 4, 2026Updated 3 months ago
- Object counting and instance segmentation with image-level supervision, in CVPR 2019☆12May 9, 2019Updated 6 years ago
- 이론, 실무, 실전을 곁들인 인과추론☆27Aug 31, 2025Updated 7 months ago
- pdfChain: (experimental) blockchain for the masses☆16Feb 14, 2026Updated last month
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 3 months ago
- Official Pytorch Implementation of Unsupervised Image Denoising With Frequency Domain Knowledge (BMVC2021 Oral Accepted Paper)☆24Mar 15, 2022Updated 4 years ago