[ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models
☆46Jan 8, 2025Updated last year
Alternatives and similar repositories for FlexAttention
Users that are interested in FlexAttention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Dynamic, high-resolution poverty measurement in data-scarce environments☆10Dec 8, 2024Updated last year
- Landsat-Bench: Datasets and Benchmarks for Landsat Foundation Models☆18Jun 18, 2025Updated 9 months ago
- Official implementation of the RSE paper mKGR.☆20Jan 15, 2026Updated 2 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- a py3 lib for NLP & image-caption metrics : BLEU METEOR CIDEr ROUGE SPICE WMD☆14Sep 13, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…☆22Jul 5, 2024Updated last year
- Retrieval-augmented Image Captioning☆13Feb 16, 2023Updated 3 years ago
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models☆165Mar 8, 2026Updated 3 weeks ago
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆11May 26, 2024Updated last year
- ✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models☆164Dec 26, 2024Updated last year
- ☆18Jul 16, 2019Updated 6 years ago
- Streaming Video Instruction Tuning☆65Feb 25, 2026Updated last month
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- CVPR25☆27Jul 2, 2025Updated 8 months ago
- KTCN: Enhancing Open-World Object Detection with Knowledge Tansfer and Class-Awareness Neutralization (IJCAI 24)☆11Aug 13, 2024Updated last year
- ☆29Sep 20, 2025Updated 6 months ago
- DOFA-CLIP: Multimodal Vision –Language Foundation Models for Earth Observation☆38Jul 30, 2025Updated 8 months ago
- [ECCV 2024 Workshop🎈] The first agriculture benchmark to evaluate MM-LLMs.☆24Jan 1, 2025Updated last year
- ☆23Aug 20, 2024Updated last year
- ☆14Sep 6, 2024Updated last year
- An up-to-date & curated list of awesome layout to image papers, methods & resources.☆13Jun 28, 2024Updated last year
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- PyTorch Implementation of "Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Larg…☆43Mar 2, 2026Updated 3 weeks ago
- LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs☆417Dec 20, 2025Updated 3 months ago
- [ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.☆15Mar 12, 2024Updated 2 years ago
- A collection of papers related to Geo-spatial Information Science in NeurIPS 2024.☆56Jan 5, 2025Updated last year
- Masked Angle-Aware Autoencoder for Remote Sensing Images (ECCV 2024)☆28Nov 12, 2024Updated last year
- CPU Memory Compiler and Parallel programing☆26Nov 18, 2024Updated last year
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".☆204Jun 18, 2025Updated 9 months ago
- ☆66Mar 22, 2026Updated last week
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆45Jul 11, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A curated list of few-shot segmentation / few shot semantic segmentation / few shot image segmentation in remote sensing imagery.☆29Jun 25, 2024Updated last year
- ☆11Oct 2, 2024Updated last year
- ☆29Apr 23, 2025Updated 11 months ago
- The PyTorch implementation of AlignSeg.☆21Feb 26, 2025Updated last year
- VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation☆86Sep 12, 2024Updated last year
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆21Updated this week
- [CVPR 2025] Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation☆32Jun 27, 2025Updated 9 months ago