UMass-Embodied-AGI/FlexAttention

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UMass-Embodied-AGI/FlexAttention)

UMass-Embodied-AGI / FlexAttention

[ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models

☆49

Alternatives and similar repositories for FlexAttention

Users that are interested in FlexAttention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hasanar1f / HiRED
View on GitHub
[AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…
☆58Apr 18, 2025Updated last year
THUNLP-MT / ActiView
View on GitHub
☆11Dec 20, 2024Updated last year
Z-Zheng / dynamic_highres_poverty
View on GitHub
Dynamic, high-resolution poverty measurement in data-scarce environments
☆11Dec 8, 2024Updated last year
EricWWWW / image-caption-metrics
View on GitHub
a py3 lib for NLP & image-caption metrics : BLEU METEOR CIDEr ROUGE SPICE WMD
☆14Sep 13, 2022Updated 3 years ago
Luo-Z13 / GLH-Bridge-page
View on GitHub
[TPAMI2024] Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery
☆15Mar 18, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
VisionXLab / Moment-Video
View on GitHub
☆18Jun 2, 2026Updated last month
liuting20 / DARA
View on GitHub
[ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
☆22Feb 26, 2025Updated last year
haoyu-bu / CAFe
View on GitHub
Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"
☆33Mar 26, 2025Updated last year
saibr / hypvl
View on GitHub
This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…
☆21Jul 5, 2024Updated 2 years ago
isaaccorley / landsatbench
View on GitHub
Landsat-Bench: Datasets and Benchmarks for Landsat Foundation Models
☆20Jun 18, 2025Updated last year
zhaoxuhui / downloadGoogleMap
View on GitHub
☆18Jul 16, 2019Updated 7 years ago
alipay / POA
View on GitHub
☆22Aug 8, 2024Updated last year
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
pufanyi / syphus
View on GitHub
Syphus: Automatic Instruction-Response Generation Pipeline
☆14Dec 14, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yfzhang114 / SliME
View on GitHub
✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
☆163Dec 26, 2024Updated last year
MiliLab / Text-Before-Vision
View on GitHub
[ICML 2026] Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding
☆16Mar 13, 2026Updated 4 months ago
42Shawn / LLaVA-PruMerge
View on GitHub
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
☆173Mar 8, 2026Updated 4 months ago
NMS05 / Patch-Aligned-Contrastive-Learning
View on GitHub
☆24Jul 8, 2023Updated 3 years ago
harrylin-hyl / SGROD
View on GitHub
☆14Sep 6, 2024Updated last year
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
earth-insights / awesome-layout-to-image
View on GitHub
An up-to-date & curated list of awesome layout to image papers, methods & resources.
☆13Jun 28, 2024Updated 2 years ago
vvangfaye / HoliTracer
View on GitHub
[ICCV25] Official implementation of the paper HoliTracer.
☆48Apr 7, 2026Updated 3 months ago
Lil-Shake / VA-Pi
View on GitHub
[CVPR 2026] This repository is the code of our paper "VA-Pi: Variational Policy Alignment for Pixel-Aware Autoregressive Generation"
☆15Mar 3, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zhaohengyuan1 / Genixer
View on GitHub
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
☆116Mar 21, 2025Updated last year
chenwei746 / EEVG
View on GitHub
☆23Aug 20, 2024Updated last year
LINs-lab / LIE
View on GitHub
[preprint] Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
☆19Feb 18, 2026Updated 5 months ago
rstanjieyi / GeoAI-in-NeurIPS-2024
View on GitHub
A collection of papers related to Geo-spatial Information Science in NeurIPS 2024.
☆56Jan 5, 2025Updated last year
adobe-research / llava-score
View on GitHub
☆11Oct 2, 2024Updated last year
lx709 / VRSBench
View on GitHub
☆69Jun 11, 2026Updated last month
EvolvingLMMs-Lab / LLaVA-OneVision-1.5-RL
View on GitHub
Fully Open Framework for Democratized Multimodal Reinforcement Learning.
☆51Dec 19, 2025Updated 7 months ago
kai422 / SCALE
View on GitHub
[ICLR 2024] Scaling for Training Time and Post-hoc Out-of-distribution Detection Enhancement.
☆15Mar 12, 2024Updated 2 years ago
LauraChow77 / GlobalUrbanMapper
View on GitHub
☆29Apr 23, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Yxxxb / VoCo-LLaMA
View on GitHub
[CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".
☆205Jun 18, 2025Updated last year
tiiuae / siglino
View on GitHub
AMoE: Agglomerative Mixture-of-Experts Vision Foundation Models
☆53Jun 11, 2026Updated last month
AILab-CVC / VL-GPT
View on GitHub
VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation
☆86Sep 12, 2024Updated last year
yliu1229 / AlignSeg
View on GitHub
The PyTorch implementation of AlignSeg.
☆21Feb 26, 2025Updated last year
gastruc / OmniSat
View on GitHub
☆98Oct 24, 2024Updated last year
jiaohuix / nmt_data_tools
View on GitHub
machine translation data process tools
☆10Apr 29, 2024Updated 2 years ago
yyh-rain-song / ReMamber
View on GitHub
ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.
☆46Jul 11, 2024Updated 2 years ago