LeapLabTHU / InLineLinks

Official repository of InLine attention (NeurIPS 2024)

☆56

Alternatives and similar repositories for InLine

Users that are interested in InLine are comparing it to the libraries listed below

Sorting:

qhfan / RALA
[CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention
☆37Updated 8 months ago
EasonXiao-888 / MambaTree
[NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model
☆102Updated last year
LeapLabTHU / Attention-Mediators
[ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
☆46Updated last year
OliverRensu / ARM
[ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision
☆87Updated 6 months ago
ChenhongyiYang / PlainMamba
[BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition
☆87Updated 8 months ago
liuting20 / Sparse-Tuning
☆30Updated last year
LeapLabTHU / AdaNAT
[ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
☆34Updated last year
ysj9909 / FFNet
FFNet: MetaMixer-based Efficient Convolutional Mixer Design
☆31Updated 8 months ago
AILab-CVC / M2PT
[CVPR 2024] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
☆100Updated last year
LeapLabTHU / LAUDNet
[IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition
☆52Updated 8 months ago
Adlith / MoE-Jetpack
[NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
☆132Updated last year
wangf3014 / Mamba-Reg
☆78Updated 9 months ago
rayleizhu / GLMix
[NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".
☆42Updated 10 months ago
NUS-HPC-AI-Lab / SGL
☆29Updated 9 months ago
OpenGVLab / Mono-InternVL
[CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
☆96Updated 4 months ago
csguoh / FastVAR
[ICCV2025]Generate one 2K image on single 3090 GPU!
☆78Updated 3 months ago
ZacharyMeng / PolaFormer
Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)
☆80Updated last month
OpenGVLab / PVC
[CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
☆49Updated 5 months ago
LeapLabTHU / ImprovedNAT
A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"
☆46Updated last year
czg1225 / CoDe
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆107Updated 2 months ago
YouHuang67 / mamba-code-explained
☆18Updated last year
AIoT-MLSys-Lab / Famba-V
[ECCV 2024 Workshop Best Paper Award] Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion
☆34Updated last year
w1oves / hqclip
[ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets
☆57Updated 4 months ago
OpenGVLab / De-focus-Attention-Networks
Learning 1D Causal Visual Representation with De-focus Attention Networks
☆35Updated last year
techmonsterwang / iLLaMA
Adapting LLaMA Decoder to Vision Transformer
☆30Updated last year
yu-rp / Dimple
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆112Updated 5 months ago
LeapLabTHU / Uni-AdaFocus
Official repository of Uni-AdaFocus (TPAMI 2024).
☆54Updated 11 months ago
hustvl / ViG
[AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention
☆116Updated last year
yongliu20 / SCAN
[CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"
☆75Updated last year
LeapLabTHU / EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…
☆225Updated last year