LeapLabTHU / Attention-MediatorsLinks

[ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

☆46

Alternatives and similar repositories for Attention-Mediators

Users that are interested in Attention-Mediators are comparing it to the libraries listed below

Sorting:

LeapLabTHU / AdaNAT
[ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
☆34Updated last year
LeapLabTHU / ENAT
[NeurIPS 2024] ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
☆24Updated last year
LeapLabTHU / ViTTT
Official repository of Vision Test-Time Training
☆39Updated this week
LeapLabTHU / Dynamic_Perceiver
Official implementation of Dynamic Perceiver
☆43Updated 2 years ago
LeapLabTHU / ImprovedNAT
A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"
☆46Updated last year
LeapLabTHU / AdaptiveNN
[Nature Machine Intelligence 2025] Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception
☆106Updated 2 weeks ago
LeapLabTHU / InLine
Official repository of InLine attention (NeurIPS 2024)
☆56Updated 11 months ago
LeapLabTHU / LAUDNet
[IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition
☆52Updated 8 months ago
LeapLabTHU / CODA
CODA: Repurposing Continuous VAEs for Discrete Tokenization
☆34Updated 5 months ago
LeapLabTHU / LASNet
[NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks
☆24Updated 2 years ago
LeapLabTHU / UniTTA
☆18Updated 9 months ago
LeapLabTHU / Uni-AdaFocus
Official repository of Uni-AdaFocus (TPAMI 2024).
☆54Updated 11 months ago
LeapLabTHU / CheckpointKD
☆27Updated 3 years ago
LeapLabTHU / SimPro
[ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learning
☆31Updated last year
techmonsterwang / iLLaMA
Adapting LLaMA Decoder to Vision Transformer
☆30Updated last year
NUS-HPC-AI-Lab / SGL
☆29Updated 9 months ago
LeapLabTHU / LearnableISDA
[IEEE TIP] Fine-grained Recognition with Learnable Semantic Data Augmentation
☆30Updated last year
NUS-HPC-AI-Lab / Dynamic-Tuning
The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"
☆51Updated 11 months ago
OliverRensu / ARM
[ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision
☆87Updated 6 months ago
LeapLabTHU / Deep-Incubation
Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)
☆90Updated 2 years ago
OpenGVLab / De-focus-Attention-Networks
Learning 1D Causal Visual Representation with De-focus Attention Networks
☆35Updated last year
OpenGVLab / Mono-InternVL
[CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
☆96Updated 4 months ago
LeapLabTHU / diver-ct
☆13Updated 11 months ago
SHI-Labs / IMG-Multimodal-Diffusion-Alignment
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025
☆28Updated 2 months ago
yu-rp / Dimple
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆112Updated 5 months ago
LeapLabTHU / DAT-Jittor
Jittor implementation of Vision Transformer with Deformable Attention
☆31Updated 3 years ago
BIT-DA / ABS
[ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection
☆24Updated 5 months ago
Adlith / MoE-Jetpack
[NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks
☆132Updated last year
Martinser / REG
[NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think
☆198Updated 2 months ago
Haochen-Wang409 / ross
[ICLR'25] Reconstructive Visual Instruction Tuning
☆129Updated 8 months ago