DAGroup-PKU/MHLA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DAGroup-PKU/MHLA)

DAGroup-PKU / MHLA

[ICLR 2026🔥] MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

☆149

Alternatives and similar repositories for MHLA

Users that are interested in MHLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lihongcs / LLM_Inception
View on GitHub
[ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".
☆13Jan 25, 2025Updated last year
HKU-MMLab / EVATok
View on GitHub
[CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"
☆59Mar 13, 2026Updated 3 months ago
AIM-Research-Lab / Medical-SAM3
View on GitHub
Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation
☆188Jun 21, 2026Updated 2 weeks ago
GAIR-NLP / daVinci-Agency
View on GitHub
daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently
☆38Feb 4, 2026Updated 5 months ago
RobinWitch / MECo
View on GitHub
Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…
☆32Mar 29, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Go2Heart / OmniStream
View on GitHub
OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams
☆108Mar 15, 2026Updated 3 months ago
Book15011 / GVHMR2PBHC
View on GitHub
A specialized motion processing pipeline that converts GVHMR's SMPL outputs (.pt) into retargeted PBHC-compatible motions (.pkl), featuri…
☆30Mar 19, 2026Updated 3 months ago
0nandon / EmbodiedSplat
View on GitHub
[CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"
☆103Jun 28, 2026Updated last week
Inso-13 / ArtHOI
View on GitHub
[ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".
☆40Mar 5, 2026Updated 4 months ago
yuezhouhu / residual-context-diffusion
View on GitHub
[ICML 2026] Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.
☆58Jun 28, 2026Updated last week
songtianhui / SimpleSeg
View on GitHub
Towards Pixel-Level VLM Perception via Simple Points Prediction
☆105Feb 9, 2026Updated 5 months ago
InternRobotics / M3
View on GitHub
M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM
☆82Mar 18, 2026Updated 3 months ago
TQTQliu / Light-X
View on GitHub
[ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control
☆188Dec 11, 2025Updated 6 months ago
teqkilla / RubricHub
View on GitHub
TBD
☆63Mar 13, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LeapLabTHU / Circulant-Attention
View on GitHub
[AAAI 2026] Official repository of Circulant Attention
☆60Jun 26, 2026Updated last week
ypwang61 / StoryEval
View on GitHub
[CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation
☆20May 2, 2025Updated last year
MurrayTom / ToolSafe
View on GitHub
Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…
☆69Mar 25, 2026Updated 3 months ago
AIGeeksGroup / UniMesh
View on GitHub
UniMesh: Unifying 3D Mesh Understanding and Generation
☆57May 8, 2026Updated 2 months ago
showlab / FocusUI
View on GitHub
[CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
☆35Jun 7, 2026Updated last month
jiwoogit / SeaCache
View on GitHub
[CVPR 2026 Oral, Best Paper Finalist] SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models
☆83Jun 29, 2026Updated last week
VINHYU / OpenSpatial
View on GitHub
☆88May 8, 2026Updated 2 months ago
Benjamin-Walker / selective-ssms-and-linear-cdes
View on GitHub
Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)
☆17Jan 7, 2025Updated last year
ChuanyangZheng / L2ViT
View on GitHub
Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer
☆15Sep 7, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aim-uofa / EvoTokenDLM
View on GitHub
[ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)
☆48Apr 7, 2026Updated 3 months ago
HerzogFL / World-Craft
View on GitHub
☆56Jan 30, 2026Updated 5 months ago
microsoft / InfoAgent
View on GitHub
☆69Feb 6, 2026Updated 5 months ago
applese233 / ICRL
View on GitHub
In-Context Reinforcement Learning for Tool Use in Large Language Models
☆49Mar 26, 2026Updated 3 months ago
KlingAIResearch / RoboMaster
View on GitHub
[ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
☆107Feb 8, 2026Updated 5 months ago
ShadowBbBb / Depthor
View on GitHub
☆29Aug 1, 2025Updated 11 months ago
snap-research / VIMI
View on GitHub
☆13Jul 10, 2024Updated last year
Invalid-Syntax-NSCSCC / invalid-cpu
View on GitHub
CPU source code for NSCSCC 2023
☆14Aug 26, 2023Updated 2 years ago
sisrformer / GRFormer
View on GitHub
the code of GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution
☆26May 16, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Baisonm-Li / HSR-KAN
View on GitHub
The code of "HSR-KAN: Hyperspectral Image Super-Resolution based on Kolmogorov-Arnold Networks"
☆25Sep 15, 2024Updated last year
chengzhag / UCPE
View on GitHub
📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!
☆202May 15, 2026Updated last month
weixi-feng / TC-Bench
View on GitHub
☆27Jun 22, 2024Updated 2 years ago
SingleZombie / LLSA
View on GitHub
[CVPR 2026 Highlight] Official implementation of Log-linear Sparse Attention (LLSA).
☆86May 1, 2026Updated 2 months ago
ByteDance-Seed / SimFlow
View on GitHub
Official implementation of SimFlow
☆32Dec 16, 2025Updated 6 months ago
BrianChen1120 / RL-AWB
View on GitHub
☆37Jun 19, 2026Updated 2 weeks ago
thu-ml / SLA
View on GitHub
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention
☆318Feb 24, 2026Updated 4 months ago