MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)
☆141 · Feb 6, 2026 · Updated 2 months ago
Alternatives and similar repositories for MHLA
Users that are interested in MHLA are comparing it to the libraries listed below.
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment ☆36 · Mar 24, 2026 · Updated 3 weeks ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs". ☆13 · Jan 25, 2025 · Updated last year
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding" ☆67 · Mar 7, 2026 · Updated last month
- A specialized motion processing pipeline that converts GVHMR's SMPL outputs (.pt) into retargeted PBHC-compatible motions (.pkl), featuri… ☆27 · Mar 19, 2026 · Updated 3 weeks ago
- [ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors". ☆31 · Mar 5, 2026 · Updated last month
- Towards Pixel-Level VLM Perception via Simple Points Prediction ☆104 · Feb 9, 2026 · Updated 2 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026] ☆69 · Updated this week
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac… ☆54 · Mar 25, 2026 · Updated 3 weeks ago
- TBD ☆53 · Mar 13, 2026 · Updated last month
- [AAAI 2026] Official repository of Circulant Attention ☆47 · Jan 12, 2026 · Updated 3 months ago
- X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests ☆63 · Feb 28, 2026 · Updated last month
- ☆16 · Mar 25, 2024 · Updated 2 years ago
- In-Context Reinforcement Learning for Tool Use in Large Language Models ☆44 · Mar 26, 2026 · Updated 3 weeks ago
- ☆55 · Apr 9, 2026 · Updated last week
- [ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language) ☆46 · Apr 7, 2026 · Updated last week
- Open Ended Medical Reinforcement Learning ☆45 · Mar 15, 2026 · Updated last month
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024) ☆16 · Jan 7, 2025 · Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer ☆16 · Sep 7, 2024 · Updated last year
- ☆54 · Jan 30, 2026 · Updated 2 months ago
- [IJCAI 2025] Official implementation of the paper "Multi-View Learning with Context-Guided Receptance for Image Denoising". ☆12 · Jun 26, 2025 · Updated 9 months ago
- ☆13 · Jul 10, 2024 · Updated last year
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control! ☆151 · Apr 12, 2026 · Updated last week
- the code of GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution ☆26 · May 16, 2024 · Updated last year
- CPU source code for NSCSCC 2023 ☆14 · Aug 26, 2023 · Updated 2 years ago
- The code of "HSR-KAN: Hyperspectral Image Super-Resolution based on Kolmogorov-Arnold Networks" ☆25 · Sep 15, 2024 · Updated last year
- Official implementation of Log-linear Sparse Attention (LLSA). ☆70 · Feb 2, 2026 · Updated 2 months ago
- ☆27 · Jun 22, 2024 · Updated last year
- ☆35 · Jan 30, 2026 · Updated 2 months ago
- ☆43 · Mar 23, 2026 · Updated 3 weeks ago
- MegaFlow: Zero-Shot Large Displacement Optical Flow ☆104 · Mar 28, 2026 · Updated 3 weeks ago
- Official implementation of SimFlow ☆31 · Dec 16, 2025 · Updated 4 months ago
- ☆10 · Aug 29, 2024 · Updated last year
- SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention ☆295 · Feb 24, 2026 · Updated last month
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies" ☆28 · Dec 12, 2025 · Updated 4 months ago
- ☆11 · Aug 13, 2024 · Updated last year
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning" ☆62 · Jan 28, 2026 · Updated 2 months ago
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5 ☆53 · Jan 18, 2026 · Updated 3 months ago
- A unified robotic manipulation learning framework ☆21 · Sep 4, 2025 · Updated 7 months ago
- Code for ICLR 2024 paper "Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection" ☆17 · Apr 20, 2024 · Updated last year