MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)
☆144Apr 17, 2026Updated 3 weeks ago
Alternatives and similar repositories for MHLA
Users that are interested in MHLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆43Mar 24, 2026Updated last month
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆57Mar 13, 2026Updated last month
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation☆159Jan 20, 2026Updated 3 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆91Mar 15, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆80Mar 7, 2026Updated 2 months ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆39Feb 4, 2026Updated 3 months ago
- Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…☆29Mar 29, 2026Updated last month
- A specialized motion processing pipeline that converts GVHMR's SMPL outputs (.pt) into retargeted PBHC-compatible motions (.pkl), featuri…☆28Mar 19, 2026Updated last month
- [ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".☆33Mar 5, 2026Updated 2 months ago
- Towards Pixel-Level VLM Perception via Simple Points Prediction☆104Feb 9, 2026Updated 3 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]☆83Updated this week
- [AAAI 2026] Official repository of Circulant Attention☆52Jan 12, 2026Updated 3 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆20May 2, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests☆64Feb 28, 2026Updated 2 months ago
- ☆16Mar 25, 2024Updated 2 years ago
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆46Mar 26, 2026Updated last month
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆32Feb 10, 2026Updated 2 months ago
- ☆77Updated this week
- UniMesh: Unifying 3D Mesh Understanding and Generation☆54Apr 29, 2026Updated last week
- [ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)☆48Apr 7, 2026Updated last month
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆16Jan 7, 2025Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆53Jan 30, 2026Updated 3 months ago
- Open Ended Medical Reinforcement Learning☆52Mar 15, 2026Updated last month
- [IJCAI 2025] Offical implementation of the paper "Multi-View Learning with Context-Guided Receptance for Image Denoising".☆12Jun 26, 2025Updated 10 months ago
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆105Feb 8, 2026Updated 3 months ago
- ☆29Aug 1, 2025Updated 9 months ago
- ☆13Jul 10, 2024Updated last year
- the code of GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution☆26May 16, 2024Updated last year
- CPU source code for NSCSCC 2023☆14Aug 26, 2023Updated 2 years ago
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆163Apr 27, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The code of "HSR-KAN: Hyperspectral Image Super-Resolution based on Kolmogorov-Arnold Networks"☆25Sep 15, 2024Updated last year
- ☆27Jun 22, 2024Updated last year
- official implementation of [PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning, ICCV'25]☆35Oct 31, 2025Updated 6 months ago
- ☆36Jan 30, 2026Updated 3 months ago
- ☆10Aug 29, 2024Updated last year
- SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention☆305Feb 24, 2026Updated 2 months ago
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"☆32Dec 12, 2025Updated 4 months ago