[ICLR 2026 🔥] MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head
★151 · May 19, 2026 · Updated last month
Alternatives and similar repositories for MHLA
Users that are interested in MHLA are comparing it to the libraries listed below.
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs". ★13 · Jan 25, 2025 · Updated last year
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation" ★60 · Mar 13, 2026 · Updated 3 months ago
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment ★53 · Mar 24, 2026 · Updated 2 months ago
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation ★175 · Updated this week
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding" ★92 · Updated this week
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently ★39 · Feb 4, 2026 · Updated 4 months ago
- Official Implementation of the Paper: Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20… ★32 · Mar 29, 2026 · Updated 2 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams ★107 · Mar 15, 2026 · Updated 3 months ago
- [ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors". ★38 · Mar 5, 2026 · Updated 3 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs. ★57 · Mar 12, 2026 · Updated 3 months ago
- Towards Pixel-Level VLM Perception via Simple Points Prediction ★104 · Feb 9, 2026 · Updated 4 months ago
- M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM ★76 · Mar 18, 2026 · Updated 3 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026] ★90 · May 8, 2026 · Updated last month
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control ★187 · Dec 11, 2025 · Updated 6 months ago
- TBD ★60 · Mar 13, 2026 · Updated 3 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation ★20 · May 2, 2025 · Updated last year
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac… ★63 · Mar 25, 2026 · Updated 2 months ago
- ★16 · Mar 25, 2024 · Updated 2 years ago
- UniMesh: Unifying 3D Mesh Understanding and Generation ★57 · May 8, 2026 · Updated last month
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection ★35 · Jun 7, 2026 · Updated last week
- ★88 · May 8, 2026 · Updated last month
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024) ★16 · Jan 7, 2025 · Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer ★15 · Sep 7, 2024 · Updated last year
- [ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language) ★48 · Apr 7, 2026 · Updated 2 months ago
- ★70 · Feb 6, 2026 · Updated 4 months ago
- [IJCAI 2025] Official implementation of the paper "Multi-View Learning with Context-Guided Receptance for Image Denoising". ★13 · Jun 26, 2025 · Updated 11 months ago
- [ICLR'26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control ★108 · Feb 8, 2026 · Updated 4 months ago
- the code of GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution ★26 · May 16, 2024 · Updated 2 years ago
- The code of "HSR-KAN: Hyperspectral Image Super-Resolution based on Kolmogorov-Arnold Networks" ★25 · Sep 15, 2024 · Updated last year
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control! ★186 · May 15, 2026 · Updated last month
- ★27 · Jun 22, 2024 · Updated last year
- [CVPR 2026 Highlight] Official implementation of Log-linear Sparse Attention (LLSA). ★86 · May 1, 2026 · Updated last month
- Official implementation of SimFlow ★32 · Dec 16, 2025 · Updated 6 months ago
- ★10 · Aug 29, 2024 · Updated last year
- ★37 · Jan 30, 2026 · Updated 4 months ago
- ★44 · Updated this week
- The code for paper "Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry", acc… ★217 · Feb 3, 2026 · Updated 4 months ago
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies" ★33 · Dec 12, 2025 · Updated 6 months ago
- PyTorch reimplementation of Noise2Same with enhancements ★12 · Mar 6, 2026 · Updated 3 months ago