MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)
☆136Feb 6, 2026Updated last month
Alternatives and similar repositories for MHLA
Users that are interested in MHLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆55Mar 15, 2026Updated 2 weeks ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆52Mar 13, 2026Updated 2 weeks ago
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation☆115Jan 20, 2026Updated 2 months ago
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆62Mar 7, 2026Updated 3 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…☆25Dec 18, 2025Updated 3 months ago
- A specialized motion processing pipeline that converts GVHMR's SMPL outputs (.pt) into retargeted PBHC-compatible motions (.pkl), featuri…☆27Mar 19, 2026Updated last week
- Towards Pixel-Level VLM Perception via Simple Points Prediction☆100Feb 9, 2026Updated last month
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆58Mar 12, 2026Updated 2 weeks ago
- TBD☆51Mar 13, 2026Updated 2 weeks ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding"☆63Updated this week
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆46Updated this week
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆173Dec 11, 2025Updated 3 months ago
- X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests☆83Feb 28, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆16Mar 25, 2024Updated 2 years ago
- Open Ended Medical Reinforcement Learning☆39Mar 15, 2026Updated 2 weeks ago
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆40Updated this week
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆27Feb 10, 2026Updated last month
- ☆63Feb 6, 2026Updated last month
- ☆58Jan 30, 2026Updated 2 months ago
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆138Mar 19, 2026Updated last week
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆104Feb 8, 2026Updated last month
- [IJCAI 2025] Offical implementation of the paper "Multi-View Learning with Context-Guided Receptance for Image Denoising".☆12Jun 26, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆13Jul 10, 2024Updated last year
- Official implementation of Log-linear Sparse Attention (LLSA).☆64Feb 2, 2026Updated last month
- the code of GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution☆26May 16, 2024Updated last year
- The code of "HSR-KAN: Hyperspectral Image Super-Resolution based on Kolmogorov-Arnold Networks"☆25Sep 15, 2024Updated last year
- ☆27Jun 22, 2024Updated last year
- official implementation of [PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning, ICCV'25]☆35Oct 31, 2025Updated 4 months ago
- Official implementation of SimFlow☆28Dec 16, 2025Updated 3 months ago
- ☆10Aug 29, 2024Updated last year
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"☆28Dec 12, 2025Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- PyTorch reimplementation of Noise2Same with enhancements☆11Mar 6, 2026Updated 3 weeks ago
- Official repository of Siggraph Asia 2025 paper "LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representa…☆26Dec 24, 2025Updated 3 months ago
- "Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning"☆60Jan 28, 2026Updated 2 months ago
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 2 months ago
- ☆13Feb 28, 2025Updated last year
- ☆11Aug 13, 2024Updated last year
- The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".☆114Feb 12, 2026Updated last month