[ICLR 2026🔥] MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head
☆149May 19, 2026Updated last week
Alternatives and similar repositories for MHLA
Users that are interested in MHLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆59Mar 13, 2026Updated 2 months ago
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆50Mar 24, 2026Updated 2 months ago
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation☆165Jan 20, 2026Updated 4 months ago
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆86May 21, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆39Feb 4, 2026Updated 3 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams☆100Mar 15, 2026Updated 2 months ago
- Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…☆30Mar 29, 2026Updated 2 months ago
- A specialized motion processing pipeline that converts GVHMR's SMPL outputs (.pt) into retargeted PBHC-compatible motions (.pkl), featuri…☆29Mar 19, 2026Updated 2 months ago
- [ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".☆36Mar 5, 2026Updated 2 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated 2 months ago
- Towards Pixel-Level VLM Perception via Simple Points Prediction☆104Feb 9, 2026Updated 3 months ago
- M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM☆72Mar 18, 2026Updated 2 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]☆86May 8, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [AAAI 2026] Official repository of Circulant Attention☆58Jan 12, 2026Updated 4 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆20May 2, 2025Updated last year
- X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests☆56Feb 28, 2026Updated 3 months ago
- ☆16Mar 25, 2024Updated 2 years ago
- UniMesh: Unifying 3D Mesh Understanding and Generation☆56May 8, 2026Updated 3 weeks ago
- ☆83May 8, 2026Updated 3 weeks ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- [ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)☆48Apr 7, 2026Updated last month
- ☆69Feb 6, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆55Jan 30, 2026Updated 4 months ago
- [IJCAI 2025] Offical implementation of the paper "Multi-View Learning with Context-Guided Receptance for Image Denoising".☆12Jun 26, 2025Updated 11 months ago
- ☆13Jul 10, 2024Updated last year
- CPU source code for NSCSCC 2023☆14Aug 26, 2023Updated 2 years ago
- The code of "HSR-KAN: Hyperspectral Image Super-Resolution based on Kolmogorov-Arnold Networks"☆25Sep 15, 2024Updated last year
- ☆27Jun 22, 2024Updated last year
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆167May 15, 2026Updated 2 weeks ago
- The code for paper "Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry", acc…☆144Feb 3, 2026Updated 3 months ago
- ☆36Jan 30, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of SimFlow☆31Dec 16, 2025Updated 5 months ago
- ☆10Aug 29, 2024Updated last year
- [ICML 2026] The official implementation of paper "Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation…☆72Updated this week
- SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention☆308Feb 24, 2026Updated 3 months ago
- ☆43Feb 26, 2026Updated 3 months ago
- Official repo for paper "HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies"☆32Dec 12, 2025Updated 5 months ago
- PyTorch reimplementation of Noise2Same with enhancements☆11Mar 6, 2026Updated 2 months ago