MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head (ICLR 2026)
☆125Feb 6, 2026Updated last month
Alternatives and similar repositories for MHLA
Users that are interested in MHLA are comparing it to the libraries listed below
Sorting:
- [CVPR 2026] Official code of "EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding"☆38Updated this week
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆30Feb 24, 2026Updated last week
- ☆51Updated this week
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated 3 weeks ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆167Dec 11, 2025Updated 2 months ago
- Official Implementation of the Paper:Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20…☆24Dec 18, 2025Updated 2 months ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆33Feb 4, 2026Updated last month
- A specialized motion processing pipeline that converts GVHMR's SMPL outputs (.pt) into retargeted PBHC-compatible motions (.pkl), featuri…☆26Aug 19, 2025Updated 6 months ago
- ☆13Jul 10, 2024Updated last year
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆17May 21, 2025Updated 9 months ago
- ☆11Aug 13, 2024Updated last year
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation☆99Jan 20, 2026Updated last month
- ☆45Feb 25, 2026Updated last week
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding"☆59Jan 23, 2026Updated last month
- Towards Pixel-Level VLM Perception via Simple Points Prediction☆93Feb 9, 2026Updated last month
- ☆16Mar 25, 2024Updated last year
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆19Sep 22, 2025Updated 5 months ago
- A unified robotic manipulation learning framework☆21Sep 4, 2025Updated 6 months ago
- [CVPR2025] Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation☆18May 2, 2025Updated 10 months ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆32Dec 13, 2025Updated 2 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆55Aug 16, 2025Updated 6 months ago
- [ICLR 26] Part-X-MLLM: Part-aware 3D Multimodal Large Language Model☆111Jan 26, 2026Updated last month
- 📷 [CVPR'26] Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆126Feb 21, 2026Updated 2 weeks ago
- TBD☆42Feb 3, 2026Updated last month
- A Unified Visual Generator with Interleaved OmniModal Context☆192Feb 10, 2026Updated 3 weeks ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆39Jan 23, 2026Updated last month
- The code of "HSR-KAN: Hyperspectral Image Super-Resolution based on Kolmogorov-Arnold Networks"☆25Sep 15, 2024Updated last year
- [ICME 2025] DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation☆24Mar 25, 2025Updated 11 months ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆56Feb 11, 2026Updated 3 weeks ago
- the code of GRFormer: Grouped Residual Self-Attention for Lightweight Single Image Super-Resolution☆26May 16, 2024Updated last year
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated 2 weeks ago
- DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models☆171Jan 4, 2026Updated 2 months ago
- ☆26Jun 22, 2024Updated last year
- ReMoMask: Retrieval-Augmented Masked Motion Generation☆39Feb 14, 2026Updated 3 weeks ago
- Self-reimplemented version of 4D-LRM.☆65May 30, 2025Updated 9 months ago
- [AAAI 2026] SlideTailor: Personalized Presentation Slide Generation for Scientific Papers☆45Jan 1, 2026Updated 2 months ago
- ☆55Jan 30, 2026Updated last month
- ☆28Apr 8, 2025Updated 11 months ago