DAGroup-PKU / MHLALinks
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head
☆68Updated last week
Alternatives and similar repositories for MHLA
Users that are interested in MHLA are comparing it to the libraries listed below
Sorting:
- CoV: Chain-of-View Prompting for Spatial Reasoning☆33Updated last week
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆31Updated 2 weeks ago
- 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation☆93Updated this week
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Updated last year
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆19Updated 2 weeks ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Updated 4 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆86Updated 10 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Updated 8 months ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆56Updated 3 months ago
- Official Repo for Self-Forcing++ High Quality Long Video Generation☆225Updated 3 months ago
- The official implementation of The paper "Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation"☆94Updated 3 weeks ago
- ☆52Updated 2 weeks ago
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment☆36Updated 3 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆107Updated 10 months ago
- Test-time Scaling for VAR models☆29Updated 4 months ago
- Offical implementation of "Auto-Regressively Generating Multi-View Consistent Images". (ICCV 2025)☆81Updated 5 months ago
- ☆141Updated 3 months ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆113Updated 3 months ago
- ☆52Updated last year
- VideoAuteur: Towards Long Narrative Video Generation☆43Updated 2 months ago
- Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".☆120Updated last month
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆38Updated 5 months ago
- Self-reimplemented version of 4D-LRM.☆65Updated 7 months ago
- Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation☆102Updated last month
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆36Updated last month
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆203Updated 3 months ago
- [AAAI 2026] GenMAC for Compositional Text-to-Video Generation☆31Updated last week
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆166Updated 3 months ago
- Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆158Updated last month
- [ICCV'25] Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆78Updated 6 months ago