feihongyan1 / LazyMAR
[ICCV2025]LazyMAR: Accelerating Masked Autoregressive Models via Feature Caching
☆46Updated 2 months ago
Alternatives and similar repositories for LazyMAR
Users who are interested in LazyMAR are comparing it to the repositories listed below:
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆83Updated 2 months ago
- Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficien…☆135Updated last week
- The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink…☆754Updated 3 weeks ago
- VisionSelector: End-to-End Learnable Visual Token Compression for Efficient Multimodal LLMs☆48Updated 2 months ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆52Updated last year
- assistant tools for attention visualization in deep learning☆28Updated 3 years ago
- [AAAI-2025] The official code for SiTo (Similarity-based Token Pruning for Stable Diffusion Models)☆40Updated 7 months ago
- toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts☆27Updated last year
- [CVPR 2025] DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models☆63Updated last month
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆69Updated 6 months ago
- ☆42Updated last year
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆164Updated 6 months ago
- This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel s…☆52Updated last month
- ☆152Updated last year
- [EMNLP 2025 main 🔥] Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆97Updated 2 months ago
- 🕹️The toy examples of Kolmogorov-Arnold Network (Get Started Quickly)☆75Updated last year
- [ICML'25] Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" and "Sp…☆217Updated 2 weeks ago
- [CVPR'24] Official implementation of paper "FreeKD: Knowledge Distillation via Semantic Frequency Prompt".☆49Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆234Updated last year
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆108Updated last year
- Automatically update arXiv papers about SOT & VLT, Multi-modal Learning, LLM and Video Understanding using Github Actions.☆40Updated this week
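- PyTorch training code for single-precision, half-precision, and mixed-precision training on a single GPU, multiple GPUs (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across methods☆129Updated last year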
- SmartCLIP: A training method to improve CLIP with both short and long texts☆36Updated 6 months ago
- [AAAI 2025] PAT: Pruning-Aware Tuning for Large Language Models☆36Updated 11 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆79Updated 2 months ago
- [NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks☆133Updated last year
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆105Updated last year
- ☆221Updated 10 months ago
- ☆92Updated 2 years ago
- Some experiences to help new researchers grow up☆43Updated 2 years ago