Learnable Semi-structured Sparsity for Vision Transformers and Diffusion Transformers
☆15Feb 7, 2025Updated last year
Alternatives and similar repositories for MaskLLM-4V
Users that are interested in MaskLLM-4V are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models☆187Jan 1, 2025Updated last year
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- ☆32Oct 4, 2025Updated 6 months ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆129May 22, 2025Updated 11 months ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆84Jul 23, 2024Updated last year
- PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024.☆16Jun 4, 2024Updated last year
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆18Dec 5, 2024Updated last year
- DMax: Aggressive Parallel Decoding for dLLMs☆110Apr 20, 2026Updated last week
- Vico: Compositional Video Generation as Flow Equalization☆59Nov 15, 2024Updated last year
- ☆17Dec 11, 2024Updated last year
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video☆68May 8, 2025Updated 11 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆119Jul 15, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 7 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆37Updated this week
- (ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"☆78Feb 13, 2025Updated last year
- ☆13Nov 29, 2024Updated last year
- Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)☆14Jan 8, 2024Updated 2 years ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆44Nov 8, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".☆118May 3, 2025Updated 11 months ago
- Official Pytorch Implementation of Paper "DarwinLM: Evolutionary Structured Pruning of Large Language Models"☆20Feb 21, 2025Updated last year
- [CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow☆165Dec 1, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆15Oct 22, 2024Updated last year
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆63Feb 22, 2026Updated 2 months ago
- 2025最新机场节点购买推荐☆19Apr 2, 2026Updated 3 weeks ago
- MICRO 2023 Evaluation Artifact for TeAAL☆11Oct 26, 2023Updated 2 years ago
- [ACL '26] Source code for paper "Empirical Analysis of Decoding Biases in Masked Diffusion Models"☆39Jan 11, 2026Updated 3 months ago
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆109Sep 27, 2025Updated 7 months ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated last year
- Flash Sculptor: Modular 3D Worlds from Objects☆33Apr 13, 2025Updated last year
- ☆71Nov 18, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem☆21Apr 7, 2026Updated 3 weeks ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- [CVPR'24] Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression☆15Jul 1, 2024Updated last year
- DiTAS: Quantizing Diffusion Transformers via Enhanced Activation Smoothing (WACV 2025)☆13Feb 7, 2026Updated 2 months ago
- ☆79Feb 4, 2025Updated last year
- Accelerator Zoo☆20Oct 14, 2025Updated 6 months ago
- Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper☆17Nov 6, 2025Updated 5 months ago