hustvl / CircuitFormer
[NeurIPS 2023] CircuitFormer: Circuit as Set of Points
☆33Updated last year
Alternatives and similar repositories for CircuitFormer:
Users that are interested in CircuitFormer are comparing it to the libraries listed below
- ☆16Updated last year
- ☆10Updated 2 months ago
- MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More☆28Updated 3 months ago
- ☆14Updated last month
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆44Updated last month
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆21Updated 9 months ago
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆61Updated 8 months ago
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆58Updated 2 weeks ago
- DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆117Updated last month
- ☆12Updated 3 months ago
- The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction".☆50Updated 2 weeks ago
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Pekin…☆67Updated 3 months ago
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆32Updated 4 months ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Updated last year
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆21Updated 3 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆32Updated 7 months ago
- [NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue …☆101Updated 2 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆24Updated 11 months ago
- PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers☆36Updated 4 months ago
- PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆22Updated last month
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"☆43Updated this week
- Official repository of InLine attention (NeurIPS 2024)☆35Updated last month
- [NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks☆102Updated last month
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆68Updated 7 months ago
- Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆37Updated 3 weeks ago
- The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"☆27Updated 3 weeks ago
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆99Updated 10 months ago
- PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos☆35Updated last month
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆40Updated 7 months ago