NoakLiu / LLMEasyQuant
An Easy-to-Use Toolkit for LLM Quantization on can be executed on Macbook [Efficient ML Model]
☆15Updated 3 weeks ago
Alternatives and similar repositories for LLMEasyQuant:
Users that are interested in LLMEasyQuant are comparing it to the libraries listed below
- GraphSnapShot: Caching Local Structure for Fast Graph Learning [Efficient ML System]☆30Updated 2 months ago
- Accelerating Embedding Training on Multitask Scenario [Efficient ML Model]☆11Updated last month
- Efficient-Large-Foundation-Model-Inference: A-Perspective-From-Model-and-System-Co-Design [Efficient ML System & Model]☆19Updated 2 weeks ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆40Updated last month
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Infe…☆89Updated 2 months ago
- 🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".☆15Updated last month
- The official code implementation of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆24Updated this week
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Pekin…☆71Updated 3 months ago
- [ICCV 2023 oral] This is the official repository for our paper: ''Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning''.☆65Updated last year
- MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer☆36Updated 4 months ago
- ☆17Updated 5 months ago
- Survey on Data-centric Large Language Models☆73Updated 6 months ago
- A paper list of some recent works about Token Compress for Vit and VLM☆293Updated this week
- [NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models".☆34Updated 3 months ago
- Survey and Benchmark of VIALM☆9Updated last year
- ☆25Updated 7 months ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆20Updated 7 months ago
- Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data☆19Updated 5 months ago
- An end-to-end benchmark suite of multi-modal DNN applications for system-architecture co-design☆22Updated last month
- This is the official repo for Contrastive Vision-Language Alignment Makes Efficient Instruction Learner.☆20Updated last year
- ☆42Updated last month
- 🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training☆65Updated last month
- [ICML 2024] CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers.☆30Updated last month
- The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".☆207Updated last week
- [TPAMI reviewing] Towards Visual Grounding: A Survey☆58Updated 2 weeks ago
- A collection of visual instruction tuning datasets.☆76Updated 10 months ago
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆45Updated last month
- CVPR2024 highlight.☆13Updated 3 months ago
- Pruning the VLLMs☆79Updated last month
- Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"☆226Updated last month