jeho-lee / Awesome-Efficient-AI
☆12Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Efficient-AI
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆55Updated 5 months ago
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆83Updated 3 months ago
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆53Updated 8 months ago
- All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai☆137Updated 11 months ago
- [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆11Updated last year
- [CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric☆52Updated last year
- ☆44Updated last week
- ☆195Updated 3 years ago
- Awesome LLM pruning papers all-in-one repository with integrating all useful resources and insights.☆39Updated last week
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆20Updated 3 years ago
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆64Updated last year
- [DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive La…☆33Updated 4 months ago
- ☆83Updated 7 months ago
- ☆39Updated 3 weeks ago
- Official Implementation of "Genie: Show Me the Data for Quantization" (CVPR 2023)☆17Updated last year
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.☆19Updated 2 weeks ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆56Updated 4 months ago
- [NeurIPS 2023] Token-Scaled Logit Distillation for Ternary Weight Generative Language Models☆17Updated 11 months ago
- Post-Training Quantization for Vision transformers.☆191Updated 2 years ago
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…☆155Updated 3 weeks ago
- ☆27Updated last year
- The official implementation of TinyTrain [ICML '24]☆20Updated 4 months ago
- Code Repository of Evaluating Quantized Large Language Models☆103Updated 2 months ago
- A repository dedicated to evaluating the performance of quantizied LLaMA3 using various quantization methods..☆166Updated 3 months ago
- Layer-wise Pruning of Transformer Heads for Efficient Language Modeling☆21Updated 2 years ago
- Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".☆104Updated last year
- Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]☆65Updated 2 months ago
- Awesome list for LLM pruning.☆169Updated this week
- ☆18Updated 8 months ago
- List of papers related to neural network quantization in recent AI conferences and journals.☆460Updated 2 months ago