jeho-lee / Awesome-Efficient-AI

☆12

Related projects ⓘ

Alternatives and complementary repositories for Awesome-Efficient-AI

DD-DuDa / awesome-vit-quantization-acceleration
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
☆55Updated 5 months ago
SNU-ARC / any-precision-llm
[ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs
☆83Updated 3 months ago
xvyaward / owq
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…
☆53Updated 8 months ago
yifanlu0227 / MIT-6.5940
All Homeworks for TinyML and Efficient Deep Learning Computing 6.5940 • Fall • 2023 • https://efficientml.ai
☆137Updated 11 months ago
hongsunjang / pipe-bd
[DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation
☆11Updated last year
hustvl / PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆52Updated last year
junstar92 / nvidia-libraries-study
☆44Updated last week
Qualcomm-AI-research / transformer-quantization
☆195Updated 3 years ago
liyunqianggyn / Awesome-LLMs-Pruning
Awesome LLM pruning papers all-in-one repository with integrating all useful resources and insights.
☆39Updated last week
ztt-21 / zTT
zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation
☆20Updated 3 years ago
liangyn22 / MCUFormer
[NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory
☆64Updated last year
GATECH-EIC / Edge-LLM
[DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive La…
☆33Updated 4 months ago
efficient-ai-study / efficient-ai-study
☆83Updated 7 months ago
TianjinYellow / EdgeDeviceLLMCompetition-Starting-Kit
☆39Updated 3 weeks ago
SamsungLabs / Genie
Official Implementation of "Genie: Show Me the Data for Quantization" (CVPR 2023)
☆17Updated last year
DeepAuto-AI / hip-attention
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
☆19Updated 2 weeks ago
chengtao-lv / PTQ4SAM
[CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything
☆56Updated 4 months ago
aiha-lab / TSLD
[NeurIPS 2023] Token-Scaled Logit Distillation for Ternary Weight Generative Language Models
☆17Updated 11 months ago
hahnyuan / PTQ4ViT
Post-Training Quantization for Vision transformers.
☆191Updated 2 years ago
Efficient-ML / Awesome-Efficient-LLM-Diffusion
A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…
☆155Updated 3 weeks ago
SteveTsui / Q-DETR
☆27Updated last year
theyoungkwon / TinyTrain
The official implementation of TinyTrain [ICML '24]
☆20Updated 4 months ago
thu-nics / qllm-eval
Code Repository of Evaluating Quantized Large Language Models
☆103Updated 2 months ago
Macaronlin / LLaMA3-Quantization
A repository dedicated to evaluating the performance of quantizied LLaMA3 using various quantization methods..
☆166Updated 3 months ago
aiha-lab / Attention-Head-Pruning
Layer-wise Pruning of Transformer Heads for Efficient Language Modeling
☆21Updated 2 years ago
IST-DASLab / OBC
Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".
☆104Updated last year
Nota-NetsPresso / shortened-llm
Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]
☆65Updated 2 months ago
pprp / Awesome-LLM-Prune
Awesome list for LLM pruning.
☆169Updated this week
Qualcomm-AI-research / pruning-vs-quantization
☆18Updated 8 months ago
Zhen-Dong / Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
☆460Updated 2 months ago