tiingweii-shii / Awesome-Resource-Efficient-LLM-PapersView external linksLinks
a curated list of high-quality papers on resource-efficient LLMs 🌱
☆156Mar 15, 2025Updated 11 months ago
Alternatives and similar repositories for Awesome-Resource-Efficient-LLM-Papers
Users that are interested in Awesome-Resource-Efficient-LLM-Papers are comparing it to the libraries listed below
Sorting:
- ☆38Jan 15, 2021Updated 5 years ago
- [TMLR 2024] Efficient Large Language Models: A Survey☆1,253Jun 23, 2025Updated 7 months ago
- A curated list for Efficient Large Language Models☆1,951Jun 17, 2025Updated 8 months ago
- SFS: A Smart OS Scheduler for Serverless Function Workloads (SC'22)☆13Dec 15, 2022Updated 3 years ago
- Large Language Model (LLM) Systems Paper List☆1,818Feb 8, 2026Updated last week
- ☆52Dec 13, 2022Updated 3 years ago
- (ICCV 2023) Official implementation of Rectified Straight Through Estimator (ReSTE).☆31Sep 20, 2024Updated last year
- Customized Inference Engine for Multiverse Models☆24Jun 27, 2025Updated 7 months ago
- These are papers that I read and reviewed related to NLP, CV, and Deep Learning 😉 You can check paper links and my reviews 😊☆13Jan 3, 2024Updated 2 years ago
- ☆20Nov 20, 2024Updated last year
- Reading notes on Speculative Decoding papers☆21Dec 8, 2025Updated 2 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence☆17May 19, 2025Updated 8 months ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆12Mar 7, 2024Updated last year
- The official implementation of the DAC 2024 paper GQA-LUT☆20Dec 20, 2024Updated last year
- ☆17Dec 3, 2020Updated 5 years ago
- [WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese),包含中文版【底层视觉问答】和【底层视觉描述】数据集,以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago
- mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless inte…☆24Nov 28, 2024Updated last year
- Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark☆33Aug 20, 2025Updated 5 months ago
- ☆42Dec 15, 2022Updated 3 years ago
- Summary of some awesome work for optimizing LLM inference☆176Updated this week
- Automatically Update LLM-Agent Papers Daily using Github Actions (Update Every 12th hours)☆19Updated this week
- This is the official repository for NeurIPS 2023 paper "Curriculum Learning for Graph Neural Networks: Which Edges Should We Learn First"☆17Oct 27, 2023Updated 2 years ago
- 🔮 LLM GPU Calculator☆21Aug 19, 2023Updated 2 years ago
- 📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉☆4,990Jan 18, 2026Updated last month
- Awesome LLM compression research papers and tools.☆1,776Nov 10, 2025Updated 3 months ago
- Accelerating Deep Learning Training Through Transparent Storage Tiering (CCGrid'22)☆19Dec 13, 2022Updated 3 years ago
- The benchmark proposed in paper: GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability☆23Aug 12, 2025Updated 6 months ago
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…☆204Feb 10, 2025Updated last year
- [COLM 2024] SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models☆25Oct 5, 2024Updated last year
- [CVPR 2025] Offical implementation of the paper "Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters The…☆31Feb 27, 2025Updated 11 months ago
- Code Repository of Evaluating Quantized Large Language Models☆135Sep 8, 2024Updated last year
- ☆30Oct 4, 2025Updated 4 months ago
- BigVectorBench advances vector database benchmarking by defining and evaluating the embedding performance of heterogeneous data and abstr…☆27Jan 17, 2025Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- 한국어 LLM 리더보드 및 모델 성능/안전성 관리☆22Sep 26, 2023Updated 2 years ago
- ☆26Aug 31, 2023Updated 2 years ago
- 🏆🏆 「大模型」All in one & All from scratch. 🌍🌍 收集、清洗数据,训练Tokenizer,预训练、SFT、GRPO!☆52Aug 12, 2025Updated 6 months ago
- ☆22Nov 7, 2018Updated 7 years ago
- [ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access☆56Aug 6, 2025Updated 6 months ago