ASC-Competition / ASC24-LLM-inference-optimization
The dataset and baseline code for the ASC23 LLM inference optimization challenge.
☆34 Updated last year
Alternatives and similar repositories for ASC24-LLM-inference-optimization:
Users who are interested in ASC24-LLM-inference-optimization are comparing it to the repositories listed below:
- OpenCAEPoro for ASC 2024 ☆37 Updated last year
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems) ☆192 Updated last month
- Repository for HPCGame 1st Problems. ☆61 Updated last year
- Summary of some awesome work for optimizing LLM inference ☆58 Updated last month
- This repository is established to store personal notes and annotated papers during daily research. ☆110 Updated this week
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24) ☆110 Updated 7 months ago
- The Zaychik Power Controller server ☆13 Updated 10 months ago
- A comprehensive guide for beginners in the field of data management and artificial intelligence. ☆160 Updated 3 months ago
- HPC-Lab for High Performance Computing course, 2023 Spring, Tsinghua University. Introduction to High Performance Computing @ THU. ☆20 Updated last year
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co… ☆71 Updated last week
- Since the emergence of ChatGPT in 2022, the acceleration of Large Language Models has become increasingly important. Here is a list of pap… ☆223 Updated 2 months ago
- A tiny yet powerful LLM inference system tailored for research purposes. vLLM-equivalent performance with only 2k lines of code (2% of … ☆141 Updated 7 months ago
- Papers and their code for AI systems ☆275 Updated last month
- Wiki for HPC ☆110 Updated last month
- A PyTorch-like deep learning framework. Just for fun. ☆143 Updated last year
- Curated collection of papers in MoE model inference ☆81 Updated last week
- A repository sharing the literature about large language models ☆70 Updated this week
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable ☆146 Updated 5 months ago
- Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o… ☆71 Updated this week
- Systems for GenAI ☆120 Updated last week
- Documentation for HPC course ☆142 Updated this week
- High performance Transformer implementation in C++. ☆103 Updated last month
- Curated collection of papers in machine learning systems ☆247 Updated this week