ASC-Competition / ASC24-LLM-inference-optimizationLinks

The dataset and baseline code for ASC23 LLM inference optimization challenge.

☆32

Alternatives and similar repositories for ASC24-LLM-inference-optimization

Users that are interested in ASC24-LLM-inference-optimization are comparing it to the libraries listed below

Sorting:

OpenCAEPlus / OpenCAEPoro_ASC2024
OpenCAEPoro for ASC 2024
☆37Updated last year
MLSys-Learner-Resources / Awesome-MLSys-Blogger
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
☆265Updated 7 months ago
hpcgame / hpcgame_1st_problems
Repository for HPCGame 1st Problems.
☆63Updated last year
pkusc / zaychik-power-controller
The Zaychik Power Controller server
☆13Updated last year
PKUFlyingPig / MIT6.5940_TinyML
Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing
☆51Updated 7 months ago
hpcgame / hpc-wiki
Wiki fo HPC
☆116Updated 2 weeks ago
Strivin0311 / llms-learning
A repository sharing the literatures about large language models
☆98Updated last month
zhang-tlgg / HPC-Lab
HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.
☆25Updated 2 years ago
interestingLSY / CUDA-From-Correctness-To-Performance-Code
Codes & examples for "CUDA - From Correctness to Performance"
☆104Updated 9 months ago
interestingLSY / swiftLLM
A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of …
☆239Updated 2 months ago
TreeAI-Lab / Awesome-KV-Cache-Management
This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…
☆175Updated last week
chenhongyu2048 / LLM-inference-optimization-paper
Summary of some awesome work for optimizing LLM inference
☆95Updated 2 months ago
MoE-Inf / awesome-moe-inference
Curated collection of papers in MoE model inference
☆225Updated last week
y9c / m5C-UBSseq
🧪 Ultrafast bisulfite
☆36Updated last year
mental2008 / awesome-papers
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o…
☆120Updated last week
hongzhangblaze / CS854-F24
☆43Updated last week
lambda7xx / awesome-AI-system
paper and its code for AI System
☆318Updated 3 months ago
SuDIS-ZJU / llm-inference-all-in-one
☆13Updated 5 months ago
thu-cs-lab / HPC-Lab-Docs
Documentation for HPC course
☆153Updated last month
Sunt-ing / stick
A PyTorch-like deep learning framework. Just for fun.
☆156Updated last year
guch8017 / USTC_CS_EXAM
中科大计算机学院部分课程的试卷
☆78Updated 2 weeks ago
snu-comparch / InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
☆149Updated last year
Zefan-Cai / Awesome-LLM-KV-Cache
Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.
☆345Updated 5 months ago
DicardoX / Research-Space
This repository is established to store personal notes and annotated papers during daily research.
☆140Updated this week
adsl-rg / adsl-rg.github.io
☆12Updated last week
PKUFlyingPig / CS149-parallel-computing
Learning materials for Stanford CS149 : Parallel Computing
☆231Updated 4 years ago
galeselee / Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…
☆264Updated 5 months ago
FlexFusion / FlexFusion
The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221
☆22Updated 3 months ago
Hsword / Awesome-Machine-Learning-System-Papers
☆74Updated 3 years ago
YaoJiayi / CacheBlend
☆125Updated 3 weeks ago