pengzhao-intel / oneAPI_courseLinks

oneAPI - Data Parallel C++ course for students

☆42

Alternatives and similar repositories for oneAPI_course

Users that are interested in oneAPI_course are comparing it to the libraries listed below

Sorting:

ASC-Competition / ASC24-LLM-inference-optimization
The dataset and baseline code for ASC23 LLM inference optimization challenge.
☆34Updated last year
lcpu-club / hpcgame_1st_problems
Repository for HPCGame 1st Problems.
☆62Updated last year
DicardoX / Research-Space
This repository is established to store personal notes and annotated papers during daily research.
☆125Updated this week
microsoft / ConvStencil
☆27Updated last year
zhaiyi000 / tlm
☆38Updated 10 months ago
Thesys-lab / Helix-ASPLOS25
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆46Updated 6 months ago
PAA-NCIC / PE
performance engineering
☆30Updated 10 months ago
SYSU-SCC / yatcpu-docs
Documentation for YatCPU
☆51Updated last year
snu-comparch / InfiniGen
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)
☆136Updated 10 months ago
SJTU-IPADS / reef
REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…
☆94Updated 2 years ago
chenhongyu2048 / LLM-inference-optimization-paper
Summary of some awesome work for optimizing LLM inference
☆73Updated this week
SJTU-ReArch-Group / Paper-Reading-List
☆108Updated last week
zhang-tlgg / HPC-Lab
HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.
☆24Updated last year
lcpu-club / hpc-wiki
Wiki fo HPC
☆112Updated 4 months ago
interestingLSY / CUDA-From-Correctness-To-Performance-Code
Codes & examples for "CUDA - From Correctness to Performance"
☆98Updated 7 months ago
pkusc / zaychik-power-controller
The Zaychik Power Controller server
☆13Updated last year
InfiniTensor / InfiniTensor
☆238Updated 3 months ago
sunkx109 / GPUs-Specs
Summary of the Specs of Commonly Used GPUs for Training and Inference of LLM
☆44Updated 2 months ago
OpenCAEPlus / OpenCAEPoro_ASC2024
OpenCAEPoro for ASC 2024
☆38Updated last year
LLMServe / SwiftTransformer
High performance Transformer implementation in C++.
☆124Updated 4 months ago
mental2008 / awesome-papers
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o…
☆105Updated this week
lipracer / cuda-rt-hook
☆36Updated 5 months ago
Hsword / Awesome-Machine-Learning-System-Papers
☆73Updated 3 years ago
hongzhangblaze / CS854-F24
☆37Updated 7 months ago
libxsmm / tpp-pytorch-extension
Intel® Tensor Processing Primitives extension for Pytorch*
☆17Updated 2 weeks ago
SYSU-SCC / sysu-scc-spack-repo
Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.
☆16Updated 6 months ago
goliaro / specinfer-ae
☆21Updated last year
xxyux / SpInfer
SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
☆47Updated 2 months ago
NEO-MLSys25 / NEO
NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading
☆36Updated 3 months ago
pkusys / ElasticFlow
Artifacts for our ASPLOS'23 paper ElasticFlow
☆51Updated last year