pengzhao-intel / oneAPI_course
oneAPI - Data Parallel C++ course for students
☆42Updated 4 months ago
Alternatives and similar repositories for oneAPI_course:
Users who are interested in oneAPI_course are comparing it to the repositories listed below
- Documentation for YatCPU☆49Updated last year
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆16Updated 3 months ago
- ☆226Updated last month
- The dataset and baseline code for ASC23 LLM inference optimization challenge.☆34Updated last year
- Repository for HPCGame 1st Problems.☆61Updated last year
- Solution of Programming Massively Parallel Processors☆41Updated last year
- Homework solutions for CMU 10-414/714 – Deep Learning Systems: Algorithms and Implementation☆43Updated 2 years ago
- HPC-Lab for High Performance Computing course, 2023 Spring, Tsinghua University. Introduction to High Performance Computing @ THU.☆21Updated last year
- Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]☆61Updated 2 years ago
- ☆25Updated 11 months ago
- ☆100Updated last week
- Codes & examples for "CUDA - From Correctness to Performance"☆86Updated 4 months ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆53Updated 6 months ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆92Updated 2 years ago
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆112Updated 8 months ago
- Summary of some awesome work for optimizing LLM inference☆63Updated this week
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆134Updated 3 years ago
- some hpc project for learning☆20Updated 6 months ago
- ☆10Updated 2 years ago
- This repository is established to store personal notes and annotated papers during daily research.☆112Updated last week
- My Paper Reading Lists and Notes.☆20Updated 2 months ago
- Machine Learning Compiler Road Map☆43Updated last year
- A toy SysY compiler for the PKU compiler course project, 2023 spring.☆12Updated last year
- The Zaychik Power Controller server☆13Updated 11 months ago
- Artifacts for our ASPLOS'23 paper ElasticFlow☆52Updated 10 months ago
- Documentation for HPC course☆143Updated this week
- High performance Transformer implementation in C++.☆105Updated last month
- My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022☆36Updated 11 months ago
- DGEMM on KNL, achieve 75% MKL☆16Updated 2 years ago
- ☆105Updated 3 months ago