piDack / The-ans-for-Programming-Massively-Parallel-Processor

Answers to Programming Massively Parallel Processors, 2nd edition

☆33

Alternatives and similar repositories for The-ans-for-Programming-Massively-Parallel-Processor

Users who are interested in The-ans-for-Programming-Massively-Parallel-Processor are comparing it to the repositories listed below

Syencil / Programming_Massively_Parallel_Processors
Code and notes for the 6 major CUDA parallel computing patterns
☆60 Updated 5 years ago
CalvinXKY / BasicCUDA
A tutorial for CUDA & PyTorch
☆150 Updated 6 months ago
InfiniTensor / RefactorGraph
A layered, decoupled deep learning inference engine
☆74 Updated 5 months ago
mrzhuzhe / riven
CPU Memory Compiler and Parallel programming
☆26 Updated 8 months ago
XiaoSong9905 / HPC-Notes
Personal Notes for Learning HPC & Parallel Computation [Active Adding New Content]
☆69 Updated 3 years ago
zpye / SimpleInfer
A simple neural network inference framework
☆25 Updated 2 years ago
njuhope / cuda_sgemm
☆113 Updated last year
JieRen98 / SGEMM-SASS-Annotation
☆21 Updated 4 years ago
MARD1NO / Learning_CUDA
☆26 Updated 4 years ago
zjhellofss / KuiperCourse
Course on Bilibili
☆75 Updated last year
AyakaGEMM / Hands-on-GEMM
☆137 Updated last year
li199603 / sgemm_with_cuda
Step-by-step SGEMM optimization with CUDA
☆20 Updated last year
doorteeth / learn_cuda
☆41 Updated 3 years ago
lzyrapx / LeetGPU
Solutions of LeetGPU
☆29 Updated this week
AdvancedCompiler / AdvancedCompiler
Personal homepage of the Advanced Compiler Lab
☆118 Updated 3 months ago
JackonYang / hands-on-tvm
Hands-on model tuning with TVM, profiled on a Mac M1, x86 CPU, and GTX-1080 GPU.
☆49 Updated 2 years ago
RussWong / LLM-engineering
☆24 Updated 4 months ago
XiaoSong9905 / CUDA-Optimization-Guide
Xiao's CUDA Optimization Guide [NO LONGER ADDING NEW CONTENT]
☆308 Updated 2 years ago
nicolaswilde / cuda-tensorcore-hgemm
☆149 Updated 7 months ago
Qwesh157 / conv_op_optimization
This project covers convolution operator optimization on GPU, including GEMM-based (implicit GEMM) convolution.
☆37 Updated 7 months ago
Bruce-Lee-LY / memory_pool
A simple and efficient memory pool implemented in C++11.
☆10 Updated 3 years ago
MAhaitao999 / CUDA_Programming
Code for the book 《CUDA编程基础与实践》 (CUDA Programming: Basics and Practice)
☆127 Updated 3 years ago
YuxueYang1204 / CudaDemo
Implement custom operators in PyTorch with CUDA/C++
☆65 Updated 2 years ago
BBuf / how-to-optimize-gemm
☆97 Updated 3 years ago
sallenkey-wei / cuda-handbook
pdf
☆91 Updated 7 years ago
tongzhou80 / nanoPyC
☆70 Updated 2 years ago
openmlsys / openmlsys-cuda
Tutorials for writing high-performance GPU operators in AI frameworks.
☆129 Updated last year
MegEngine / mperf
mperf is an operator performance tuning toolbox for mobile/embedded platforms
☆188 Updated last year
MARD1NO / CUDA-PPT
☆102 Updated 4 months ago
l1nkr / DL-Compiler-Navigation
Machine Learning Compiler Road Map
☆43 Updated last year