nvixnu/pmpp__programming_massively_parallel_processors

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nvixnu/pmpp__programming_massively_parallel_processors)

nvixnu / pmpp__programming_massively_parallel_processors

Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (Third Edition)

☆79

Alternatives and similar repositories for pmpp__programming_massively_parallel_processors

Users that are interested in pmpp__programming_massively_parallel_processors are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

guanrenyang / Programming-Massively-Parallel-Processors
View on GitHub
Solution of Programming Massively Parallel Processors
☆51Jan 15, 2024Updated 2 years ago
R100001 / Programming-Massively-Parallel-Processors
View on GitHub
☆238Aug 2, 2024Updated last year
lecoan / pytorch-RLE
View on GitHub
A implement of run-length encoding for Pytorch tensor using CUDA
☆14Apr 7, 2021Updated 5 years ago
sjfeng1999 / gpu-arch-microbenchmark
View on GitHub
Dissecting NVIDIA GPU Architecture
☆126Jul 11, 2022Updated 4 years ago
gpu-mode / triton-tutorials
View on GitHub
☆16May 14, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
dawn-chu / EECS-368-Programming-Massively-Parallel-Processors-with-CUDA
View on GitHub
☆19May 17, 2016Updated 10 years ago
DanieleDeSensi / mammut
View on GitHub
MAchine Micro Management UTilities
☆12Nov 5, 2020Updated 5 years ago
UNITES-Lab / MoE-Quantization
View on GitHub
Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"
☆31Jun 30, 2025Updated last year
gpusgobrr / explore-gemm
View on GitHub
Exploring how optimizations for GEMMs work
☆36Feb 28, 2026Updated 4 months ago
Jaykef / Triton-nanoGPT
View on GitHub
Custom triton kernels for training Karpathy's nanoGPT.
☆19Oct 21, 2024Updated last year
spcl / SMI
View on GitHub
Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware
☆15Mar 1, 2022Updated 4 years ago
IntelLabs / EquiTriton
View on GitHub
EquiTriton is a project that seeks to implement high-performance kernels for commonly used building blocks in equivariant neural networks…
☆74May 25, 2026Updated 2 months ago
Bruce-Lee-LY / cuda_hgemm
View on GitHub
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruct…
☆558Sep 8, 2024Updated last year
tspeterkim / paged-attention-minimal
View on GitHub
a minimal cache manager for PagedAttention, on top of llama3.
☆148Aug 26, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Ammar-Alnagar / Enlightener
View on GitHub
Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …
☆13Jul 28, 2025Updated 11 months ago
ademeure / QuickRunCUDA
View on GitHub
☆20May 30, 2026Updated last month
aschuh703 / ECE408
View on GitHub
☆54Dec 4, 2023Updated 2 years ago
zohourih / FPGAMemBench
View on GitHub
Memory Benchmark for OpenCL-supported Intel FPGAs
☆12Dec 25, 2023Updated 2 years ago
gpu-mode / lectures
View on GitHub
Material for gpu-mode lectures
☆6,355Jun 15, 2026Updated last month
zartbot / gfd
View on GitHub
GPU Functional Descriptor for memory access
☆34May 24, 2026Updated 2 months ago
al8n / rarena
View on GitHub
Lock-free ARENA allocator and a set of lock-free data structures based on the ARENA allocator.
☆17Mar 14, 2026Updated 4 months ago
Snektron / gpumode-amd-fp8-mm
View on GitHub
My submission for the GPUMODE/AMD fp8 mm challenge
☆29Jun 4, 2025Updated last year
meta-pytorch / tritonbench
View on GitHub
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
☆362Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sudox77 / httpx
View on GitHub
☆23Oct 23, 2025Updated 9 months ago
eedalong / ECE408
View on GitHub
Code base and slides for ECE408：Applied Parallel Programming On GPU.
☆147Jul 2, 2021Updated 5 years ago
gpu-mode / resource-stream
View on GitHub
GPU programming related news and material links
☆2,239Jun 15, 2026Updated last month
FdyCN / PTX-ISA
View on GitHub
CUDA PTX-ISA Document 中文翻译版
☆56Sep 29, 2025Updated 9 months ago
hao-ai-lab / cse234-w25-PA
View on GitHub
☆52Mar 14, 2025Updated last year
LinkedInLearning / python-advanced-2996438
View on GitHub
Advanced Python (German)
☆10Sep 5, 2023Updated 2 years ago
cslab-ntua / artificial-matrix-generator
View on GitHub
An artificial matrix generator in C
☆13Feb 16, 2023Updated 3 years ago
LinkedInLearning / github-essential-training-1-the-basics-4378192
View on GitHub
This is a repository for the LinkedIn Learning course GitHub Essential Training: The Basics
☆13Aug 1, 2023Updated 2 years ago
fionn / feynman
View on GitHub
Calculate allowed interactions in QED
☆10Nov 2, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
amosproj / amos2022ws02-automotive-test-app
View on GitHub
Android Automotive Testapp
☆13Feb 10, 2023Updated 3 years ago
sgl-project / sgl-learning-materials
View on GitHub
Materials for learning SGLang
☆861Jan 5, 2026Updated 6 months ago
CoffeeBeforeArch / spring_2020_tutorial
View on GitHub
"Hardware, Software, and Compilers! Oh My!" tutorial files
☆16Jan 25, 2020Updated 6 years ago
weishengying / cute_gemm
View on GitHub
☆23Aug 14, 2024Updated last year
pm133 / SCF_Szabo
View on GitHub
This is a C version of the SCF code found in Appendix B of Modern Quantum Chemistry, An Introduction to Electronic Structure Theory by A.…
☆10Jan 1, 2019Updated 7 years ago
gpu-mode / pygpubench
View on GitHub
GPU kernel benchmarking
☆47Jun 10, 2026Updated last month
ColfaxResearch / cfx-article-src
View on GitHub
☆193May 7, 2025Updated last year