☆228Aug 2, 2024Updated last year
Alternatives and similar repositories for Programming-Massively-Parallel-Processors
Users that are interested in Programming-Massively-Parallel-Processors are comparing it to the libraries listed below.
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆78Jan 21, 2021Updated 5 years ago
- Solution of Programming Massively Parallel Processors☆51Jan 15, 2024Updated 2 years ago
- Code and notes for the 6 major CUDA parallel computing patterns☆62Jul 30, 2020Updated 5 years ago
- ☆49Apr 15, 2024Updated 2 years ago
- Material for gpu-mode lectures☆6,040Apr 22, 2026Updated last week
- Solution to Cartpole balancing problem with the help of reinforcement learning and Deep Neural Networks.☆11May 5, 2023Updated 3 years ago
- Optimized Parallel Tiled Approach to perform Matrix Multiplication by taking advantage of the lower latency, higher bandwidth shared memo…☆16Sep 24, 2017Updated 8 years ago
- Single-header C++ implementation of a Z-order octree data structure☆11May 3, 2021Updated 5 years ago
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆144Jul 2, 2021Updated 4 years ago
- ☆10Dec 15, 2023Updated 2 years ago
- Step-by-step optimization of CUDA SGEMM☆460Mar 30, 2022Updated 4 years ago
- Jumpstart your custom DNN accelerator today. This project holds scripts to build and start containers that can compile binaries to the ze…☆10Jun 17, 2020Updated 5 years ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆53Aug 8, 2024Updated last year
- Embedded graphics library to create beautiful UIs for any MCU, MPU and display type.☆11Apr 29, 2024Updated 2 years ago
- https://bbuf.github.io/gpu-glossary-zh/☆27Nov 7, 2025Updated 5 months ago
- Tutorial for (PyTorch) + (C++) + (Metal shader)☆16Oct 25, 2025Updated 6 months ago
- Awesome code, projects, books, etc. related to CUDA☆35Mar 30, 2026Updated last month
- ☆18Jan 4, 2024Updated 2 years ago
- ☆28Oct 11, 2022Updated 3 years ago
- STM32 NFC NXP MFRC630, CLRC663, ISO14443A, ISO14443A-4, ISO7816-4 APDU☆15Mar 23, 2023Updated 3 years ago
- Implementations of Multiple View Geometry in Computer Vision and some extended algorithms.☆11Sep 25, 2021Updated 4 years ago
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units.☆25Dec 10, 2025Updated 4 months ago
- Triton Compiler related materials.☆44Mar 16, 2026Updated last month
- Vector search using only Parquet and DataFusion☆61Feb 11, 2026Updated 2 months ago
- Official code for the paper: Scaling Transformers for Discriminative Recommendation via Generative Pretraining☆28Sep 1, 2025Updated 8 months ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Aug 19, 2023Updated 2 years ago
- Cleanlab Vizzy: illustrating the core ideas behind the Cleanlab algorithm☆16Apr 19, 2023Updated 3 years ago
- A collection of Topology Methods in Deep Learning☆18Jun 19, 2020Updated 5 years ago
- Fast CUDA matrix multiplication from scratch☆1,161Sep 2, 2025Updated 8 months ago
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆16Jan 6, 2026Updated 4 months ago
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- small language models training made easy☆13Dec 15, 2024Updated last year
- Design and UVM Verification of an ALU☆13Jun 14, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- [VLDB'24] Blitzcrank is to compress in-memory, OLTP databases. It introduces a new entropy coding algorithm named Delayed Coding.☆39Sep 20, 2024Updated last year
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆481Mar 10, 2025Updated last year
- Dynamic Memory Management for Serving LLMs without PagedAttention☆482May 30, 2025Updated 11 months ago
- CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.☆19Apr 18, 2023Updated 3 years ago
- ☆43Jan 13, 2022Updated 4 years ago