☆232Aug 2, 2024Updated last year
Alternatives and similar repositories for Programming-Massively-Parallel-Processors
Users that are interested in Programming-Massively-Parallel-Processors are comparing it to the libraries listed below.
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆79Jan 21, 2021Updated 5 years ago
- Solutions to Programming Massively Parallel Processors☆51Jan 15, 2024Updated 2 years ago
- Create cohorts from databases utilizing the OMOP CDM☆10May 19, 2025Updated last year
- Six major CUDA parallel computing patterns: code and notes☆63Jul 30, 2020Updated 5 years ago
- ☆49Apr 15, 2024Updated 2 years ago
- Material for gpu-mode lectures☆6,098May 9, 2026Updated 2 weeks ago
- Optimized Parallel Tiled Approach to perform Matrix Multiplication by taking advantage of the lower latency, higher bandwidth shared memo…☆16Sep 24, 2017Updated 8 years ago
- ☆14Mar 8, 2025Updated last year
- Single-header C++ implementation of a Z-order octree data structure☆11May 3, 2021Updated 5 years ago
- Dissecting NVIDIA GPU Architecture☆121Jul 11, 2022Updated 3 years ago
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆147Jul 2, 2021Updated 4 years ago
- ☆10Dec 15, 2023Updated 2 years ago
- Step-by-step optimization of CUDA SGEMM☆466Mar 30, 2022Updated 4 years ago
- Jumpstart your custom DNN accelerator today. This project holds scripts to build and start containers that can compile binaries to the ze…☆10Jun 17, 2020Updated 5 years ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆53Aug 8, 2024Updated last year
- CUDA GPU Benchmark☆38Jan 31, 2025Updated last year
- Awesome code, projects, books, etc. related to CUDA☆36Mar 30, 2026Updated last month
- For popular software systems, the number of daily submitted bug reports is high. Triaging these incoming bugs is a time consuming task. M…☆11Jan 8, 2016Updated 10 years ago
- Verilog implementation of an SPI slave interface. Initially targeted for Atlys devkit (Xilinx Spartan-6) controlled by TotalPhase Cheetah…☆41Jan 8, 2025Updated last year
- Implementations of Multiple View Geometry in Computer Vision and some extended algorithms.☆11Sep 25, 2021Updated 4 years ago
- This repo is designed for my CUDA course projects☆46Mar 20, 2025Updated last year
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units.☆25Dec 10, 2025Updated 5 months ago
- A reimplementation of LIIF (Learning Continuous Image Representation with Local Implicit Image Function) using lightning+hydra☆11Mar 26, 2021Updated 5 years ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Aug 19, 2023Updated 2 years ago
- Cleanlab Vizzy: illustrating the core ideas behind the Cleanlab algorithm☆16Apr 19, 2023Updated 3 years ago
- A collection of Topology Methods in Deep Learning☆18Jun 19, 2020Updated 5 years ago
- Fast CUDA matrix multiplication from scratch☆1,196Sep 2, 2025Updated 8 months ago
- [ICML 2025] Adaptive Self-improvement LLM Agentic System for ML Library Development☆17Jan 6, 2026Updated 4 months ago
- Design and UVM Verification of an ALU☆13Jun 14, 2024Updated last year
- moved to https://github.com/Zhaoyilunnn/qdao☆10Aug 30, 2023Updated 2 years ago
- ☆12Mar 28, 2024Updated 2 years ago
- Tapeouts done using OpenFASOC☆19Nov 3, 2025Updated 6 months ago
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- [VLDB'24] Blitzcrank is to compress in-memory, OLTP databases. It introduces a new entropy coding algorithm named Delayed Coding.☆39Sep 20, 2024Updated last year
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆484Mar 10, 2025Updated last year
- Dynamic Memory Management for Serving LLMs without PagedAttention☆485May 30, 2025Updated 11 months ago
- CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.☆19Apr 18, 2023Updated 3 years ago
- ☆43Jan 13, 2022Updated 4 years ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆762Jun 18, 2025Updated 11 months ago