☆224Aug 2, 2024Updated last year
Alternatives and similar repositories for Programming-Massively-Parallel-Processors
Users that are interested in Programming-Massively-Parallel-Processors are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆77Jan 21, 2021Updated 5 years ago
- Solution of Programming Massively Parallel Processors☆49Jan 15, 2024Updated 2 years ago
- Create cohorts from databases utilizing the OMOP CDM☆10May 19, 2025Updated 10 months ago
- CUDA 6大并行计算模式 代码与笔记☆61Jul 30, 2020Updated 5 years ago
- ☆49Apr 15, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Class of High Performance Computing taken at U.T.P 2017☆120Oct 11, 2017Updated 8 years ago
- Material for gpu-mode lectures☆5,865Feb 1, 2026Updated last month
- Solution to Cartpole balancing problem with the help of reinforcement learning and Deep Neural Networks.☆11May 5, 2023Updated 2 years ago
- Optimized Parallel Tiled Approach to perform Matrix Multiplication by taking advantage of the lower latency, higher bandwidth shared memo…☆16Sep 24, 2017Updated 8 years ago
- ☆14Mar 8, 2025Updated last year
- Single-header C++ implementation of a Z-order octree data structure☆11May 3, 2021Updated 4 years ago
- Dissecting NVIDIA GPU Architecture☆118Jul 11, 2022Updated 3 years ago
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆142Jul 2, 2021Updated 4 years ago
- ☆10Dec 15, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Jumpstart your custom DNN accelerator today. This project holds scripts to build and start containers that can compile binaries to the ze…☆10Jun 17, 2020Updated 5 years ago
- Embedded graphics library to create beautiful UIs for any MCU, MPU and display type.☆11Apr 29, 2024Updated last year
- CUDA GPU Benchmark☆37Jan 31, 2025Updated last year
- Awesome code, projects, books, etc. related to CUDA☆31Feb 3, 2026Updated last month
- ☆18Jan 4, 2024Updated 2 years ago
- Verilog implementation of an SPI slave interface. Intially targetted for Atlys devkit (Xilinx Spartan-6) controlled by TotalPhase Cheetah…☆41Jan 8, 2025Updated last year
- STM32 NFC NXP MFRC630, CLRC663, ISO14443A, ISO14443A-4, ISO7816-4 APDU☆15Mar 23, 2023Updated 3 years ago
- This repo is designed for the my cuda course projects☆43Mar 20, 2025Updated last year
- Implementations of Multiple View Geometry in Computer Vision and some extended algorithms.☆11Sep 25, 2021Updated 4 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Official code for the paper: Scaling Transformers for Discriminative Recommendation via Generative Pretraining☆26Sep 1, 2025Updated 6 months ago
- Triton Compiler related materials.☆42Mar 16, 2026Updated last week
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units.☆24Dec 10, 2025Updated 3 months ago
- Step-by-step optimization of CUDA SGEMM☆448Mar 30, 2022Updated 3 years ago
- Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.☆10Aug 19, 2023Updated 2 years ago
- Cleanlab Vizzy: illustrating the core ideas behind the Cleanlab algorithm☆16Apr 19, 2023Updated 2 years ago
- A collection of Topology Methods in Deep Learning☆18Jun 19, 2020Updated 5 years ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆20Mar 9, 2026Updated 2 weeks ago
- Tapeouts done using OpenFASOC☆17Nov 3, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated 2 months ago
- small language models training made easy☆13Dec 15, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- [VLDB'24] Blitzcrank is to compress in-memory, OLTP databases. It introduces a new entropy coding algorithm named Delayed Coding.☆39Sep 20, 2024Updated last year
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆466Mar 10, 2025Updated last year
- ☆27Sep 1, 2025Updated 6 months ago
- Dynamic Memory Management for Serving LLMs without PagedAttention☆466May 30, 2025Updated 9 months ago