A comprehensive hands-on project for learning GPU programming with CUDA and HIP, covering fundamental concepts through advanced optimization techniques.
☆35Nov 20, 2025Updated 4 months ago
Alternatives and similar repositories for gpu-programming-101
Users that are interested in gpu-programming-101 are comparing it to the libraries listed below
Sorting:
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration☆55Jan 26, 2026Updated last month
- Visualizer for large-scale and interactive ray-tracing of neurons☆10Jan 25, 2022Updated 4 years ago
- Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning☆27Jun 18, 2025Updated 9 months ago
- KFunca: A minimalist, high-performance GPU-based automatic differentiation framework☆29Aug 14, 2025Updated 7 months ago
- Finite-difference option pricer for GPU☆14Feb 29, 2024Updated 2 years ago
- Comparsion of Julia's GPU Kernel based ODE solvers with other open-source GPU ODE solvers☆28Jan 4, 2024Updated 2 years ago
- ☆16Oct 30, 2022Updated 3 years ago
- Julia HPC miniapp using parallel models (MPI.jl, CUDA.jl, AMDGPU.jl, ADIOS2.jl) and Jupyter/Pluto.jl notebooks☆24Jan 28, 2026Updated last month
- A straightforward method to reduce your LLM inference API costs and token usage.☆22May 18, 2025Updated 10 months ago
- Building a computer vision system to count and track two distinct fish species within an aquarium.☆15May 17, 2023Updated 2 years ago
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- API for Asset Service☆15Aug 15, 2024Updated last year
- XXE techniques☆14Oct 10, 2021Updated 4 years ago
- Learning Robot Geometry as Distance Fields: Applications to Whole-body Manipulation☆20Sep 4, 2024Updated last year
- Will share some interesting writeups here :)☆18Oct 18, 2023Updated 2 years ago
- implement GPT-OSS 20B & 120B C++ inference from scratch on AMD GPUs☆170Oct 25, 2025Updated 4 months ago
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated 2 weeks ago
- A Transformer Model Exploiting Histology Images and Spatial Gene Expression☆22Mar 18, 2025Updated last year
- ☆20Mar 3, 2025Updated last year
- Fetch & Filter Known URLs☆15Aug 3, 2022Updated 3 years ago
- A basic C++ Template Meta Programming☆27Oct 31, 2018Updated 7 years ago
- Publish and share Kedro-Viz static website on GitHub pages in your workflow through this GitHub action☆18Nov 25, 2025Updated 3 months ago
- Train LLM on Hugging Face infra☆69Nov 13, 2025Updated 4 months ago
- Using NVIDIA modulus for airfoil optimizations at different angles.☆25Apr 5, 2023Updated 2 years ago
- A template project for using Vue with D3.js (easy serve on github.io)☆17Mar 29, 2018Updated 7 years ago
- An example web app that display data using Altair, Vega and VueJS☆16May 24, 2018Updated 7 years ago
- Materials relating to the Swift for TensorFlow Dev Summit Presentations☆19Jan 16, 2020Updated 6 years ago
- A language model suite for numbering antigen receptor sequences.☆39Updated this week
- Umbrella will protect your shellcode from the rain.☆31Jun 4, 2025Updated 9 months ago
- WordPress Elementor 3.6.0 3.6.1 3.6.2 RCE POC☆16Apr 17, 2022Updated 3 years ago
- Comprehensive GPU specifications database with 2,824 GPUs across NVIDIA, AMD, and Intel☆66Jan 7, 2026Updated 2 months ago
- High-performance Geometric Multigrid☆40Apr 2, 2019Updated 6 years ago
- A command-line tool (and pre-commit hook) to remove print statements from your Python project.☆21Oct 12, 2023Updated 2 years ago
- ☆11Aug 10, 2021Updated 4 years ago
- output burp body only and auto pretiffy☆20May 1, 2025Updated 10 months ago
- Windows Privilege Escalation☆23Jun 7, 2022Updated 3 years ago
- [ACL 2025] NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering☆22Jul 29, 2025Updated 7 months ago
- ☆18Jun 18, 2025Updated 9 months ago
- Time-Optimal Path Following with Bounded Acceleration and Velocity☆27May 25, 2023Updated 2 years ago