w3hbi / Fundamentals_of_Accelerated_Computing_with_CUDA_Python
Practice exercises and assessments for NVIDIA DLI's "Fundamentals of Accelerated Computing with CUDA Python" course.
☆18 Updated last year
Alternatives and similar repositories for Fundamentals_of_Accelerated_Computing_with_CUDA_Python
Users who are interested in Fundamentals_of_Accelerated_Computing_with_CUDA_Python are comparing it to the repositories listed below.
- 100 days of building GPU kernels! ☆462 Updated 2 months ago
- making the official triton tutorials actually comprehensible ☆48 Updated 4 months ago
- Notes on quantization in neural networks ☆90 Updated last year
- GPU Kernels ☆190 Updated 2 months ago
- NdLinear by Ensemble is a drop-in PyTorch module that shrinks your models with no accuracy loss. It powers the Ensemble Platform—upload a… ☆302 Updated last month
- ☆350 Updated 3 months ago
- ☆180 Updated 6 months ago
- ☆125 Updated 10 months ago
- Some CUDA example code with READMEs. ☆169 Updated 4 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. ☆379 Updated 4 months ago
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems ☆20 Updated 2 weeks ago
- The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025] ☆53 Updated last month
- Apply GPU in ML and DL ☆52 Updated 5 months ago
- ☆68 Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch ☆309 Updated this week
- ☆43 Updated 2 months ago
- vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....) ☆43 Updated 2 weeks ago
- Making of cuda kernel ☆16 Updated last month
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS ☆196 Updated 2 months ago
- ☆61 Updated last month
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast… ☆360 Updated 4 months ago
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev… ☆62 Updated 3 weeks ago
- E2E AutoML Model Compression Package ☆46 Updated 4 months ago
- CUDA Learning guide ☆403 Updated last year
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs ☆38 Updated 8 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆188 Updated last month
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆30 Updated this week
- ☆43 Updated last month
- ☆64 Updated this week
- ☆46 Updated 3 months ago