CisMine / GPU-in-ML-DL
Apply GPU in ML and DL
☆48Updated last month
Alternatives and similar repositories for GPU-in-ML-DL:
Users that are interested in GPU-in-ML-DL are comparing it to the libraries listed below
- ☆212Updated this week
- GPU Kernels☆157Updated this week
- 100 days of building GPU kernels!☆321Updated this week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆169Updated last week
- ☆142Updated 2 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆313Updated 2 weeks ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆308Updated last month
- Some CUDA example code with READMEs.☆93Updated 3 weeks ago
- Learnings and programs related to CUDA☆370Updated last month
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆110Updated 2 months ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆52Updated 7 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆154Updated last week
- ☆234Updated 2 months ago
- NVIDIA tools guide☆119Updated 2 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆250Updated 4 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆213Updated 2 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆174Updated last year
- A c/c++ implementation of micrograd: a tiny autograd engine with neural net on top.☆66Updated last year
- ☆67Updated last year
- ☆41Updated 3 weeks ago
- Question paper of courses taught at IISC as part of MTech AI curriculum☆58Updated 3 months ago
- CUDA Learning guide☆349Updated 9 months ago
- 100 days of learning & making kernels in cuda / triton☆20Updated 2 weeks ago
- Learning about CUDA by writing PTX code.☆125Updated last year
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆217Updated 6 months ago
- ☆40Updated 2 weeks ago
- From zero to hero CUDA for accelerating maths and machine learning on GPU.☆181Updated this week
- ☆152Updated last year
- UNet diffusion model in pure CUDA☆600Updated 9 months ago
- High-Performance SGEMM on CUDA devices☆87Updated 2 months ago