CisMine / GPU-in-ML-DL
Apply GPU in ML and DL
☆52 Updated 2 months ago
Alternatives and similar repositories for GPU-in-ML-DL
Users who are interested in GPU-in-ML-DL compare it with the repositories listed below.
- ☆296 Updated 3 weeks ago
- 100 days of building GPU kernels! ☆399 Updated last week
- GPU Kernels ☆172 Updated last week
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast… ☆333 Updated 2 months ago
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA ☆114 Updated 3 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. ☆342 Updated last month
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆180 Updated last week
- Learnings and programs related to CUDA ☆380 Updated 2 months ago
- ☆247 Updated 3 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch ☆189 Updated last week
- NVIDIA tools guide ☆132 Updated 4 months ago
- ☆159 Updated 4 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS ☆169 Updated last month
- Some CUDA example code with READMEs. ☆97 Updated 2 months ago
- A C/C++ implementation of micrograd: a tiny autograd engine with a neural net on top. ☆67 Updated last year
- A small autograd engine inspired by Karpathy's micrograd and PyTorch ☆268 Updated 5 months ago
- CUDA learning guide ☆366 Updated 10 months ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa… ☆216 Updated 4 months ago
- ☆52 Updated last week
- Visualization of cache-optimized matrix multiplication ☆120 Updated last month
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆27 Updated last week
- Slides, notes, and materials for the workshop ☆325 Updated 11 months ago
- Repo of paper implementations ☆19 Updated 2 months ago
- ☆153 Updated 9 months ago
- Learning about CUDA by writing PTX code. ☆128 Updated last year
- An implementation of the transformer architecture as an NVIDIA CUDA kernel ☆180 Updated last year
- ☆155 Updated last year
- Assignments for courses taught at IISc as part of the MTech AI curriculum ☆111 Updated 2 months ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.) ☆52 Updated 9 months ago
- CUDA tutorials and Maths & ML tutorials with examples, covering multi-GPU, fused attention, Winograd convolution, and reinforcement learning. ☆181 Updated 3 weeks ago