SwayamInSync / pytorch-cpp-cuda-starterLinks

Setting up Vscode to work with Pytorch in C/C++ with CUDA support

☆25

Alternatives and similar repositories for pytorch-cpp-cuda-starter

Users that are interested in pytorch-cpp-cuda-starter are comparing it to the libraries listed below

Sorting:

JINO-ROHIT / advanced_ml
☆52Updated 3 weeks ago
kmohan321 / Research_Papers
☆46Updated 3 months ago
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆411Updated 2 weeks ago
0xD4rky / Vision-Transformers
This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…
☆227Updated 6 months ago
AniruddhaChattopadhyay / Books
☆161Updated 3 weeks ago
tugot17 / pmpp
Complete solutions to the Programming Massively Parallel Processors Edition 4
☆196Updated 3 weeks ago
naklecha / llm-inference-optimizations-explained
in this repository, i'm going to implement increasingly complex llm inference optimizations
☆63Updated last month
victor-explore / AI-Q-Papers-IISC-Banglore
Question paper of courses taught at IISC as part of MTech AI curriculum
☆66Updated 7 months ago
hkproj / 100-days-of-gpu
☆349Updated 3 months ago
apoorvnandan / lilgrad
pytorch from scratch in pure C/CUDA and python
☆40Updated 9 months ago
cloneofsimo / ptx-tutorial-by-aislop
PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)
☆66Updated 3 months ago
krupadav3 / Encoder-Block-in-CUDA
Here's all my Python/Numba (CUDA) code for the encoder block I made :)
☆65Updated 2 months ago
kabir2505 / tiny-mixtral
☆42Updated 2 months ago
unixpickle / learn-ptx
Learning about CUDA by writing PTX code.
☆133Updated last year
mitutitu16 / Awesome-Mobile-Machine-Learning
A curated list of awesome mobile machine learning resources.
☆142Updated 6 years ago
rkinas / triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆378Updated 4 months ago
MarioSieg / magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
☆555Updated last week
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆87Updated 2 weeks ago
SwekeR-463 / kernels
learning & making kernels in cuda / triton
☆22Updated last month
KhawajaAbaid / micrograd_c
Andrej Kapathy's micrograd implemented in c
☆29Updated 11 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆68Updated 2 months ago
1y33 / 100Days
GPU Kernels
☆188Updated 2 months ago
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆188Updated last month
ivanfioravanti / prompt-eng-ollama-interactive-tutorial
Ollama's Interactive Prompt Engineering Tutorial
☆250Updated 7 months ago
Snektron / gpumode-amd-fp8-mm
My submission for the GPUMODE/AMD fp8 mm challenge
☆27Updated last month
AlexBodner / How_Much_VRAM
☆101Updated 10 months ago
loganwatchorn / notes-pmpp
Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)
☆52Updated 11 months ago
Laz4rz / GPT-2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
☆173Updated 11 months ago
smolorg / smolgrad
small auto-grad engine inspired from Karpathy's micrograd and PyTorch
☆272Updated 7 months ago
AmeyaWagh / llama2.cpp
Inference Llama 2 in C++
☆43Updated last year