SwayamInSync / pytorch-cpp-cuda-starter
Setting up Vscode to work with Pytorch in C/C++ with CUDA support
☆25Updated last month
Alternatives and similar repositories for pytorch-cpp-cuda-starter:
Users that are interested in pytorch-cpp-cuda-starter are comparing it to the libraries listed below
- ☆40Updated 2 weeks ago
- ☆32Updated last month
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆213Updated 2 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆60Updated last week
- 100 days of learning & making kernels in cuda / triton☆20Updated 2 weeks ago
- Coding an LLM and its building blocks from scratch.☆22Updated this week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆169Updated last week
- Question paper of courses taught at IISC as part of MTech AI curriculum☆58Updated 3 months ago
- working implimention of deepseek MLA☆38Updated 2 months ago
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆52Updated 7 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆91Updated 3 weeks ago
- ☆92Updated 3 months ago
- ☆31Updated 6 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 10 months ago
- Learnings and programs related to CUDA☆370Updated last month
- ☆85Updated 6 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 5 months ago
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆21Updated 8 months ago
- Learning about CUDA by writing PTX code.☆125Updated last year
- Train transformer language models with reinforcement learning.☆18Updated last month
- pytorch from scratch in pure C/CUDA and python☆40Updated 5 months ago
- A really tiny autograd engine☆90Updated 11 months ago
- In this repository I have a code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :)☆23Updated 4 months ago
- GPU Kernels☆157Updated this week
- ☆99Updated 7 months ago
- ☆20Updated last month
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆56Updated last week
- ☆18Updated last week
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆21Updated last month
- model activation visualiser☆90Updated this week