SwayamInSync / pytorch-cpp-cuda-starterLinks
Setting up Vscode to work with Pytorch in C/C++ with CUDA support
☆25Updated 10 months ago
Alternatives and similar repositories for pytorch-cpp-cuda-starter
Users that are interested in pytorch-cpp-cuda-starter are comparing it to the libraries listed below
Sorting:
- ☆113Updated 2 weeks ago
- ☆46Updated 8 months ago
- learning & making kernels in cuda / triton☆22Updated 4 months ago
- Learnings and programs related to CUDA☆429Updated 5 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆228Updated 11 months ago
- Here's all my Python/Numba (CUDA) code for the encoder block I made :)☆68Updated 7 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 9 months ago
- ☆45Updated 7 months ago
- pytorch from scratch in pure C/CUDA and python☆41Updated last year
- Implementations of Papers that I read, you can read my breakdown in my blog☆89Updated 2 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆75Updated 7 months ago
- Low memory full parameter finetuning of LLMs☆53Updated 5 months ago
- Andrej Kapathy's micrograd implemented in c☆30Updated last year
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆22Updated last year
- A collection of lightweight interpretability scripts to understand how LLMs think☆71Updated this week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆121Updated 2 months ago
- ☆406Updated 8 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 8 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 6 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆662Updated last week
- Learning about CUDA by writing PTX code.☆150Updated last year
- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning☆277Updated last month
- Quantized LLM training in pure CUDA/C++.☆224Updated this week
- ☆81Updated last week
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆277Updated last year
- Inference Llama 2 in C++☆43Updated last year
- This repository contains everything you need to become proficient in NLP☆62Updated last year
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile …☆85Updated 2 months ago
- ☆101Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year