krupadav3 / Encoder-Block-in-CUDA
Here's all my Python/Numba (CUDA) code for the encoder block I made :)
☆66Updated 3 months ago
Alternatives and similar repositories for Encoder-Block-in-CUDA
Users that are interested in Encoder-Block-in-CUDA are comparing it to the libraries listed below
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa…☆226Updated 7 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆278Updated 9 months ago
- ☆238Updated last week
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆471Updated 2 months ago
- Learnings and programs related to CUDA☆415Updated last month
- ☆64Updated this week
- ☆46Updated 4 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆324Updated 2 weeks ago
- ☆89Updated 4 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated this week
- a tiny vectorstore implementation built with numpy.☆63Updated last year
- GPU Kernels☆193Updated 4 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 10 months ago
- Implementations of Papers that I read, you can read my breakdown in my blog☆81Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆266Updated last month
- ☆471Updated 3 weeks ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆623Updated this week
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7)☆66Updated 5 months ago
- ☆44Updated 3 months ago
- ☆362Updated 4 months ago
- Low memory full parameter finetuning of LLMs☆52Updated last month
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆53Updated last year
- A really tiny autograd engine☆95Updated 3 months ago
- Question papers of courses taught at IISc as part of the MTech AI curriculum☆71Updated 8 months ago
- a simple CLI command that will create a template of a generic ML Project☆82Updated 10 months ago
- Andrej Karpathy's micrograd implemented in C☆29Updated last year
- This repository contains a simple llama3 implementation in pure JAX.☆68Updated 6 months ago
- ☆96Updated last year
- Fine tune Gemma 3 on an object detection task☆77Updated last month