krupadav3 / Encoder-Block-in-CUDALinks
Here's all my Python/Numba (CUDA) code for the encoder block I made :)
☆65Updated 2 months ago
Alternatives and similar repositories for Encoder-Block-in-CUDA
Users that are interested in Encoder-Block-in-CUDA are comparing it to the libraries listed below
Sorting:
- ☆161Updated 3 weeks ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆227Updated 6 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆272Updated 7 months ago
- Learnings and programs related to CUDA☆411Updated 2 weeks ago
- ☆349Updated 3 months ago
- ☆46Updated 3 months ago
- GPU Kernels☆188Updated 2 months ago
- Complete solutions to the Programming Massively Parallel Processors Edition 4☆196Updated 3 weeks ago
- This repository contain the simple llama3 implementation in pure jax.☆67Updated 4 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆81Updated 2 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆309Updated 3 weeks ago
- a tiny vectorstore implementation built with numpy.☆62Updated last year
- ☆186Updated 2 weeks ago
- coding CUDA everyday!☆35Updated 2 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆188Updated last month
- Fine tune Gemma 3 on an object detection task☆69Updated this week
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆63Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆223Updated last week
- Alex Krizhevsky's original code from Google Code☆194Updated 9 years ago
- Question paper of courses taught at IISC as part of MTech AI curriculum☆66Updated 7 months ago
- ☆323Updated this week
- ☆40Updated last month
- learning & making kernels in cuda / triton☆22Updated last month
- ☆89Updated 3 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 4 months ago
- ☆97Updated last year
- (WIP) A small but powerful, homemade PyTorch from scratch.☆555Updated last week
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆52Updated 11 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Updated 3 months ago
- making the official triton tutorials actually comprehensible☆45Updated 3 months ago