krupadav3 / Encoder-Block-in-CUDALinks
Here's all my Python/Numba (CUDA) code for the encoder block I made :)
☆64Updated last month
Alternatives and similar repositories for Encoder-Block-in-CUDA
Users that are interested in Encoder-Block-in-CUDA are comparing it to the libraries listed below
Sorting:
- ☆161Updated this week
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆271Updated 7 months ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆218Updated 5 months ago
- ☆46Updated 2 months ago
- GPU Kernels☆182Updated last month
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆185Updated 3 weeks ago
- Learnings and programs related to CUDA☆407Updated 4 months ago
- coding CUDA everyday!☆34Updated 2 months ago
- Fine tune Gemma 3 on an object detection task☆57Updated this week
- ☆343Updated 2 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆80Updated last month
- Question paper of courses taught at IISC as part of MTech AI curriculum☆66Updated 6 months ago
- a tiny vectorstore implementation built with numpy.☆62Updated last year
- learning & making kernels in cuda / triton☆21Updated 2 weeks ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆60Updated last month
- Canny edge detector implemented in CUDA C/C++☆27Updated 4 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆67Updated 3 months ago
- ☆89Updated 2 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 8 months ago
- ☆39Updated last month
- ☆41Updated last month
- Learning about CUDA by writing PTX code.☆132Updated last year
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆30Updated this week
- A curated list of awesome mobile machine learning resources.☆139Updated 6 years ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆173Updated 10 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆281Updated last week
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 7 months ago
- ☆98Updated last year
- SIMD quantization kernels☆71Updated last week
- Transformers from scratch using PyTorch & NumPy.☆26Updated 4 months ago