krupadav3 / Encoder-Block-in-CUDALinks
Here's all my Python/Numba (CUDA) code for the encoder block I made :)
☆63Updated last month
Alternatives and similar repositories for Encoder-Block-in-CUDA
Users that are interested in Encoder-Block-in-CUDA are comparing it to the libraries listed below
Sorting:
- ☆160Updated 2 weeks ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆218Updated 5 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆268Updated 6 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆76Updated last month
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆184Updated last week
- Learnings and programs related to CUDA☆402Updated 3 months ago
- Fine tune Gemma 3 on an object detection task☆43Updated this week
- Question paper of courses taught at IISC as part of MTech AI curriculum☆65Updated 6 months ago
- ☆328Updated last month
- ☆89Updated last month
- ☆39Updated 3 weeks ago
- coding CUDA everyday!☆31Updated last month
- ☆35Updated last week
- This repository contain the simple llama3 implementation in pure jax.☆64Updated 3 months ago
- ☆46Updated 2 months ago
- aesthetic tensor visualiser☆22Updated last month
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated 2 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆67Updated 2 months ago
- 100 days of learning & making kernels in cuda / triton☆22Updated 2 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 7 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆196Updated last month
- GPU Kernels☆178Updated last month
- A curated list of awesome mobile machine learning resources.☆137Updated 6 years ago
- ☆179Updated this week
- lossily compress representation vectors using product quantization☆54Updated last month
- Inference Llama 2 in C++☆43Updated last year
- look how they massacred my boy☆63Updated 7 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆58Updated last week
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Updated 2 months ago
- a tiny vectorstore implementation built with numpy.☆62Updated last year