krupadav3 / Encoder-Block-in-CUDA
Here's all my Python/Numba (CUDA) code for the encoder block I made :)
☆60Updated last week
Alternatives and similar repositories for Encoder-Block-in-CUDA:
Users who are interested in Encoder-Block-in-CUDA are comparing it to the repositories listed below
- ☆80Updated 2 weeks ago
- ☆45Updated last month
- ☆87Updated last month
- GPU Kernels☆172Updated last week
- This repository contains a simple Llama 3 implementation in pure JAX.☆63Updated 2 months ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa…☆216Updated 4 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆180Updated last week
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7)☆65Updated last month
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆174Updated 9 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated last month
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆189Updated last week
- ☆177Updated this week
- pytorch from scratch in pure C/CUDA and python☆40Updated 6 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆73Updated this week
- 100 days of learning & making kernels in cuda / triton☆22Updated last month
- Inference Llama 2 in C++☆44Updated last year
- Learnings and programs related to CUDA☆380Updated 2 months ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Updated last month
- ☆85Updated 7 months ago
- Learning about CUDA by writing PTX code.☆128Updated last year
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆27Updated last week
- Question papers from courses taught at IISc as part of the MTech AI curriculum☆62Updated 5 months ago
- Setting up VS Code to work with PyTorch in C/C++ with CUDA support☆25Updated 3 months ago
- I learn about and explain quantization☆26Updated last year
- So, I trained a 130M Llama architecture I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated last month
- Coding an LLM and its building blocks from scratch.☆34Updated last month
- An introduction to LLM Sampling☆77Updated 4 months ago
- making the official triton tutorials actually comprehensible☆27Updated last month
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆171Updated this week
- ☆16Updated 3 months ago