krupadav3 / Encoder-Block-in-CUDALinks

Here's all my Python/Numba (CUDA) code for the encoder block I made :)

☆65

Alternatives and similar repositories for Encoder-Block-in-CUDA

Users that are interested in Encoder-Block-in-CUDA are comparing it to the libraries listed below

Sorting:

AniruddhaChattopadhyay / Books
☆163Updated last month
smolorg / smolgrad
small auto-grad engine inspired from Karpathy's micrograd and PyTorch
☆274Updated 8 months ago
0xD4rky / Vision-Transformers
This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…
☆228Updated 7 months ago
kmohan321 / Research_Papers
☆46Updated 4 months ago
tugot17 / pmpp
Complete solutions to the Programming Massively Parallel Processors Edition 4
☆450Updated last month
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆83Updated 3 months ago
goyalpramod / Foundational-ML-papers
Implementations of Papers that I read, you can read my breakdown in my blog
☆78Updated 2 weeks ago
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆414Updated last month
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆318Updated 2 weeks ago
usamec / lowmem_finetuning
Low memory full parameter finetuning of LLMs
☆52Updated 2 weeks ago
huggingface / huggingface-gemma-recipes
Inference, Fine Tuning and many more recipes with Gemma family of models
☆262Updated 2 weeks ago
AI-Hypercomputer / RecML
☆186Updated this week
YuvrajSingh-mist / Reinforcement-Learning
☆80Updated this week
MarioSieg / magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
☆558Updated this week
SwayamInSync / pytorch-cpp-cuda-starter
Setting up Vscode to work with Pytorch in C/C++ with CUDA support
☆25Updated 6 months ago
JINO-ROHIT / advanced_ml
☆59Updated last week
kmohan321 / LLMs
☆89Updated 4 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 5 months ago
shivendrra / SmallLanguageModel
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
☆148Updated last year
hkproj / 100-days-of-gpu
☆358Updated 3 months ago
ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆74Updated 3 weeks ago
victor-explore / AI-Q-Papers-IISC-Banglore
Question paper of courses taught at IISC as part of MTech AI curriculum
☆69Updated 8 months ago
atullchaurasia / transformers
Transformers from scratch using PyTorch & NumPy.
☆42Updated 5 months ago
saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contain the simple llama3 implementation in pure jax.
☆68Updated 5 months ago
cneuralnetwork / ML-Project-CLI
a simple CLI command that will create a template of a generic ML Project
☆81Updated 10 months ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆283Updated this week
kanpuriyanawab / picograd
Rust Implementation of micrograd
☆52Updated last year
ulrichstern / cuda-convnet
Alex Krizhevsky's original code from Google Code
☆195Updated 9 years ago
SwekeR-463 / kernels
learning & making kernels in cuda / triton
☆22Updated last month
Quentin-Anthony / torch-profiling-tutorial
☆447Updated 2 weeks ago