EurekaLabsAI / mlp

The Multilayer Perceptron Language Model

☆578

Alternatives and similar repositories for mlp

Users who are interested in mlp are comparing it to the repositories listed below.

EurekaLabsAI / micrograd
The Autograd Engine
☆656 Updated last year
EurekaLabsAI / tensor
The Tensor (or Array)
☆451 Updated last year
EurekaLabsAI / ngram
The n-gram Language Model
☆1,461 Updated last year
karpathy / nano-llama31
nanoGPT style version of Llama 3.1
☆1,438 Updated last year
ash-01xor / bpe.c
Simple byte pair encoding mechanism used for the tokenization process, written purely in C
☆137 Updated 11 months ago
ulrichstern / cuda-convnet
Alex Krizhevsky's original code from Google Code
☆199 Updated 9 years ago
Laz4rz / GPT-2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
☆172 Updated last year
MarioSieg / magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
☆643 Updated last week
kvfrans / jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
☆295 Updated last year
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆820 Updated 2 months ago
karpathy / calorie
nice and effective super simple calorie counter web app
☆101 Updated last year
smolorg / smolgrad
small auto-grad engine inspired from Karpathy's micrograd and PyTorch
☆276 Updated 11 months ago
LambdaLabsML / distributed-training-guide
Best practices & guides on how to write distributed pytorch training code
☆517 Updated this week
rkinas / triton-resources
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
☆421 Updated 7 months ago
policy-gradient / GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
☆1,638 Updated 6 months ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆917 Updated 5 months ago
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for educational purposes
☆1,863 Updated 2 months ago
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆422 Updated 3 months ago
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆167 Updated 4 months ago
1y33 / 100Days
GPU Kernels
☆203 Updated 6 months ago
changjonathanc / flex-nano-vllm
FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.
☆300 Updated 2 months ago
Quentin-Anthony / nanoMPI
Simple MPI implementation for prototyping or learning
☆286 Updated 2 months ago
karpathy / transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆144 Updated 3 years ago
karpathy / lecun1989-repro
Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world …
☆667 Updated last year
gpu-mode / profiling-cuda-in-torch
☆174 Updated last year
facebookresearch / optimizers
For optimization algorithm research and development.
☆543 Updated last week
a-hamdi / GPU
100 days of building GPU kernels!
☆521 Updated 6 months ago
bkitano / llama-from-scratch
Llama from scratch, or How to implement a paper without crying
☆579 Updated last year
mlops-discord / gpu-optimization-workshop
Slides, notes, and materials for the workshop
☆333 Updated last year
mesozoic-egg / tinygrad-notes
Tutorials on tinygrad
☆431 Updated 2 weeks ago