EurekaLabsAI / mlp
The Multilayer Perceptron Language Model
☆533Updated 5 months ago
Alternatives and similar repositories for mlp:
Users who are interested in mlp are comparing it to the repositories listed below
- The Autograd Engine☆555Updated 4 months ago
- The Tensor (or Array)☆419Updated 5 months ago
- The n-gram Language Model☆1,370Updated 5 months ago
- nanoGPT style version of Llama 3.1☆1,300Updated 5 months ago
- UNet diffusion model in pure CUDA☆596Updated 7 months ago
- Minimalistic 4D-parallelism distributed training framework for educational purposes☆670Updated this week
- Implementation of Diffusion Transformer (DiT) in JAX☆261Updated 7 months ago
- NanoGPT (124M) in 3 minutes☆2,152Updated this week
- Alex Krizhevsky's original code from Google Code☆190Updated 8 years ago
- Simple byte pair encoding mechanism used for the tokenization process, written purely in C☆122Updated 2 months ago
- An ML Systems Onboarding list☆664Updated this week
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆171Updated this week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆756Updated this week
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆510Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆833Updated last week
- Small autograd engine inspired by Karpathy's micrograd and PyTorch☆244Updated 2 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments because I have the memory of a goldfish☆167Updated 5 months ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆163Updated this week
- nice and effective super simple calorie counter web app☆92Updated 7 months ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa…☆196Updated 3 weeks ago
- Training Large Language Models to Reason in a Continuous Latent Space☆735Updated this week
- For optimization algorithm research and development.☆486Updated last week
- Puzzles for learning Triton☆1,328Updated 2 months ago
- ☆106Updated 3 weeks ago
- Recipes to scale inference-time compute of open models☆971Updated last week
- Fast bare-bones BPE for modern tokenizer training☆142Updated 3 months ago
- A bibliography and survey of the papers surrounding o1☆1,076Updated 2 months ago
- Solve puzzles to improve your tinygrad skills!☆102Updated 4 months ago
- Minimalistic large language model 3D-parallelism training☆1,400Updated this week