EurekaLabsAI / ngram
The n-gram Language Model
☆1,410Updated 8 months ago
Alternatives and similar repositories for ngram:
Users that are interested in ngram are comparing it to the libraries listed below
- The Autograd Engine☆595Updated 7 months ago
- The Multilayer Perceptron Language Model☆545Updated 8 months ago
- The Tensor (or Array)☆429Updated 8 months ago
- nanoGPT style version of Llama 3.1☆1,352Updated 8 months ago
- NanoGPT (124M) in 3 minutes☆2,465Updated last week
- UNet diffusion model in pure CUDA☆600Updated 9 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆997Updated 2 months ago
- Video+code lecture on building nanoGPT from scratch☆4,030Updated 8 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆783Updated last month
- System 2 Reasoning Link Collection☆824Updated 3 weeks ago
- An ML Systems Onboarding list☆750Updated 2 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆982Updated last month
- DataComp for Language Models☆1,274Updated 3 weeks ago
- ☆865Updated last year
- If tinygrad wasn't small enough for you...☆709Updated last year
- What would you do with 1000 H100s...☆1,029Updated last year
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,561Updated this week
- Tutorials on tinygrad☆361Updated 2 weeks ago
- An autoregressive character-level language model for making more things☆2,989Updated 10 months ago
- Textbook on reinforcement learning from human feedback☆535Updated this week
- A PyTorch native library for large model training☆3,562Updated this week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,912Updated last month
- (WIP) A small but powerful, homemade PyTorch from scratch.☆544Updated this week
- From the Tensor to Stable Diffusion, a rough outline for a 1 week course.☆1,057Updated last week
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world …☆629Updated last year
- gpt-2 from scratch in mlx☆380Updated 10 months ago
- ☆209Updated this week
- Entropy Based Sampling and Parallel CoT Decoding☆3,348Updated 4 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆860Updated last month
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆173Updated 8 months ago