EurekaLabsAI / ngramLinks
The n-gram Language Model
☆1,421Updated 9 months ago
Alternatives and similar repositories for ngram
Users that are interested in ngram are comparing it to the libraries listed below
Sorting:
- The Autograd Engine☆607Updated 8 months ago
- The Multilayer Perceptron Language Model☆549Updated 9 months ago
- nanoGPT style version of Llama 3.1☆1,373Updated 9 months ago
- The Tensor (or Array)☆433Updated 9 months ago
- NanoGPT (124M) in 3 minutes☆2,600Updated this week
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆793Updated last month
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,505Updated 2 months ago
- UNet diffusion model in pure CUDA☆605Updated 11 months ago
- Video+code lecture on building nanoGPT from scratch☆4,127Updated 9 months ago
- An autoregressive character-level language model for making more things☆3,094Updated 11 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,372Updated last month
- gpt-2 from scratch in mlx☆387Updated 11 months ago
- DataComp for Language Models☆1,300Updated 2 months ago
- System 2 Reasoning Link Collection☆834Updated 2 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,011Updated last month
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆173Updated 10 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆553Updated this week
- An ML Systems Onboarding list☆789Updated 4 months ago
- High Quality Resources on GPU Programming/Architecture☆587Updated 10 months ago
- What would you do with 1000 H100s...☆1,048Updated last year
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,385Updated 4 months ago
- ☆863Updated last year
- Alex Krizhevsky's original code from Google Code☆192Updated 9 years ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,416Updated 4 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,662Updated 11 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,300Updated last month
- Tutorials on tinygrad☆378Updated 2 weeks ago
- A PyTorch native platform for training generative AI models☆3,838Updated this week
- Learnings and programs related to CUDA☆402Updated 3 months ago
- Reproducing Yann LeCun 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world …☆636Updated last year