karpathy / ng-video-lectureLinks
☆3,973Updated last year
Alternatives and similar repositories for ng-video-lecture
Users that are interested in ng-video-lecture are comparing it to the libraries listed below
Sorting:
- An autoregressive character-level language model for making more things☆3,094Updated 11 months ago
- Video+code lecture on building nanoGPT from scratch☆4,127Updated 9 months ago
- An unnecessarily tiny implementation of GPT-2 in NumPy.☆3,360Updated 2 years ago
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆12,014Updated 9 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,662Updated 11 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆41,517Updated 5 months ago
- Neural Networks: Zero to Hero☆13,870Updated 9 months ago
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆21,966Updated 9 months ago
- The n-gram Language Model☆1,421Updated 9 months ago
- Inference Llama 2 in one file of pure C☆18,421Updated 9 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,446Updated 11 months ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,060Updated 8 months ago
- Robust recipes to align language models with human and AI preferences☆5,196Updated last month
- nanoGPT style version of Llama 3.1☆1,373Updated 9 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,652Updated last year
- LLM training in simple, raw C/CUDA☆26,706Updated 3 weeks ago
- Solve puzzles. Improve your pytorch.☆3,569Updated 10 months ago
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models☆3,044Updated 10 months ago
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM☆7,832Updated last month
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,500Updated last week
- 🧠 A study guide to learn about Transformers☆1,590Updated last year
- Machine Learning Engineering Open Book☆13,840Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆14,662Updated 2 months ago
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"☆12,008Updated 5 months ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,874Updated last year
- ☆2,819Updated this week
- A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)☆9,906Updated last year
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,927Updated 11 months ago
- ☆745Updated 11 months ago
- LLM101n: Let's build a Storyteller☆33,506Updated 10 months ago