stas00 / ml-engineering
Machine Learning Engineering Open Book
☆10,986Updated this week
Related projects: ⓘ
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆9,780Updated this week
- llama3 implementation one matrix multiplication at a time☆13,085Updated 3 months ago
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆37,120Updated last month
- 🔥Highlighting the top ML papers every week.☆9,910Updated last week
- Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step☆26,767Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,031Updated 2 months ago
- DSPy: The framework for programming—not prompting—foundation models☆16,773Updated this week
- Awesome-LLM: a curated list of Large Language Model☆17,413Updated this week
- A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API☆10,011Updated last month
- 📋 A list of open LLMs available for commercial use.☆10,912Updated 2 months ago
- Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory☆15,611Updated this week
- A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)☆9,282Updated 3 months ago
- Go ahead and axolotl questions☆7,554Updated this week
- MLX: An array framework for Apple silicon☆16,445Updated this week
- LLM101n: Let's build a Storyteller☆28,302Updated last month
- Neural Networks: Zero to Hero☆11,524Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆26,822Updated this week
- Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datase…☆11,582Updated last week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆36,216Updated last month
- Explanation to key concepts in ML☆7,029Updated this week
- A guidance language for controlling large language models.☆18,698Updated this week
- LLM training in simple, raw C/CUDA☆23,287Updated this week
- Python SDK, Proxy Server to call 100+ LLM APIs using the OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker,…☆12,231Updated this week
- OCR, layout analysis, reading order, line detection in 90+ languages☆9,849Updated this week
- All things prompt engineering☆5,349Updated 3 months ago
- Structured Text Generation☆8,241Updated this week
- ☆9,299Updated last month
- Convert PDF to markdown quickly with high accuracy☆16,438Updated last week
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆19,845Updated last month
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆7,620Updated 4 months ago