suneeta-mall / deep_learning_at_scaleLinks
Contains hands-on example code for [O'reilly book "Deep Learning At Scale"](https://www.oreilly.com/library/view/deep-learning-at/9781098145279/).
☆26Updated 11 months ago
Alternatives and similar repositories for deep_learning_at_scale
Users that are interested in deep_learning_at_scale are comparing it to the libraries listed below
Sorting:
- Accelerate Model Training with PyTorch 2.X, published by Packt☆43Updated 11 months ago
- ML/DL Math and Method notes☆61Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆184Updated last week
- Fine-tune an LLM to perform batch inference and online serving.☆111Updated last week
- Slides, notes, and materials for the workshop☆326Updated last year
- ☆168Updated 5 months ago
- ☆190Updated 3 months ago
- making the official triton tutorials actually comprehensible☆34Updated 2 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated 8 months ago
- ☆157Updated last year
- Distributed training (multi-node) of a Transformer model☆68Updated last year
- ☆35Updated last week
- Fine tune Gemma 3 on an object detection task☆43Updated last week
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆105Updated 4 months ago
- GPU Kernels☆178Updated last month
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆46Updated last year
- The repo associated with the Manning Publication☆82Updated 2 months ago
- Reference implementation of Mistral AI 7B v0.1 model.☆29Updated last year
- Slides and recordings of talks hosted by our community☆20Updated 11 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆105Updated last year
- LoRA and DoRA from Scratch Implementations☆203Updated last year
- Repo for ML Models built from scratch such as Self-Attention, Linear +Logistic Regression, PCA, LDA. CNN, LSTM, Neural Networks using Nu…☆49Updated 4 months ago
- 100 days of building GPU kernels!☆430Updated last month
- Tutorial for Harvard Medical School ML from Scratch Series: Transformer from Scratch. Demo the usage of transformer in various domains: M…☆45Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- zero-to-lightning☆29Updated last year
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆181Updated 3 weeks ago
- PyTorch per step fault tolerance (actively under development)☆302Updated this week
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆151Updated last year
- A template to kick-start your Python project ✨🚀☆51Updated 5 months ago