cfregly / ai-performance-engineeringLinks
☆131Updated 3 weeks ago
Alternatives and similar repositories for ai-performance-engineering
Users that are interested in ai-performance-engineering are comparing it to the libraries listed below
Sorting:
- Slides, notes, and materials for the workshop☆331Updated last year
- Some CUDA example code with READMEs.☆172Updated 6 months ago
- Introduction to Ray Core Design Patterns and APIs.☆71Updated last year
- Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo☆467Updated 2 months ago
- ☆75Updated last year
- Contains hands-on example code for [O'reilly book "Deep Learning At Scale"](https://www.oreilly.com/library/view/deep-learning-at/9781098…☆28Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆193Updated 3 months ago
- Scaling Python Machine Learning☆50Updated 2 years ago
- Source code for "Enginneering Deep Learning Platforms"☆53Updated 4 months ago
- ☆73Updated last year
- O'Reilly book - Building Machine Learning Systems with a feature store: batch, real-time, and LLMs☆40Updated 2 weeks ago
- A collection of Machine Learning examples to get started with deploying RAPIDS in the Cloud☆143Updated 10 months ago
- Files for my PyTorch book☆22Updated 4 months ago
- See how to augment LLMs with real-time data for dynamic, context-aware apps - Rag + Agents + GraphRAG.☆136Updated last month
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆343Updated this week
- A catalog of design patterns when building generative AI applications☆186Updated last week
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆704Updated 3 weeks ago
- Pretrain Vision and Large Language Models in Python, Published by Packt☆88Updated last year
- [WIP] Examples for the Intro to ML with Kubeflow book☆206Updated 3 years ago
- GPU Kernels☆193Updated 4 months ago
- A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.☆160Updated this week
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS☆221Updated 4 months ago
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆153Updated 4 months ago
- Effective and Scalable Recommendation Systems☆60Updated last year
- Transformer Architectures for Generative AI☆89Updated last month
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆191Updated this week
- 100 days of building GPU kernels!☆499Updated 4 months ago
- Where GPUs get cooked 👩🍳🔥☆282Updated last week
- Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high …☆66Updated last month