cfregly / ai-performance-engineeringLinks
☆141Updated last week
Alternatives and similar repositories for ai-performance-engineering
Users that are interested in ai-performance-engineering are comparing it to the libraries listed below
Sorting:
- Slides, notes, and materials for the workshop☆332Updated last year
- Introduction to Ray Core Design Patterns and APIs.☆72Updated last year
- Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo☆465Updated 2 weeks ago
- Some CUDA example code with READMEs.☆174Updated 7 months ago
- A collection of Machine Learning examples to get started with deploying RAPIDS in the Cloud☆143Updated 11 months ago
- Contains hands-on example code for [O'reilly book "Deep Learning At Scale"](https://www.oreilly.com/library/view/deep-learning-at/9781098…☆28Updated last year
- Scaling Python Machine Learning☆50Updated 2 years ago
- Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs☆84Updated this week
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 4 months ago
- [WIP] Examples for the Intro to ML with Kubeflow book☆206Updated 3 years ago
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆65Updated 3 months ago
- This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.☆434Updated last year
- Effective and Scalable Recommendation Systems☆59Updated last year
- Notebooks for the O'Reilly book "Learning Ray"☆329Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆194Updated 4 months ago
- Serverless Python with Ray☆58Updated 2 years ago
- A catalog of design patterns when building generative AI applications☆196Updated 3 weeks ago
- ☆73Updated last year
- Where GPUs get cooked 👩🍳🔥☆285Updated 3 weeks ago
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆153Updated 5 months ago
- NVIDIA curated collection of educational resources related to general purpose GPU programming.☆735Updated this week
- Examples of inference pipelines implemented using https://github.com/SeldonIO/seldon-core☆15Updated 2 years ago
- Qualcomm Cloud AI SDK (Platform and Apps) enable high performance deep learning inference on Qualcomm Cloud AI platforms delivering high …☆67Updated 2 months ago
- XGBoost GPU accelerated on Spark example applications☆53Updated 3 years ago
- NVIDIA Deep Learning Institute Jupyter Notebooks for Deep Learning, Data Science, and Accelerated Computing☆48Updated last year
- ☆75Updated last year
- Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with …☆52Updated 8 months ago
- ☆60Updated 2 weeks ago
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆61Updated 3 weeks ago
- Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and L…☆17Updated last year