intel / ai-containers
This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow and PyTorch that have been optimized for Intel platforms. Scaling is done with python, Docker, kubernetes, kubeflow, cnvrg.io, Helm, and other container orchestration frameworks for use in the cloud and on-prem…
☆27Updated this week
Related projects ⓘ
Alternatives and complementary repositories for ai-containers
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆11Updated 9 months ago
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆155Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆153Updated this week
- OpenVINO Tokenizers extension☆25Updated this week
- AMD related optimizations for transformer models☆57Updated 3 weeks ago
- A curated list of OpenVINO based AI projects☆109Updated 2 weeks ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆11Updated last year
- Large Language Model Text Generation Inference on Habana Gaudi☆27Updated last week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆38Updated 3 weeks ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆89Updated this week
- The no-code AI toolchain☆75Updated this week
- ☆76Updated this week
- ☆17Updated this week
- Source Code and Usage Samples for the Resources hosted in the NVIDIA AI Enterprise AzureML Registry☆17Updated 3 months ago
- AI Assistant running within your browser.☆45Updated 3 weeks ago
- ☆55Updated this week
- Libraries and tools to support Transfer Learning☆18Updated last month
- ☆39Updated 2 months ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆57Updated 2 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆103Updated last week
- ☆40Updated last month
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆29Updated 6 months ago
- A GPU-driven system framework for scalable AI applications☆109Updated last month
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆63Updated this week
- Setup and Installation Instructions for Habana binaries, docker image creation☆23Updated last month
- ☆30Updated this week
- An Awesome list of oneAPI projects☆126Updated 3 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆110Updated this week
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 4 months ago