intel / ai-containers
This repository contains Dockerfiles, scripts, YAML files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow and PyTorch that have been optimized for Intel platforms. Scaling is done with Python, Docker, Kubernetes, Kubeflow, cnvrg.io, Helm, and other container orchestration frameworks for use in the cloud and on-prem…
☆42 Updated this week
Alternatives and similar repositories for ai-containers:
Users who are interested in ai-containers are comparing it to the repositories listed below:
- OpenVINO Tokenizers extension ☆32 Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU (XPU) devices. Note… ☆62 Updated last month
- AMD-related optimizations for transformer models ☆75 Updated 5 months ago
- Large Language Model Text Generation Inference on Habana Gaudi ☆32 Updated last month
- Local LLM Server with NPU Acceleration ☆156 Updated this week
- ☆84 Updated last week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray ☆125 Updated 3 weeks ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools ☆39 Updated last month
- Developer kits reference setup scripts for various kinds of Intel platforms and GPUs ☆23 Updated this week
- ☆41 Updated 6 months ago
- Estimating hardware and cloud costs of LLMs and transformer projects ☆14 Updated last year
- Setup and installation instructions for Habana binaries and Docker image creation ☆25 Updated last month
- oneAPI Specification source files ☆200 Updated this week
- ☆18 Updated this week
- ☆46 Updated last week
- A repository of Dockerfiles, scripts, YAML files, Helm charts, etc. used to build and scale the sample AI workflows with Python, kubernet… ☆11 Updated last year
- ☆38 Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆86 Updated this week
- An NVIDIA AI Workbench example project for exploring the RAPIDS cuDF library ☆15 Updated 9 months ago
- ☆29 Updated this week
- Use safetensors with ONNX 🤗 ☆54 Updated last month
- Evaluation, benchmark, and scorecard targeting performance on throughput and latency, accuracy on popular evaluation harnesses, safety… ☆29 Updated this week
- Intel® SHMEM - Device initiated shared memory based communication library ☆23 Updated 3 weeks ago
- For individual users, watsonx Code Assistant can access a local IBM Granite model ☆30 Updated 2 months ago
- ☆14 Updated this week
- An Awesome list of oneAPI projects ☆141 Updated 4 months ago
- ☆84 Updated this week
- Bandwidth test for ROCm ☆54 Updated last week
- GenAI Studio is a low-code platform to enable users to construct, evaluate, and benchmark GenAI applications. The platform also provides c… ☆37 Updated last week
- Explainable AI Tooling (XAI). XAI is used to discover and explain a model's prediction in a way that is interpretable to the user. Releva… ☆38 Updated 2 weeks ago