DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆14Jan 8, 2026Updated 4 months ago
Alternatives and similar repositories for DeepSpeed
Users who are interested in DeepSpeed are comparing it to the libraries listed below.
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆18Dec 19, 2024Updated last year
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆42Feb 3, 2025Updated last year
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆210Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆88May 5, 2026Updated 2 weeks ago
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Dec 2, 2024Updated last year
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆172Jan 8, 2026Updated 4 months ago
- Large Language Model Text Generation Inference on Habana Gaudi☆34Mar 20, 2025Updated last year
- ☆14Mar 1, 2025Updated last year
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆65Jun 30, 2025Updated 10 months ago
- Demo on iGPU for FFmpeg decode and scale, plus OpenVINO inference. This is a zero-copy solution, which means no frame data is copied from CPU to iGPU…☆17Jan 25, 2023Updated 3 years ago
- ☆11Jun 29, 2021Updated 4 years ago
- ☆20Apr 9, 2019Updated 7 years ago
- ☆60Mar 6, 2026Updated 2 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆130Sep 23, 2025Updated 7 months ago
- LeetCode plugin code debugging template.☆13Apr 11, 2025Updated last year
- oneAPI Level Zero Conformance & Performance test content☆61Updated this week
- 🇰🇷 Repository for the PyTorch Korea User Group website 🇰🇷☆22Updated this week
- ☆28Jan 7, 2023Updated 3 years ago
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆64Sep 18, 2025Updated 8 months ago
- Computation using data flow graphs for scalable machine learning☆68Updated this week
- ☆20Mar 27, 2023Updated 3 years ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- Fast and memory-efficient exact attention☆20Updated this week
- ☆61Dec 18, 2024Updated last year
- A conda-smithy repository for ambertools.☆11Apr 23, 2026Updated 3 weeks ago
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- oneAPI Level Zero Specification Headers and Loader☆317May 11, 2026Updated last week
- ☆30May 13, 2026Updated last week
- Data Files for "Deep diversification of an AAV capsid protein by machine learning"☆18Mar 9, 2021Updated 5 years ago
- PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolution…☆19Jan 22, 2026Updated 3 months ago
- Assets for AnyLabeling app☆14May 5, 2023Updated 3 years ago
- Intel® XeSS Plugin for Unity* Engine☆38Mar 30, 2026Updated last month
- MAD (Model Automation and Dashboarding)☆36May 11, 2026Updated last week
- Ongoing research training transformer models at scale☆40Updated this week
- Intel® End-to-End AI Optimization Kit☆31Jul 18, 2024Updated last year
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆149May 7, 2026Updated last week
- The Intel® Automated Self-Checkout Reference Package provides critical components required to build and deploy a self-checkout use case u…☆32Updated this week
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆28Sep 5, 2024Updated last year
- ☆16Oct 27, 2024Updated last year