DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆14Jan 8, 2026Updated 3 months ago
Alternatives and similar repositories for DeepSpeed
Users that are interested in DeepSpeed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆18Dec 19, 2024Updated last year
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆209Apr 16, 2026Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆86Updated this week
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Dec 2, 2024Updated last year
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆171Jan 8, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Large Language Model Text Generation Inference on Habana Gaudi☆34Mar 20, 2025Updated last year
- ☆24Oct 9, 2025Updated 6 months ago
- ☆14Mar 1, 2025Updated last year
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆65Jun 30, 2025Updated 9 months ago
- ☆17Apr 21, 2026Updated last week
- Novel image segmentation datasets collected from endoscopic videos of sinus surgery processes☆13Feb 11, 2023Updated 3 years ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆130Sep 23, 2025Updated 7 months ago
- LeetCode plugin code debuging template.☆13Apr 11, 2025Updated last year
- A Prot paper related materials☆11Sep 5, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆63Sep 18, 2025Updated 7 months ago
- Computation using data flow graphs for scalable machine learning☆68Updated this week
- ☆20Mar 27, 2023Updated 3 years ago
- Fast and memory-efficient exact attention☆20Updated this week
- ☆14May 25, 2022Updated 3 years ago
- ☆61Dec 18, 2024Updated last year
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- [EMNLP 2022] Official Pytorch implementation for "Tiny-NewsRec: Efficient and Effective PLM-based News Recommendation"☆18Sep 18, 2023Updated 2 years ago
- ☆19Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆13Nov 25, 2021Updated 4 years ago
- ☆34Dec 22, 2025Updated 4 months ago
- oneAPI Level Zero Specification Headers and Loader☆315Apr 22, 2026Updated last week
- ☆30Apr 23, 2026Updated last week
- PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolution…☆19Jan 22, 2026Updated 3 months ago
- Useful tutorials and recipes for developers doing low-level work with the Graphcore IPU☆21Jun 29, 2022Updated 3 years ago
- Ongoing research training transformer models at scale☆39Updated this week
- Train deepseek r1-like reasoning LLM with ease | 轻松训练1个deepseek r1类的推理LLM☆19Feb 15, 2025Updated last year
- A Gradio Web UI for running local LLM on Intel GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) using IPEX-LLM.☆18Apr 19, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Oct 27, 2024Updated last year
- Democratizing AlphaFold3: an PyTorch reimplementation to accelerate protein structure prediction☆21May 24, 2025Updated 11 months ago
- ☆18Apr 23, 2025Updated last year
- GPGMM, a General-Purpose GPU Memory Management Library.☆36Feb 2, 2026Updated 2 months ago
- Intel® Tensor Processing Primitives extension for Pytorch*☆18Apr 9, 2026Updated 2 weeks ago
- oneCCL Bindings for Pytorch* (deprecated)☆104Dec 31, 2025Updated 4 months ago
- Create a concurrent video analysis pipeline featuring multistream face and human pose detection, vehicle attribute detection, and the abi…☆52Jul 22, 2024Updated last year