intelligent-machine-learning/dlrover

DLRover: An Automatic Distributed Deep Learning System

☆1,673

Alternatives and similar repositories for dlrover

Users who are interested in dlrover are comparing it to the libraries listed below.

antgroup / glake
GLake: optimizing GPU memory management and IO transmission.
☆501 · Mar 24, 2025 · Updated last year
volcengine / veScale
Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs
☆1,031 · Mar 3, 2026 · Updated 4 months ago
kvcache-ai / Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆5,968 · Updated this week
NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
☆17,181 · Updated this week
intelligent-machine-learning / tfplus
An extension library of TensorFlow to accelerate industrial recommendation system model training
☆19 · Nov 27, 2024 · Updated last year
intelligent-machine-learning / atorch
An industrial extension library of PyTorch to accelerate large-scale model training
☆62 · Aug 13, 2025 · Updated 11 months ago
MingXiangL / AttentionShift
Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation
☆155 · Oct 18, 2024 · Updated last year
sql-machine-learning / elasticdl
Kubernetes-native Deep Learning Framework
☆744 · Jan 26, 2024 · Updated 2 years ago
bytedance / flux
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
☆1,345 · Aug 28, 2025 · Updated 10 months ago
SiyangLi99 / open-alteryx-macro
Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…
☆156 · May 25, 2024 · Updated 2 years ago
alibaba / Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
☆1,586 · Dec 15, 2025 · Updated 7 months ago
NVIDIA / TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H…
☆3,441 · Updated this week
deepspeedai / Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆2,257 · Aug 14, 2025 · Updated 11 months ago
shenjunjiekoda / knight
knight is a static analysis tool for C/C++ programs.
☆213 · Dec 27, 2024 · Updated last year
elleryqueenhomels / AI_for_Atari
Deep Reinforcement Learning Algorithms for solving Atari 2600 Games
☆143 · Mar 23, 2023 · Updated 3 years ago
CGCL-codes / YiTu
YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…
☆254 · Jan 7, 2026 · Updated 6 months ago
kubedl-io / kubedl
Run your deep learning workloads on Kubernetes more easily and efficiently.
☆532 · Mar 4, 2024 · Updated 2 years ago
MingXiangL / DEVIL
Evaluation of Text-to-Video Generation Models: A Dynamics Perspective [NeurIPS 2024].
☆274 · Dec 3, 2024 · Updated last year
elleryqueenhomels / google_sketcher
Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project.
☆143 · Mar 23, 2023 · Updated 3 years ago
xdit-project / xDiT
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
☆2,659 · Jul 14, 2026 · Updated last week
volcano-sh / volcano
A Cloud Native Batch System (Project under CNCF)
☆5,803 · Updated this week
Rhythm-Byte / SchemaDiff
☆246 · Nov 24, 2024 · Updated last year
flashinfer-ai / flashinfer
FlashInfer: Kernel Library for LLM Serving
☆6,014 · Updated this week
Credit-card-monitoring-and-fraud-check / Credit_card_monitoring_and_check
A code repository designed to show the best GitHub has to offer.
☆165 · Jun 30, 2024 · Updated 2 years ago
alibaba / Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
☆666 · Jan 2, 2024 · Updated 2 years ago
PeiranLi0930 / TorchProject
☆249 · Jul 19, 2023 · Updated 3 years ago
jtun-coder / JtunRouter
It is an Android-based application that enables managing hotspot properties through a web interface, providing mobile routing functionali…
☆156 · Jul 14, 2026 · Updated last week
wYaobiz / awesome-self-sovereign-identity
An awesome list of self-sovereign identity resources.
☆137 · Jul 9, 2024 · Updated 2 years ago
PeiranLi0930 / L-SVD
Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition
☆306 · Aug 18, 2024 · Updated last year
ZivJia / hmi-workspace
A workspace for HMI tools
☆163 · Jul 11, 2024 · Updated 2 years ago
pentilm / FactAI
Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour…
☆147 · Jun 2, 2023 · Updated 3 years ago
weiwensangsang / golang-internal
This project features optimized Go language, expert source code, concurrent processing, and industry-best practices.
☆142 · Mar 14, 2023 · Updated 3 years ago
ByteDance-Seed / Triton-distributed
Distributed Compiler based on Triton for Parallel Systems
☆1,498 · Updated this week
banggx / morgana-form
Morgana survey form editor: low-code rapid form building, AI form generation, and form data collection and statistics
☆147 · Jun 21, 2026 · Updated last month
NaishengZhang / book-recommendation-system
Book Recommendation System
☆234 · May 2, 2024 · Updated 2 years ago
NVIDIA / nccl
Optimized primitives for collective multi-GPU communication
☆4,902 · Updated this week
NVIDIA / nccl-tests
NCCL Tests
☆1,600 · Jul 9, 2026 · Updated 2 weeks ago
Project-HAMi / HAMi
Heterogeneous GPU Sharing on Kubernetes
☆4,042 · Updated this week
corescriptions / indexer
Inscriptions on CoreDao, powered by Insdexer.
☆147 · Mar 20, 2024 · Updated 2 years ago