Relaxed-System-Lab/HexiScale

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Relaxed-System-Lab/HexiScale)

Relaxed-System-Lab / HexiScale

Accommodating Large Language Model Training over Heterogeneous Environment.

☆32

Alternatives and similar repositories for HexiScale

Users that are interested in HexiScale are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Relaxed-System-Lab / HexGen
View on GitHub
[ICML 2024] Serving LLMs on heterogeneous decentralized clusters.
☆37May 6, 2024Updated 2 years ago
AFDWang / Hetu-Galvatron
View on GitHub
Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you hav…
☆25Oct 22, 2025Updated 8 months ago
eth-easl / sailor
View on GitHub
AI model training on heterogeneous, geo-distributed resources
☆46Nov 24, 2025Updated 7 months ago
ZongqianLi / 500xCompressor
View on GitHub
[ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models
☆65Mar 9, 2026Updated 4 months ago
Giotyp / GPU-Roofline-Python
View on GitHub
☆16Apr 28, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ParCIS / Chimera
View on GitHub
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
☆72Mar 20, 2025Updated last year
DS3Lab / Decentralized_FM_alpha
View on GitHub
☆18May 4, 2023Updated 3 years ago
ZhenyuSun-Walker / Awesome-Text-to-3D-Plus
View on GitHub
Collection of recent methods on 3D Scene Generation from Text Description.
☆17Mar 3, 2025Updated last year
alpa-projects / tensorflow-alpa
View on GitHub
☆23May 10, 2023Updated 3 years ago
Youhe-Jiang / IJCAI2023-OptimalShardedDataParallel
View on GitHub
[IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any inte…
☆52May 31, 2023Updated 3 years ago
SchrodingerZhu / ahash-cxx
View on GitHub
A variant of Ahash written in C++.
☆10Mar 20, 2023Updated 3 years ago
PanZaifeng / FastTree-Artifact
View on GitHub
☆32Mar 24, 2025Updated last year
PositionalHidden / PositionalHidden
View on GitHub
To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …
☆12Jun 18, 2024Updated 2 years ago
NEO-MLSys25 / NEO
View on GitHub
NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading
☆99Jun 16, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lyj20071013 / Triton-FlashAttention
View on GitHub
This repository contains multiple implementations of Flash Attention optimized with Triton kernels, showcasing progressive performance im…
☆11Mar 26, 2026Updated 3 months ago
EleutherAI / radioactive-lab
View on GitHub
Adapting the "Radioactive Data" paper to work for text models
☆13Dec 23, 2020Updated 5 years ago
DS3Lab / AC-SGD
View on GitHub
Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**.
☆29Apr 25, 2023Updated 3 years ago
Networked-System-and-Security-Group / Themis
View on GitHub
ICNP'25-THEMIS: Addressing Congestion-Induced Unfairness in Long-Haul RDMA Networks
☆16Jun 27, 2026Updated 3 weeks ago
shengkai16 / ONCache
View on GitHub
ONCache: A Cache-Based Low-Overhead Container Overlay Network
☆21Jun 7, 2025Updated last year
digs-uwo / dcsim
View on GitHub
DCSim: A Data Centre Simulation Tool for Evaluating Dynamic Virtualized Resource Management
☆20Apr 29, 2018Updated 8 years ago
lzhangbv / acpsgd
View on GitHub
[ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning
☆10Apr 28, 2023Updated 3 years ago
codecaution / EvoMoE
View on GitHub
☆21Oct 31, 2022Updated 3 years ago
apl-cornell / jif
View on GitHub
Java-like Language with Static Information Flow Types
☆14May 5, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
agiresearch / CoRE
View on GitHub
LLM as Interpreter for Natural Language Programming, Pseudo-code Programming and Flow Programming of AI Agents
☆48Jul 24, 2024Updated last year
DicardoX / Research-Space
View on GitHub
This repository is established to store personal notes and annotated papers during daily research.
☆200Jun 28, 2026Updated 3 weeks ago
cwfparsonson / trafpy
View on GitHub
Network traffic in Python.
☆24Mar 14, 2023Updated 3 years ago
JelixLi / Tetris
View on GitHub
☆19Jan 10, 2023Updated 3 years ago
ezelikman / justonebyte
View on GitHub
☆10Jun 19, 2023Updated 3 years ago
DS3Lab / DT-FM
View on GitHub
☆94Jul 3, 2022Updated 4 years ago
slipegg / LGDCloudSim
View on GitHub
LGDCloudSim is a resource management simulation system for large-scale geographically distributed cloud data center scenarios.
☆16Mar 6, 2026Updated 4 months ago
WalterBabyRudin / Courseware
View on GitHub
☆11Jan 12, 2021Updated 5 years ago
unist-ssl / JABAS
View on GitHub
"JABAS: Joint Adaptive Batching and Automatic Scaling for DNN Training on Heterogeneous GPUs" (EuroSys '25)
☆16Apr 7, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
PKU-DAIR / Hetu
View on GitHub
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
☆339Dec 13, 2025Updated 7 months ago
microsoft / glinthawk
View on GitHub
An LLM inference engine, written in C++
☆20Mar 30, 2026Updated 3 months ago
MingSun-Tse / smilelogging
View on GitHub
Python logging package for easy reproducible experimenting in research
☆41Jul 29, 2025Updated 11 months ago
mingukkang / MNIST-Tensorflow-Code
View on GitHub
It contains Data Augmentaion, Strided convolution, Batch Normalization, Leaky Relu, Global Average pooling, L2 Regularization, learning …
☆12Jun 3, 2018Updated 8 years ago
chuny9743 / AI4WaterEnv
View on GitHub
Webpage: https://chuny9743.github.io/AI4WaterEnv_Webpage/
☆35Apr 14, 2026Updated 3 months ago
cansik / open-opal
View on GitHub
Examples to control the Opal C1 from within python.
☆17May 7, 2023Updated 3 years ago
awslabs / optimizing-multitask-training-through-dynamic-pipelines
View on GitHub
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆19Dec 8, 2023Updated 2 years ago