Accommodating Large Language Model Training over Heterogeneous Environment.
☆32Mar 13, 2025Updated last year
Alternatives and similar repositories for HexiScale
Users that are interested in HexiScale are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters.☆37May 6, 2024Updated 2 years ago
- Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you hav…☆25Oct 22, 2025Updated 8 months ago
- ☆15Apr 28, 2023Updated 3 years ago
- Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.☆72Mar 20, 2025Updated last year
- [TMLR 2026] Is Oracle Pruning the True Oracle?☆26Jun 20, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆18May 4, 2023Updated 3 years ago
- Collection of recent methods on 3D Scene Generation from Text Description.☆16Mar 3, 2025Updated last year
- [IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any inte…☆52May 31, 2023Updated 3 years ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆12Jun 18, 2024Updated 2 years ago
- ☆23May 10, 2023Updated 3 years ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**.☆29Apr 25, 2023Updated 3 years ago
- ☆19Jan 10, 2023Updated 3 years ago
- ☆21Oct 31, 2022Updated 3 years ago
- Compression for Foundation Models☆36Jul 21, 2025Updated 11 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [ICML 2026] Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning☆33Sep 12, 2025Updated 9 months ago
- A Triton-only attention backend for vLLM☆26Mar 17, 2026Updated 3 months ago
- LGDCloudSim is a resource management simulation system for large-scale geographically distributed cloud data center scenarios.☆16Mar 6, 2026Updated 3 months ago
- This repository is established to store personal notes and annotated papers during daily research.☆199Jun 15, 2026Updated 2 weeks ago
- Terminal UI for NVIDIA Nsight Systems profiles — timeline viewer, kernel navigator, NVTX hierarchy☆60Jun 18, 2026Updated last week
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 3 years ago
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆48Jul 12, 2024Updated last year
- This is for SIGMOD submission "Learning-based Progressive Cardinality Estimation for End-to-end Query Execution"☆21Feb 9, 2023Updated 3 years ago
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading☆98Jun 16, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MultiArchKernelBench: A Multi-Platform Benchmark for Kernel Generation☆58Jun 18, 2026Updated last week
- ONCache: A Cache-Based Low-Overhead Container Overlay Network☆21Jun 7, 2025Updated last year
- ☆10Jun 19, 2023Updated 3 years ago
- Dotfile management with bare git☆22Jun 8, 2026Updated 3 weeks ago
- ☆94Jul 3, 2022Updated 3 years ago
- AI model training on heterogeneous, geo-distributed resources☆44Nov 24, 2025Updated 7 months ago
- Python logging package for easy reproducible experimenting in research☆41Jul 29, 2025Updated 11 months ago
- It contains Data Augmentaion, Strided convolution, Batch Normalization, Leaky Relu, Global Average pooling, L2 Regularization, learning …☆12Jun 3, 2018Updated 8 years ago
- Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines☆19Dec 8, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆25May 26, 2021Updated 5 years ago
- Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- Examples for MS-AMP package.☆30Jul 17, 2025Updated 11 months ago
- Demo code for CVPR2023 paper "Sparsifiner: Learning Sparse Instance-Dependent Attention for Efficient Vision Transformers"☆15Jul 4, 2023Updated 2 years ago
- Bayesian Wi-Fi rate control☆21Nov 30, 2016Updated 9 years ago
- Easy design, testing, and deployment of optical data center networks for everyone.☆72May 7, 2026Updated last month
- Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs☆36Sep 21, 2025Updated 9 months ago