Accommodating Large Language Model Training over Heterogeneous Environment.
☆25Mar 13, 2025Updated 11 months ago
Alternatives and similar repositories for HexiScale
Users that are interested in HexiScale are comparing it to the libraries listed below
Sorting:
- [ICML 2024] Serving LLMs on heterogeneous decentralized clusters.☆34May 6, 2024Updated last year
- Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs). If you hav…☆23Oct 22, 2025Updated 4 months ago
- ☆15Apr 28, 2023Updated 2 years ago
- This repository is the official implementation of Topology-Informed Graph Transformer (Choi et al., GRaM Workshop at ICML 2024).☆12Dec 28, 2024Updated last year
- [KDD Explore'24]Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities☆17May 7, 2025Updated 9 months ago
- Efficient misspecification uncertainties for linear regression☆16Feb 19, 2026Updated last week
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- ☆11Sep 16, 2024Updated last year
- A variant of Ahash written in C++.☆10Mar 20, 2023Updated 2 years ago
- Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image☆12May 10, 2025Updated 9 months ago
- Code for ASE'24 paper "B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests"☆11Sep 10, 2024Updated last year
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆13Mar 11, 2025Updated 11 months ago
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime☆13Feb 12, 2024Updated 2 years ago
- Improving word mover’s distance by leveraging self-attention matrix (Published in EMNLP 2023 Findings)☆10Jun 17, 2025Updated 8 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- An LLM inference engine, written in C++☆18Feb 5, 2026Updated 3 weeks ago
- ☆10Feb 12, 2024Updated 2 years ago
- USTC-TD☆12Mar 17, 2025Updated 11 months ago
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- ☆16Mar 14, 2024Updated last year
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.☆13Apr 10, 2024Updated last year
- Java-like Language with Static Information Flow Types☆13May 5, 2025Updated 9 months ago
- Bazel defs and rules for building Python projects with nanobind extensions.☆12Feb 4, 2026Updated 3 weeks ago
- ENRICH: multi-purposE dataset for beNchmaRking In Computer vision and pHotogrammetry☆11Mar 13, 2023Updated 2 years ago
- ☆10Sep 21, 2024Updated last year
- Pioneering the design of materials to harness heat.☆26Jan 13, 2022Updated 4 years ago
- LGDCloudSim is a resource management simulation system for large-scale geographically distributed cloud data center scenarios.☆15Nov 15, 2025Updated 3 months ago
- Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"☆11Aug 15, 2024Updated last year
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆47Jul 12, 2024Updated last year
- Parallelized pytorch implementation of iCEM☆14Apr 9, 2024Updated last year
- SelecMix: Debiased Learning by Contradicting-pair Sampling (NeurIPS 2022)☆13Jun 5, 2024Updated last year
- ☆13Jan 15, 2025Updated last year
- ☆13Apr 7, 2025Updated 10 months ago
- A companion for the Causal Artificial Intelligence book.☆15Sep 24, 2025Updated 5 months ago
- Official Code for the NeurIPS 2024 paper "FactorSim: Generative Simulation via Factorized Representation"☆14Sep 26, 2024Updated last year
- A cross-lingual COVID-19 fake news dataset☆14Oct 14, 2021Updated 4 years ago
- Tool for 3D sketch dataset collection☆13Jan 27, 2020Updated 6 years ago
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"☆16May 31, 2024Updated last year
- [arXiv 2024] Is Oracle Pruning the True Oracle?☆26Jan 10, 2025Updated last year