☆17Dec 9, 2022Updated 3 years ago
Alternatives and similar repositories for harmony
Users that are interested in harmony are comparing it to the libraries listed below
Sorting:
- ☆38Jan 15, 2021Updated 5 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- ☆40Nov 28, 2022Updated 3 years ago
- Python Scritpt which can be embedded into PyTorch model to print the model size.☆19Apr 19, 2021Updated 4 years ago
- Artifacts for our SIGCOMM'22 paper Muri☆43Dec 29, 2023Updated 2 years ago
- ☆52Dec 13, 2022Updated 3 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆27Oct 13, 2024Updated last year
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆58Aug 21, 2024Updated last year
- ☆31Feb 22, 2024Updated 2 years ago
- A framework for pipelined computing on GPU☆30Jul 17, 2019Updated 6 years ago
- ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.☆27Jul 6, 2023Updated 2 years ago
- Near-optimal Prefetching System☆33Nov 17, 2021Updated 4 years ago
- Molecule's artifact for ASPLOS'22☆29Feb 16, 2022Updated 4 years ago
- Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.☆70Mar 20, 2025Updated 11 months ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆127May 9, 2022Updated 3 years ago
- The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…☆11Mar 3, 2023Updated 3 years ago
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).☆40Sep 10, 2024Updated last year
- ☆84Dec 2, 2022Updated 3 years ago
- Continuous Pipelined Speculative Decoding☆16Jan 4, 2026Updated 2 months ago
- Anchored Diffusion Language Model (NeurIPS 2025)☆27Oct 13, 2025Updated 4 months ago
- Implementation of the TFHE homomorphic encryption scheme.☆12May 14, 2021Updated 4 years ago
- liblcm1602 for raspberry pi☆13Sep 25, 2014Updated 11 years ago
- LITS: An Optimized Learned Index for Strings☆13Jun 18, 2025Updated 8 months ago
- ☆10Dec 10, 2024Updated last year
- Rust CLI tool for syncing Claude Code conversation history across machines using git repositories.☆20Feb 24, 2026Updated last week
- Automatic ReLU Reduction☆15Dec 20, 2023Updated 2 years ago
- Disco Stochastic Network Calculator☆10Aug 15, 2017Updated 8 years ago
- Quantization of LLMs and benchmarking.☆10Apr 3, 2024Updated last year
- Repository to go along with the paper "Plumber: Diagnosing and Removing Performance Bottlenecks in Machine Learning Data Pipelines"☆10Mar 31, 2022Updated 3 years ago
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- Memory management simulator, using Hashed Page Table. Page Replacement Algorithms: Least Recently Used (LRU) and Second Chance.☆10Apr 12, 2021Updated 4 years ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- 飞桨模型加密库☆10Nov 13, 2021Updated 4 years ago
- NVMesh Container Storage Interface (CSI) Driver for Kubernetes☆11Oct 7, 2024Updated last year
- Ephemeral distributed filesystem build up from the local storage of several nodes. It is an evolution of AdaFS done inside the NGIO proje…☆37Feb 10, 2022Updated 4 years ago
- ☆11Jun 9, 2023Updated 2 years ago
- Verilog RTL Implementation of DNN☆10Jun 26, 2018Updated 7 years ago
- A project that patch the xiaomi linux system which can connect to chatGPT with WebRTC and Websocket☆10Aug 29, 2025Updated 6 months ago
- sgx-based encrypted deduplication prototype☆14May 14, 2021Updated 4 years ago