ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
☆40Sep 10, 2024Updated last year
Alternatives and similar repositories for cachew
Users who are interested in cachew are comparing it to the libraries listed below.
- Accelerating Deep Learning Training Through Transparent Storage Tiering (CCGrid'22)☆19Dec 13, 2022Updated 3 years ago
- ☆18Mar 15, 2020Updated 5 years ago
- A Filesystem Semi-Microkernel.☆46Oct 24, 2023Updated 2 years ago
- ☆10May 16, 2021Updated 4 years ago
- 🕹 Implementation for the course Compiling Engineering (Spring 2020) at Peking University, adapted from the UCLA CS 132 project.☆10Jun 21, 2020Updated 5 years ago
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- Chaitin-Briggs register-allocation algorithm (LLVM back-end)☆12Jan 6, 2016Updated 10 years ago
- ☆22Nov 7, 2018Updated 7 years ago
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Oct 7, 2020Updated 5 years ago
- Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo☆11Feb 12, 2023Updated 3 years ago
- A fault-tolerant RDMA-based disaggregated key-value store with 1-RTT UPDATEs and GETs thanks to the SWARM replication protocol☆14Sep 25, 2024Updated last year
- Paper list for acceleration of transformers☆14Jul 1, 2023Updated 2 years ago
- SFS: A Smart OS Scheduler for Serverless Function Workloads (SC'22)☆13Dec 15, 2022Updated 3 years ago
- Deft: A Scalable Tree Index for Disaggregated Memory☆23Apr 23, 2025Updated 10 months ago
- Near-optimal Prefetching System☆33Nov 17, 2021Updated 4 years ago
- ☆16Sep 4, 2023Updated 2 years ago
- ☆56Jan 25, 2021Updated 5 years ago
- [ACM SoCC'22] Pisces: Efficient Federated Learning via Guided Asynchronous Training☆13Apr 28, 2025Updated 9 months ago
- Code for Double Blind Collaborative Learning (DBCL)☆14May 14, 2021Updated 4 years ago
- This repository is the official implementation of 'EDEN: Communication-Efficient and Robust Distributed Mean Estimation for Federated Lea…☆14Aug 2, 2022Updated 3 years ago
- HW/SW co-designed end-host RPC stack☆20Oct 28, 2021Updated 4 years ago
- ☆15Jan 21, 2023Updated 3 years ago
- Framework of PA code for the THU compiler principles course.☆13Dec 18, 2019Updated 6 years ago
- SmartNIC☆14Dec 13, 2018Updated 7 years ago
- Switch-based Training Acceleration for Machine Learning (SwitchML)☆16Apr 13, 2021Updated 4 years ago
- ☆18Dec 11, 2023Updated 2 years ago
- ☆15Jul 13, 2021Updated 4 years ago
- A simple containerized application management system like Kubernetes, but written in Rust☆19Jun 25, 2022Updated 3 years ago
- egraphs-good website☆18Oct 9, 2024Updated last year
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- Elixir: Train a Large Language Model on a Small GPU Cluster☆15Jun 8, 2023Updated 2 years ago
- Official repository for "IPDPS'24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression☆20Jul 30, 2024Updated last year
- ☆17Dec 9, 2022Updated 3 years ago
- ☆20Nov 7, 2023Updated 2 years ago
- ☆38Jan 15, 2021Updated 5 years ago
- Serverless Deep Learning with TensorFlow and AWS Lambda, published by Packt☆25Jan 18, 2021Updated 5 years ago
- ☆13Feb 22, 2023Updated 3 years ago
- ☆19Jul 26, 2021Updated 4 years ago