ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
☆40Sep 10, 2024Updated last year
Alternatives and similar repositories for cachew
Users that are interested in cachew are comparing it to the libraries listed below
Sorting:
- Near-optimal Prefetching System☆33Nov 17, 2021Updated 4 years ago
- ☆10Aug 9, 2021Updated 4 years ago
- Accelerating Deep Learning Training Through Transparent Storage Tiering (CCGrid'22)☆19Dec 13, 2022Updated 3 years ago
- Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo☆12Feb 12, 2023Updated 3 years ago
- ☆20Nov 7, 2023Updated 2 years ago
- ☆12Nov 8, 2024Updated last year
- ☆22Nov 7, 2018Updated 7 years ago
- ☆15Jan 21, 2023Updated 3 years ago
- ☆31Feb 21, 2021Updated 5 years ago
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Oct 7, 2020Updated 5 years ago
- ☆56Jan 25, 2021Updated 5 years ago
- Rust-like error handling in Go☆14Dec 4, 2023Updated 2 years ago
- An RDMA-powered, fast, and scalable Paxos protocol☆26Jun 15, 2019Updated 6 years ago
- ☆79Mar 7, 2022Updated 4 years ago
- ☆16Sep 4, 2023Updated 2 years ago
- ☆11Sep 9, 2022Updated 3 years ago
- ☆38Jan 15, 2021Updated 5 years ago
- SFS: A Smart OS Scheduler for Serverless Function Workloads (SC'22)☆13Dec 15, 2022Updated 3 years ago
- ☆18Dec 11, 2023Updated 2 years ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆127May 9, 2022Updated 3 years ago
- ☆10May 16, 2021Updated 4 years ago
- ☆10May 4, 2023Updated 2 years ago
- HW/SW co-designed end-host RPC stack☆20Oct 28, 2021Updated 4 years ago
- Chaitin-Briggs register-allocation algorithm (LLVM back-end)☆12Jan 6, 2016Updated 10 years ago
- egraphs-good website☆18Mar 10, 2026Updated last week
- Distributed ML Training Benchmarks☆27Mar 1, 2023Updated 3 years ago
- 🕹 Implementation for the lesson Compiling Engineering(2020 Spring) in Peking University, adjusted from UCLA CS 132 Project.☆10Jun 21, 2020Updated 5 years ago
- Deft: A Scalable Tree Index for Disaggregated Memory☆23Apr 23, 2025Updated 10 months ago
- Old Probabilistically Bounded Staleness (PBS) analysis for Cassandra (see http://www.bailis.org/blog/using-pbs-in-cassandra-1.2.0/)☆29Jul 10, 2012Updated 13 years ago
- Accelerating Recommender model training by leveraging popular choices -- VLDB 2022☆31Sep 15, 2024Updated last year
- Paper list for accleration of transformers☆14Jul 1, 2023Updated 2 years ago
- ☆17Dec 9, 2022Updated 3 years ago
- 基于FPGA实现用户态中断硬件机制与优化操作系统内核☆10Apr 1, 2025Updated 11 months ago
- Framework of pa code for THU compiler principle course.☆13Dec 18, 2019Updated 6 years ago
- ☆23Jun 21, 2023Updated 2 years ago
- ☆13Feb 22, 2023Updated 3 years ago
- ☆15Aug 16, 2021Updated 4 years ago
- Elixir: Train a Large Language Model on a Small GPU Cluster☆15Jun 8, 2023Updated 2 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆58Aug 21, 2024Updated last year