ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).
☆40 · Sep 10, 2024 · Updated last year
Alternatives and similar repositories for cachew
Users that are interested in cachew are comparing it to the libraries listed below.
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression" ☆10 · Aug 2, 2022 · Updated 3 years ago
- Near-optimal Prefetching System ☆33 · Nov 17, 2021 · Updated 4 years ago
- ☆11 · Aug 9, 2021 · Updated 4 years ago
- Accelerating Deep Learning Training Through Transparent Storage Tiering (CCGrid'22) ☆19 · Dec 13, 2022 · Updated 3 years ago
- Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo ☆11 · Feb 12, 2023 · Updated 3 years ago
- ☆20 · Nov 7, 2023 · Updated 2 years ago
- ☆12 · Nov 8, 2024 · Updated last year
- ☆22 · Nov 7, 2018 · Updated 7 years ago
- ☆15 · Jan 21, 2023 · Updated 3 years ago
- ☆31 · Feb 21, 2021 · Updated 5 years ago
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression ☆11 · Oct 7, 2020 · Updated 5 years ago
- ☆57 · Jan 25, 2021 · Updated 5 years ago
- Rust-like error handling in Go ☆14 · Dec 4, 2023 · Updated 2 years ago
- SmartNIC ☆14 · Dec 13, 2018 · Updated 7 years ago
- CUDA benchmarks for measuring GPU utilization and interference ☆16 · Feb 11, 2025 · Updated last year
- A tracing tool to analyze the I/O behavior of a program. ☆12 · Sep 25, 2019 · Updated 6 years ago
- An RDMA-powered, fast, and scalable Paxos protocol ☆26 · Jun 15, 2019 · Updated 6 years ago
- ☆16 · Sep 4, 2023 · Updated 2 years ago
- ☆11 · Sep 9, 2022 · Updated 3 years ago
- ☆38 · Jan 15, 2021 · Updated 5 years ago
- SFS: A Smart OS Scheduler for Serverless Function Workloads (SC'22) ☆13 · Dec 15, 2022 · Updated 3 years ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications ☆127 · May 9, 2022 · Updated 3 years ago
- ☆18 · Dec 11, 2023 · Updated 2 years ago
- ☆10 · May 16, 2021 · Updated 4 years ago
- HW/SW co-designed end-host RPC stack ☆20 · Oct 28, 2021 · Updated 4 years ago
- Chaitin-Briggs register-allocation algorithm (LLVM back-end) ☆12 · Jan 6, 2016 · Updated 10 years ago
- ☆20 · Jul 26, 2021 · Updated 4 years ago
- 🕹 Implementation for the lesson Compiling Engineering (2020 Spring) in Peking University, adjusted from UCLA CS 132 Project. ☆10 · Jun 21, 2020 · Updated 5 years ago
- Distributed ML Training Benchmarks ☆27 · Mar 1, 2023 · Updated 3 years ago
- Accelerating Recommender model training by leveraging popular choices -- VLDB 2022 ☆31 · Sep 15, 2024 · Updated last year
- Old Probabilistically Bounded Staleness (PBS) analysis for Cassandra (see http://www.bailis.org/blog/using-pbs-in-cassandra-1.2.0/) ☆29 · Jul 10, 2012 · Updated 13 years ago
- ☆72 · Oct 10, 2018 · Updated 7 years ago
- Paper list for acceleration of transformers ☆14 · Jul 1, 2023 · Updated 2 years ago
- ☆17 · Dec 9, 2022 · Updated 3 years ago
- The code for both the framework and experiments from the NSDI '19 paper "Loom: Flexible and Efficient NIC Packet Scheduling" ☆31 · Feb 4, 2019 · Updated 7 years ago
- FPGA-based implementation of a user-mode interrupt hardware mechanism and an optimized operating system kernel ☆10 · Apr 1, 2025 · Updated last year
- ☆23 · Jun 21, 2023 · Updated 2 years ago
- ☆15 · Aug 16, 2021 · Updated 4 years ago