jiazhihao / sosp19ae
Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions
☆21 · Apr 15, 2022 · Updated 3 years ago
Alternatives and similar repositories for sosp19ae
Users who are interested in sosp19ae are comparing it to the repositories listed below.
- Release doc/tutorial/wheels for poseidon-tf ☆10 · Jan 18, 2018 · Updated 8 years ago
- Switches for HIRE: Resource Scheduling for Data Center In-Network Computing ☆13 · Jan 18, 2021 · Updated 5 years ago
- Studying GPU Multi-tenancy ☆11 · Jan 11, 2019 · Updated 7 years ago
- ☆42 · Sep 8, 2023 · Updated 2 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS ☆34 · Feb 10, 2025 · Updated last year
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper) ☆16 · Dec 9, 2020 · Updated 5 years ago
- An Attention Superoptimizer ☆22 · Jan 20, 2025 · Updated last year
- setup the env for vllm users ☆16 · Oct 31, 2023 · Updated 2 years ago
- ☆13 · Jan 23, 2021 · Updated 5 years ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications ☆127 · May 9, 2022 · Updated 3 years ago
- Machine Learning Inference Graph Spec ☆21 · Jul 27, 2019 · Updated 6 years ago
- ☆16 · May 4, 2021 · Updated 4 years ago
- Experiments evaluating preemption on the NVIDIA Pascal architecture ☆17 · Nov 10, 2016 · Updated 9 years ago
- ☆17 · Sep 17, 2015 · Updated 10 years ago
- ☆19 · Nov 22, 2017 · Updated 8 years ago
- Repository linking to the software artifacts used for the MigrOS ATC 2021 paper ☆18 · May 31, 2021 · Updated 4 years ago
- Getting Started with NIMBUS-CORE ☆10 · Dec 16, 2023 · Updated 2 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning ☆142 · Mar 31, 2023 · Updated 2 years ago
- System for automated integration of deep learning backends. ☆47 · Aug 15, 2022 · Updated 3 years ago
- Implementation of Parameter Server using PyTorch communication lib ☆42 · Apr 7, 2019 · Updated 6 years ago
- Static analysis framework for analyzing programs written in TVM's Relay IR. ☆29 · Oct 31, 2019 · Updated 6 years ago
- ☆31 · Jul 18, 2019 · Updated 6 years ago
- The prototype for NSDI paper "NetHint: White-Box Networking for Multi-Tenant Data Centers" ☆26 · Feb 2, 2024 · Updated 2 years ago
- Deadline-based hyperparameter tuning on RayTune. ☆32 · Jan 16, 2020 · Updated 6 years ago
- A Caffe2 implementation of ShuffleNet V2. ☆25 · Aug 6, 2018 · Updated 7 years ago
- The code base for the I4 prototype, as described in the SOSP '19 paper "I4: Incremental Inference of Inductive Invariants for Verificatio… ☆26 · May 25, 2021 · Updated 4 years ago
- TMgen is a tool for generating spatial, temporal, and spatio-temporal traffic matrices. ☆27 · Jun 18, 2025 · Updated 7 months ago
- The Tensor Algebra SuperOptimizer for Deep Learning ☆740 · Jan 26, 2023 · Updated 3 years ago
- This is the Group-Meeting collections of HKUST System NetworkING (SING) Research Group. ☆27 · Oct 3, 2019 · Updated 6 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse. ☆23 · Aug 21, 2020 · Updated 5 years ago
- A pattern-based algorithmic autotuner for graph processing on GPUs. ☆32 · Jun 25, 2025 · Updated 7 months ago
- A user-level TCP/IP stack with NIC offload of stateful TCP operations ☆72 · Sep 2, 2020 · Updated 5 years ago
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning ☆65 · Oct 31, 2025 · Updated 3 months ago
- Artifact evaluation of PLDI'24 paper "Allo: A Programming Model for Composable Accelerator Design" ☆33 · Apr 11, 2024 · Updated last year
- A collection of reproducible inference engine benchmarks ☆38 · Apr 22, 2025 · Updated 9 months ago
- A superoptimizing compiler for packet-processing ☆30 · Jun 16, 2023 · Updated 2 years ago
- The optimization methods in deep learning explained by Vietnamese such as gradient descent, momentum, NAG, AdaGrad, Adadelta, RMSProp, A… ☆11 · Apr 21, 2020 · Updated 5 years ago
- Some microbenchmarks and design docs before commencement ☆12 · Feb 1, 2021 · Updated 5 years ago
- This is an APRS project for DDP course ☆10 · Apr 1, 2018 · Updated 7 years ago