tensorflow fork with Salus integration
☆12Jan 7, 2022Updated 4 years ago
Alternatives and similar repositories for tensorflow-salus
Users that are interested in tensorflow-salus are comparing it to the libraries listed below
Sorting:
- Fine-grained GPU sharing primitives☆147Jul 28, 2025Updated 7 months ago
- ☆11Sep 25, 2021Updated 4 years ago
- Protecting Real-Time GPU Kernels on Integrated CPU-GPU SoC Platforms☆12Apr 9, 2018Updated 7 years ago
- notes on reading tensorflow source code☆13Aug 18, 2018Updated 7 years ago
- A Light CNN Framework!☆16Apr 8, 2019Updated 6 years ago
- Adaptive consistency replication with reinforcement learning for large scale globally distributed storage.☆13Sep 29, 2025Updated 5 months ago
- Implementation of Balanced Graph Partitioning Konstantin" - Andreev and Harald Racke (Authors of the paper) by Ivan Vigorito and Lorenzo …☆14Feb 17, 2023Updated 3 years ago
- Batch inference version of Jetson-inference, to run several images recognition on TX1/2 and PC at the same time to save time☆12Dec 20, 2017Updated 8 years ago
- NGINX Lua plugin for adaptive concurrency control used to handle overload in services☆14Dec 30, 2022Updated 3 years ago
- Official implementation of Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplication with GPU Tensor Cores.☆14Nov 13, 2025Updated 4 months ago
- Unit benchmarks of CUDA event APIs.☆17Apr 23, 2024Updated last year
- ROS wrapper around the InertialSense serial protocol https://github.com/inertialsense/inertialsense_serial_protocol.git☆16Jul 1, 2022Updated 3 years ago
- C++11 Work-Stealing Task Scheduler☆37Nov 24, 2019Updated 6 years ago
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆127May 9, 2022Updated 3 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆16Apr 28, 2021Updated 4 years ago
- ESPBench - The Enterprise Stream Processing Benchmark☆15Dec 27, 2023Updated 2 years ago
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago
- Sequence-level 1F1B schedule for LLMs.☆19Jun 4, 2024Updated last year
- ☆14Aug 9, 2021Updated 4 years ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆78Oct 15, 2025Updated 5 months ago
- Scripts used to setup a Spark cluster on EC2☆21Mar 24, 2016Updated 9 years ago
- Velodyne ROS driver for VLS sensors.☆19Nov 29, 2022Updated 3 years ago
- Naos: Serialization-free RDMA networking in Java☆17Aug 17, 2021Updated 4 years ago
- ☆12Feb 12, 2018Updated 8 years ago
- Distributed tracing data from Meta's microservices architecture.☆26Aug 30, 2023Updated 2 years ago
- Kivi: verifying your Kubernetes clusters☆20Nov 8, 2023Updated 2 years ago
- Wall (NanoVG port) demo converted to Reason.☆16Aug 31, 2018Updated 7 years ago
- Website for Systems Research Seminar at UIUC☆20Updated this week
- sample sketch for L3G4200D ADXL345 HMC5883 BMP085 sensors☆20Apr 19, 2017Updated 8 years ago
- Smoothing video traffic to make it a friendlier internet neighbor☆14Apr 23, 2024Updated last year
- Light-weight Performance Variance Detection for Production-run Parallel Applications☆16Aug 28, 2023Updated 2 years ago
- Splits single Nvidia GPU into multiple partitions with complete compute and memory isolation (wrt to performace) between the partitions☆163Apr 21, 2019Updated 6 years ago
- ☆19May 13, 2023Updated 2 years ago
- UW-Madison Course Monitor☆10Oct 4, 2019Updated 6 years ago
- ☆11Nov 20, 2020Updated 5 years ago
- A simple, often-used multiprocessor scheduling (load balancing) algorithm is the LPT algorithm (Longest Processing Time) which sorts the …☆11Aug 21, 2018Updated 7 years ago
- Repository for "GIST: Distributed training for large-scale graph convolutional networks"☆15Jan 14, 2023Updated 3 years ago
- ☆21Dec 15, 2023Updated 2 years ago
- Code for reproducing experiments performed for Accoridon☆13Jun 11, 2021Updated 4 years ago