Light-weight Performance Variance Detection for Production-run Parallel Applications
☆16Aug 28, 2023Updated 2 years ago
Alternatives and similar repositories for VAPRO
Users that are interested in VAPRO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A GPU FP32 computation method with Tensor Cores.☆27Dec 8, 2025Updated 6 months ago
- Experimental LLVM backend for Android applications (HGraph IR-to-IR translation).☆28Nov 29, 2022Updated 3 years ago
- tensorflow fork with Salus integration☆12Jan 7, 2022Updated 4 years ago
- 西电操作系统课设避坑指南☆10Sep 7, 2020Updated 5 years ago
- Fortran IO Netcdf Assembly☆19Sep 12, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official implementation of Acc-SpMM: Accelerating General-purpose Sparse Matrix-Matrix Multiplication with GPU Tensor Cores.☆15Nov 13, 2025Updated 7 months ago
- This is the open-source site for XFDetector (ASPLOS'20)☆11Mar 5, 2021Updated 5 years ago
- FlipIt: An LLVM Based Fault Injector for HPC☆15May 14, 2021Updated 5 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆16Apr 28, 2021Updated 5 years ago
- Domain-specific framework for performance analysis of parallel programs☆25Mar 23, 2026Updated 2 months ago
- Sequence-level 1F1B schedule for LLMs.☆19Jun 4, 2024Updated 2 years ago
- Protocol-Aware Correlated Crash Explorer for Distributed Storage Systems☆16Nov 14, 2016Updated 9 years ago
- C ABCI libraries☆14Sep 10, 2017Updated 8 years ago
- Test cases for MIPS CPU implementation☆12Dec 26, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- NVIDIA DPU OPs collection☆15Mar 6, 2023Updated 3 years ago
- ☆13Jan 23, 2021Updated 5 years ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆192Feb 11, 2026Updated 4 months ago
- Paper: "Aggregating Capacity in FL through Successive Layer Training for Computationally-Constrained Devices"☆18Jan 10, 2024Updated 2 years ago
- Create beegfs server and client☆23Dec 2, 2021Updated 4 years ago
- Sampled simulation of multi-threaded applications using LoopPoint methodology☆25Feb 21, 2026Updated 3 months ago
- ☆23Mar 31, 2012Updated 14 years ago
- Tools and library to manipulate EFI variables.☆10Apr 21, 2026Updated last month
- 西安电子科技大学视觉开源☆13Sep 2, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- GNU Gzip with Kunpeng optimization.☆12Mar 30, 2022Updated 4 years ago
- ☆20Jul 7, 2017Updated 8 years ago
- Mirror site speedtest☆12Dec 4, 2023Updated 2 years ago
- ☆25Jan 10, 2023Updated 3 years ago
- Forked from https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/☆25Jun 14, 2019Updated 7 years ago
- ProtoText is a efficient python library, offering dict-like operations and text format serialization to google protobuf objects.☆20Jan 15, 2020Updated 6 years ago
- A flexible C++ formatting library designed for i18n, using embedded script to output plural forms, grammatical gender, etc. correctly☆12May 3, 2026Updated last month
- ☆15May 18, 2023Updated 3 years ago
- Used for testing the metadata performance of a file system☆26Nov 29, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 西电机器学习大作业☆12May 30, 2022Updated 4 years ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆126Jun 23, 2022Updated 3 years ago
- ☆41Jun 5, 2024Updated 2 years ago
- Open source version of DOCA GPUNetIO and DOCA Verbs libraries (limited features) to enable GDAKI technology on RDMA (IB and RoCE)☆58May 15, 2026Updated last month
- Implementation of Scaffold and Fedprox for Federated Learning using PyTorch☆24Jun 22, 2022Updated 3 years ago
- verbs profiling library☆22Sep 22, 2023Updated 2 years ago
- Rust LLVM Practises☆17Dec 29, 2020Updated 5 years ago