Region-level profiling for CUDA kernels with trace, NVBit, CUPTI, NSys, and an interactive Explorer.
☆118Apr 17, 2026Updated last month
Alternatives and similar repositories for intra-kernel-profiler
Users that are interested in intra-kernel-profiler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- differentiable top-k operator☆22Dec 30, 2024Updated last year
- WeChat official account crawler 微信公众号爬虫☆13Apr 13, 2024Updated 2 years ago
- An eBPF kernel Observable Agent To Spy Performance Issue On OS.☆13Oct 31, 2025Updated 7 months ago
- Research about dataflow architecture☆14Nov 30, 2023Updated 2 years ago
- Build a feature-less eBPF vm on eBPF, just for fun.☆16Mar 10, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆19Dec 4, 2025Updated 6 months ago
- iptables-trace is an eBPF enhanced iptables-TRACE alternative iptables TRACE. GPL-3.0 license☆14Feb 3, 2025Updated last year
- Create and use hybrid workflows to solve problems.☆12Oct 31, 2023Updated 2 years ago
- TORCH_TRACE parser for PT2☆86May 11, 2026Updated 3 weeks ago
- Python Script to Open SJTU Dormitory Smart Lock☆10Sep 12, 2022Updated 3 years ago
- ☆12Feb 23, 2025Updated last year
- lightweight system for profiling XDP applications using kfuncs☆21Mar 25, 2026Updated 2 months ago
- DWARF-based stack walks with eBPF☆13Aug 18, 2021Updated 4 years ago
- ☆17Jan 19, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Pie: Programmable LLM Serving☆169Jun 1, 2026Updated last week
- High-performance system-wide BPF-based workload tracer with Perfetto-backed trace visualization.☆24Updated this week
- cuJSON: A Highly Parallel JSON Parser for GPUs☆47Dec 12, 2025Updated 5 months ago
- Small LD_PRELOAD library to show allocation stats☆14Feb 12, 2026Updated 3 months ago
- This is the project page for our IJCV paper 'Light structure from pin motion: Geometric point light source calibration' by Hiroaki Santo,…☆38Oct 9, 2021Updated 4 years ago
- TPP experimentation on MLIR for linear algebra☆149May 26, 2026Updated last week
- Bazel rules for interacting with bazel build artifacts and bringing them into your workspace☆10Updated this week
- Verified and Efficient Matching of Regular Expressions with Lookaround☆27Dec 18, 2024Updated last year
- ☆36Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A trivial riscv cpu with tomasulo algorithm implemented in Verilog HDL. Support out-of-order execution and pipline and can run in FPGA wi…☆16Jan 4, 2020Updated 6 years ago
- 2023/12/22 电三 420 每周会议技术分享:「容器」的 slides 和附件☆10Dec 22, 2023Updated 2 years ago
- BeePF 是一个用 Go 语言编写的 eBPF 程序加载器和运行时框架。它提供了一套完整的工具链,用于加载、管理和监控 eBPF 程序。☆21Jul 14, 2025Updated 10 months ago
- cache_ext is a framework to customize Linux page cache eviction policies using BPF. Appeared in SOSP 2025.☆90Apr 4, 2026Updated 2 months ago
- Presentation based on the Low Latency Workshop☆20Jul 27, 2018Updated 7 years ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆27Aug 27, 2025Updated 9 months ago
- ☆12Dec 31, 2020Updated 5 years ago
- ☆14Apr 24, 2024Updated 2 years ago
- A speicifically designed KV store for blockchain systems☆12Mar 10, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Repo for OSDI 2023 paper: "Ship your Critical Section Not Your Data: Enabling Transparent Delegation with TCLocks"☆21Nov 6, 2024Updated last year
- exercises for Structure and interpretation of computer program☆29Feb 15, 2013Updated 13 years ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆48Aug 18, 2025Updated 9 months ago
- jwilder/nginx-proxy and nginx-proxy/docker-letsencrypt-nginx-proxy-companion launched by docker-compose.☆11Aug 31, 2020Updated 5 years ago
- Collision-detection and collision-avoidance navigation demonstration using a feedforward neural network.☆13Nov 4, 2018Updated 7 years ago
- CarND Semantic Segmentation☆15Apr 1, 2018Updated 8 years ago
- ☆10Apr 9, 2017Updated 9 years ago