Region-level profiling for CUDA kernels with trace, NVBit, CUPTI, NSys, and an interactive Explorer.
☆117Apr 17, 2026Updated last month
Alternatives and similar repositories for intra-kernel-profiler
Users that are interested in intra-kernel-profiler are comparing it to the libraries listed below.
- differentiable top-k operator☆22Dec 30, 2024Updated last year
- WeChat official account crawler☆13Apr 13, 2024Updated 2 years ago
- my rc files☆12Mar 16, 2016Updated 10 years ago
- ☆15Jul 28, 2022Updated 3 years ago
- Tutorials for NVIDIA CUPTI samples☆64Nov 3, 2025Updated 6 months ago
- CO-RE code for the Netdata eBPF plugin.☆16May 11, 2026Updated last week
- An eBPF kernel Observable Agent To Spy Performance Issue On OS.☆13Oct 31, 2025Updated 6 months ago
- Build a feature-less eBPF vm on eBPF, just for fun.☆16Mar 10, 2024Updated 2 years ago
- ☆18Dec 4, 2025Updated 5 months ago
- iptables-trace is an eBPF-enhanced alternative to iptables TRACE. GPL-3.0 license☆14Feb 3, 2025Updated last year
- Create and use hybrid workflows to solve problems.☆12Oct 31, 2023Updated 2 years ago
- Python Script to Open SJTU Dormitory Smart Lock☆10Sep 12, 2022Updated 3 years ago
- ☆12Feb 23, 2025Updated last year
- lightweight system for profiling XDP applications using kfuncs☆21Mar 25, 2026Updated last month
- DWARF-based stack walks with eBPF☆13Aug 18, 2021Updated 4 years ago
- cuJSON: A Highly Parallel JSON Parser for GPUs☆46Dec 12, 2025Updated 5 months ago
- The official starter-kit for NeurIPS 2025 mind games competition☆21May 5, 2026Updated 2 weeks ago
- ☆17Jan 19, 2025Updated last year
- High-performance system-wide BPF-based workload tracer with Perfetto-backed trace visualization.☆23Updated this week
- A dataflow compiler☆26Updated this week
- Small LD_PRELOAD library to show allocation stats☆14Feb 12, 2026Updated 3 months ago
- Example of a C++ project with VSCode, Bazel, and working autocomplete☆25Oct 27, 2024Updated last year
- ☆15Apr 28, 2023Updated 3 years ago
- TPP experimentation on MLIR for linear algebra☆148May 10, 2026Updated last week
- End-to-end codebase for finetuning LLMs (LLaMA 2, 3, etc.) with or without DP☆17Sep 23, 2024Updated last year
- Bazel rules for interacting with bazel build artifacts and bringing them into your workspace☆10Jul 24, 2024Updated last year
- Verified and Efficient Matching of Regular Expressions with Lookaround☆27Dec 18, 2024Updated last year
- A trivial RISC-V CPU with the Tomasulo algorithm implemented in Verilog HDL. Supports out-of-order execution and pipelining and can run on an FPGA wi…☆16Jan 4, 2020Updated 6 years ago
- A Lightweight Graph Processing Framework for Multi-GPUs☆14Apr 15, 2015Updated 11 years ago
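- 2023/12/22 电三 420 weekly meeting tech talk: slides and attachments for "Containers"☆10Dec 22, 2023Updated 2 years ago
- BeePF is an eBPF program loader and runtime framework written in Go. It provides a complete toolchain for loading, managing, and monitoring eBPF programs.☆21Jul 14, 2025Updated 10 months ago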
- cache_ext is a framework to customize Linux page cache eviction policies using BPF. Appeared in SOSP 2025.☆88Apr 4, 2026Updated last month
- GPU model checker☆13Apr 17, 2019Updated 7 years ago
- Low-overhead Linux CPU profiler as a library☆28Updated this week
- Based on the original diff-gaussian-rasterization☆22Nov 6, 2024Updated last year
- Template repository for the Werewolf hackathon☆19Nov 9, 2024Updated last year
- Presentation based on the Low Latency Workshop☆20Jul 27, 2018Updated 7 years ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆27Aug 27, 2025Updated 8 months ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year