hyxcl/nsys_recipes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hyxcl/nsys_recipes)

hyxcl / nsys_recipes

these are custom recipes of nvidia nsight system post collection analysis.

☆16

Alternatives and similar repositories for nsys_recipes

Users that are interested in nsys_recipes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amzn / rdma-core
View on GitHub
RDMA core userspace libraries and daemons
☆15Updated this week
infiniband-radar / infiniband-radar-web
View on GitHub
Monitoring and visualization of InfiniBand Fabrics
☆23Apr 19, 2021Updated 5 years ago
alibaba / TePDist
View on GitHub
TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.
☆97Apr 22, 2023Updated 3 years ago
detel / Median-Filtering-GPU
View on GitHub
High Performance Median Filtering Algorithm Based on NVIDIA GPU Computing
☆18Nov 15, 2017Updated 8 years ago
ntrdma / ntrdma
View on GitHub
Linux tree for ntrdma driver development.
☆11Jun 29, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
NVIDIA / nvidia-dlfw-inspect
View on GitHub
The tool facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs using Nvidia libraries such as…
☆21Sep 17, 2025Updated 10 months ago
google / tcpgpudmarxd
View on GitHub
☆10Feb 17, 2026Updated 5 months ago
kwai / Megatron-Kwai
View on GitHub
LLM training technologies developed by kwai
☆71Jun 30, 2026Updated 3 weeks ago
osrg / optcast
View on GitHub
Reduction Server in Rust
☆14Apr 9, 2024Updated 2 years ago
DeepLink-org / AIChipBenchmark
View on GitHub
☆35Mar 27, 2026Updated 4 months ago
llnl / mpi-tools
View on GitHub
Tools for MPI programmers
☆14Sep 21, 2020Updated 5 years ago
UNITES-Lab / Occult
View on GitHub
[ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…
☆13Apr 17, 2025Updated last year
yuyangJin / PerFlow
View on GitHub
Domain-specific framework for performance analysis of parallel programs
☆25Mar 23, 2026Updated 4 months ago
pnnl / rofi
View on GitHub
☆16Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
daos-stack / google-cloud-daos
View on GitHub
Terraform modules for deploying DAOS on GCP
☆11Jan 17, 2024Updated 2 years ago
davejiang / linux
View on GitHub
kernel development code for my work (ioatdma, ntb_hw_intel, idxd, PCI, and CXL related bits)
☆12Jun 23, 2026Updated last month
NVIDIA / mig-parted
View on GitHub
MIG Partition Editor for NVIDIA GPUs
☆259Updated this week
gbxu / autoccl
View on GitHub
[NSDI25] AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training
☆34May 2, 2025Updated last year
siddheshsathe / Valgrind-Log-Parser
View on GitHub
☆11May 17, 2023Updated 3 years ago
ViffyGwaanl / DeepSeek-Api-Test
View on GitHub
Currently, there are many DeepSeek API providers on the market. Use DeepSeek Api Test to test which API performs the best
☆20Feb 13, 2025Updated last year
nstebbins / mcp-manager
View on GitHub
CLI tool for managing Model Context Protocol (MCP) servers in one place & using them across them different clients
☆25Apr 23, 2025Updated last year
lhb8125 / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆19Updated this week
exists-forall / striped_attention
View on GitHub
☆49Nov 10, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
bigcode-project / bigcode-inference-benchmark
View on GitHub
☆19Aug 10, 2024Updated last year
tgamblin / ascr-software-stewardship-rfi-responses
View on GitHub
Responses to 2021 RFI on Stewardship of Software for Scientific and High-Performance Computing
☆16Jan 20, 2022Updated 4 years ago
aws-samples / ec2-topology-aware-for-slurm
View on GitHub
☆13May 30, 2025Updated last year
alibaba / llm-scheduling-artifact
View on GitHub
Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“
☆64Jun 5, 2024Updated 2 years ago
UNITES-Lab / C2R-MoE
View on GitHub
[NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…
☆16Feb 4, 2025Updated last year
Mellanox / nccl-rdma-sharp-plugins
View on GitHub
RDMA and SHARP plugins for nccl library
☆233Apr 3, 2026Updated 3 months ago
ntrdma / ntrdma-ext
View on GitHub
Linux extra (out of tree) kernel modules for ntrdma.
☆24May 23, 2025Updated last year
google / nccl-plugin-gpudirecttcpx
View on GitHub
☆19May 8, 2026Updated 2 months ago
Xing-CHEN18 / NeuralODEs_for_physics
View on GitHub
☆10Mar 2, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
hpcgame / hpcgame-platform-0th
View on GitHub
HPC Game Platform
☆11Apr 20, 2023Updated 3 years ago
openucx / torch-ucc
View on GitHub
pytorch ucc plugin
☆23Jul 8, 2021Updated 5 years ago
JiaweiZhuang / aws-mpi-benchmark
View on GitHub
MPI Benchmark on AWS HPC cluster
☆20Jan 31, 2020Updated 6 years ago
haoyang9804 / HirGen
View on GitHub
A Computational Graph Generator for AI Compiler Fuzzing
☆16May 31, 2023Updated 3 years ago
Twilight92z / Quantize-Watermark
View on GitHub
☆19Nov 6, 2023Updated 2 years ago
facebookresearch / torch_ucc
View on GitHub
Pytorch process group third-party plugin for UCC
☆22Apr 15, 2024Updated 2 years ago
devicescape / aws_dynamo
View on GitHub
AWS DynamoDB Library for C and C++
☆23Mar 21, 2019Updated 7 years ago