Supercomputing for Artificial Intelligence
☆70Jan 17, 2026Updated 2 months ago
Alternatives and similar repositories for supercomputing-for-ai
Users that are interested in supercomputing-for-ai are comparing it to the libraries listed below
Sorting:
- FSDS Webinar 1: Real-Time Machine Learning Inference with Spark Streaming and Kafka☆11Feb 17, 2025Updated last year
- Distributed Performance-portable Stencil Compuitation☆10Jul 9, 2023Updated 2 years ago
- ☆29Dec 12, 2025Updated 3 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- Hardware Division Units☆10Jul 17, 2014Updated 11 years ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- Bug Bounty Monitor☆15Nov 23, 2020Updated 5 years ago
- Collection of my Google DevFest/Codelab in 2025☆18Mar 4, 2026Updated 2 weeks ago
- Examples calling Warp precompiled (cached) kernels directly from C++ (without Python)☆29Sep 26, 2024Updated last year
- manylinux docker images with CUDA Toolkit☆19Nov 24, 2025Updated 3 months ago
- A Doxygen plugin for MkDocs☆18Dec 4, 2020Updated 5 years ago
- RISC-V vector extension ISA simulation☆16Jun 11, 2019Updated 6 years ago
- Access radare2 from anywhere, anytime.☆43Mar 8, 2026Updated last week
- Step by step implementation of a fast softmax kernel in CUDA☆62Jan 6, 2025Updated last year
- Public API for SPIDA Products☆11Mar 12, 2026Updated last week
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆26Updated this week
- Scientific Solutions for Blender. Bridging Paraview and Blender for Scientific Visualization v.2.0.0: Easy animation import to Blender fr…☆32Feb 26, 2025Updated last year
- ☆16Feb 24, 2026Updated 3 weeks ago
- Training Repo for 2022 NVHPC training☆13Jan 13, 2022Updated 4 years ago
- Map (deep learning) model weights between different model implementations.☆19Mar 9, 2026Updated last week
- Enhance your Google account security with this comprehensive guide. It covers strong passwords, two-factor authentication, phishing preve…☆11Nov 21, 2024Updated last year
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
- A simple n-body code for scientific and educational purposes☆14Dec 1, 2022Updated 3 years ago
- Useful scripts, utilities, and tools for Snowflake☆13Jul 15, 2020Updated 5 years ago
- AWS Quick Start Team☆16Oct 3, 2024Updated last year
- a scaling framework for tor traffic balancing 🧦 🧅 ⚖️☆14Nov 10, 2025Updated 4 months ago
- Easier, quicker command-line CUDA profiling☆53Sep 17, 2024Updated last year
- implementing dl from scratch using first principles☆26Jan 10, 2026Updated 2 months ago
- General Matrix Multiplication using NVIDIA Tensor Cores☆28Jan 25, 2025Updated last year
- Collective and Neighbor Collective Optimizations and Extensions☆13Mar 2, 2026Updated 2 weeks ago
- This contains script that reimplements KAPING framework (Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Quest…☆29Mar 2, 2024Updated 2 years ago
- ☆25Jan 7, 2026Updated 2 months ago
- simple ros2 humble ackermann drive that use default gazebo ackermann plugin☆16Sep 10, 2024Updated last year
- ☆36Jan 10, 2026Updated 2 months ago
- pen testing scripts☆12Feb 7, 2021Updated 5 years ago
- Pure Triton kernels for Qwen3.5-27B inference on NVIDIA B200☆81Feb 28, 2026Updated 2 weeks ago
- Automation tool designed to simplify the analysis of PCAP (Packet Capture) files☆18Mar 15, 2024Updated 2 years ago
- A high-performance attention mechanism that computes softmax normalization in a single streaming pass using running accumulators (online …☆29Oct 11, 2025Updated 5 months ago
- Localization of Handwritten and Printed Text in doctors' prescription☆28May 7, 2022Updated 3 years ago