My notes on various HPC papers.
☆26Jan 7, 2023Updated 3 years ago
Alternatives and similar repositories for HPC-Paper-Notes
Users that are interested in HPC-Paper-Notes are comparing it to the libraries listed below.
- Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involveme…☆21Apr 25, 2024Updated 2 years ago
- ☆14May 28, 2019Updated 6 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11May 11, 2018Updated 8 years ago
- This repository contains some tools to monitor the UNC_CBO_CACHE_LOOKUP event of the C-Boxes.☆12Oct 11, 2017Updated 8 years ago
- OCCA Python API: JIT Compilation for Multiple Architectures☆11Dec 20, 2019Updated 6 years ago
- kmc simulation of vacancy-dumbbell transition for BCC lattice.☆13Aug 20, 2025Updated 9 months ago
- nVidia's CUDA accelerated Spin Transformations of Discrete Surfaces, based on the original code and paper by Keenan Crane, Ulrich Pinkall…☆17Mar 14, 2018Updated 8 years ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆32Jun 26, 2024Updated last year
- Cloud native connectivity for Unreal Engine☆10Apr 14, 2023Updated 3 years ago
- XtratuM Mirror☆20Apr 7, 2017Updated 9 years ago
- MIT-licensed stand-alone CUDA utility functions.☆16Jul 3, 2020Updated 5 years ago
- The code for paper 'Hierarchical Policy for Non-prehensile Multi-object Rearrangement with Deep Reinforcement Learning and Monte Carlo Tr…☆21Aug 18, 2023Updated 2 years ago
- Peking University Data Structures and Algorithms (B), Spring 2022 course project "Block Battle" (方块大战)☆16Jun 7, 2022Updated 3 years ago
- Single-header logger with pretty console output☆20Mar 23, 2026Updated 2 months ago
- Solutions to problems on Hackerearth☆12Jan 18, 2018Updated 8 years ago
- *discontinued* Python module for oceanoptics spectrometers☆21Oct 29, 2015Updated 10 years ago
- An Optimizing Compiler for Recommendation Model Inference☆26Jun 5, 2025Updated 11 months ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆27Aug 27, 2025Updated 8 months ago
- ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.☆27Jul 6, 2023Updated 2 years ago
- Explore Inter-layer Expert Affinity in MoE Model Inference☆16May 6, 2024Updated 2 years ago
- 🍎 One kernel a day keeps high latency away. A hands-on CUDA learning path featuring a rich collection of kernels, from the basics to pea…☆85May 18, 2026Updated last week
- Presentation materials for the 2016 Berkeley C++ Summit☆14Oct 20, 2016Updated 9 years ago
- ☆10Jul 16, 2016Updated 9 years ago
- List all available information about all SYCL devices and platforms☆15Sep 14, 2020Updated 5 years ago
- ☆10Aug 28, 2020Updated 5 years ago
- A Snake game built with Qt☆12Jul 14, 2017Updated 8 years ago
- CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.☆19Apr 18, 2023Updated 3 years ago
- study of cutlass☆22Nov 10, 2024Updated last year
- Another|Alternative|Awesome VE Offloading stack using ve-urpc☆16Aug 16, 2023Updated 2 years ago
- Implementation of a parallel breadth-first search algorithm for graph traversal using CUDA and C++.☆35Dec 12, 2019Updated 6 years ago
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆16Feb 28, 2019Updated 7 years ago
- A GPU benchmark suite for autotuners☆19Feb 20, 2024Updated 2 years ago
- ☆31Feb 28, 2025Updated last year
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆21Nov 18, 2019Updated 6 years ago
- A performance-oriented prototyping harness for state of the art Molecular Dynamics algorithms☆17Updated this week
- Intel® SHMEM - Device initiated shared memory based communication library☆32Nov 12, 2025Updated 6 months ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆27Dec 10, 2022Updated 3 years ago
- Visual Studio Code extension to open two files in the external tool Meld.☆17Nov 21, 2025Updated 6 months ago
- about me☆13Mar 10, 2022Updated 4 years ago