☆74Jun 29, 2023Updated 2 years ago
Alternatives and similar repositories for nvvmir-samples
Users that are interested in nvvmir-samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python bindings for libNVVM☆39Apr 3, 2014Updated 12 years ago
- Haskell bindings for libNVVM☆20Apr 1, 2014Updated 12 years ago
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sources☆123Apr 18, 2025Updated last year
- An llvm pass for counting global uncoalesced acceses for cuda code via dynamic analysis.☆14Nov 17, 2018Updated 7 years ago
- Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels☆14Aug 26, 2015Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- cuASR: CUDA Algebra for Semirings☆47Aug 22, 2022Updated 3 years ago
- LLVM Plugin to Instrument Global Memory Accesses in CUDA Kernels☆10Jun 8, 2020Updated 5 years ago
- SYSU-ARCH is a LAB that focuses on the use and extending of simulators.☆10Dec 19, 2022Updated 3 years ago
- A framework for pipelined computing on GPU☆30Jul 17, 2019Updated 6 years ago
- ☆67Oct 10, 2024Updated last year
- CUPTI GPU Profiler☆40Feb 26, 2019Updated 7 years ago
- D bindings and wrapper library for the MXNet deep learning library☆14Sep 11, 2019Updated 6 years ago
- Project ARES represents a joint effort between LANL and ORNL to introduce a common compiler representation and tool-chain for HPC applica…☆10Nov 30, 2016Updated 9 years ago
- outline and links for PLDI 2022 tutorial☆17Jun 13, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆52Apr 7, 2026Updated last month
- GLSL code generator to aid use of Vulkan's descriptor set indexing☆14Apr 20, 2019Updated 7 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆12Dec 2, 2017Updated 8 years ago
- ☆19Nov 21, 2022Updated 3 years ago
- some RL algorithms☆19Dec 9, 2016Updated 9 years ago
- ngAP's artifact for ASPLOS'24☆25Jul 29, 2025Updated 9 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated last year
- Generate simple index ranges in C++ and CUDA C++☆39Jun 14, 2023Updated 2 years ago
- Colby Hall's C++ Standard Library☆11Jan 13, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GPUOCelot: A dynamic compilation framework for PTX☆291Jul 31, 2023Updated 2 years ago
- Sample programs for the LLVM PTX back-end☆41Aug 27, 2015Updated 10 years ago
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication☆29Jul 23, 2023Updated 2 years ago
- GPGPU-SIM 使用篇☆14Nov 12, 2022Updated 3 years ago
- ☆17Oct 15, 2023Updated 2 years ago
- ☆20Feb 21, 2022Updated 4 years ago
- Fast Point Overlap Test☆19Jun 17, 2018Updated 7 years ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated…☆15Jun 24, 2020Updated 5 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆33Mar 15, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A decentralized unique ID generator (int64)☆22Jun 15, 2016Updated 9 years ago
- ☆27Mar 26, 2025Updated last year
- LHCSim is a 3D physics simulation engine developed based on taichi☆17Jul 20, 2022Updated 3 years ago
- A GPU FP32 computation method with Tensor Cores.☆27Dec 8, 2025Updated 5 months ago
- A cron job wrapper that wraps jobs and enables better error reproting and command timeouts.☆29Feb 1, 2022Updated 4 years ago
- Material and work for O'Reilly courses and publications☆11May 19, 2020Updated 6 years ago
- Rebuild YatSenOS On RISC-V 64.☆23Jan 6, 2022Updated 4 years ago