Assembler for NVIDIA Volta and Turing GPUs
☆245 · Jan 13, 2022 · Updated 4 years ago
Alternatives and similar repositories for turingas
Users who are interested in turingas are comparing it to the libraries listed below.
- An unofficial cuda assembler, for all generations of SASS, hopefully :) ☆600 · Apr 20, 2023 · Updated 3 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU ☆86 · Oct 8, 2019 · Updated 6 years ago
- ☆49 · Dec 11, 2020 · Updated 5 years ago
- Assembler for NVIDIA Maxwell architecture ☆1,071 · Jan 3, 2023 · Updated 3 years ago
- An Open Source Kepler GPU Assembler ☆21 · Jan 23, 2017 · Updated 9 years ago
- Dissecting NVIDIA GPU Architecture ☆123 · Jul 11, 2022 · Updated 3 years ago
- assembler for NVIDIA FERMI. Imported from Google Code ☆77 · Mar 22, 2015 · Updated 11 years ago
- ☆43 · Apr 3, 2022 · Updated 4 years ago
- ☆55 · Nov 21, 2019 · Updated 6 years ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs. ☆95 · Feb 23, 2023 · Updated 3 years ago
- Yinghan's Code Sample ☆365 · Jul 25, 2022 · Updated 3 years ago
- GVProf: A Value Profiler for GPU-based Clusters ☆54 · Mar 24, 2024 · Updated 2 years ago
- ☆32 · Aug 24, 2022 · Updated 3 years ago
- ☆20 · Aug 26, 2021 · Updated 4 years ago
- CUDAAdvisor: a GPU profiling tool ☆53 · Aug 24, 2018 · Updated 7 years ago
- ☆39 · Feb 28, 2020 · Updated 6 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA ☆35 · Jul 28, 2020 · Updated 5 years ago
- collection of benchmarks to measure basic GPU capabilities ☆530 · Oct 24, 2025 · Updated 8 months ago
- HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs ☆42 · Dec 9, 2024 · Updated last year
- ☆338 · Apr 6, 2026 · Updated 2 months ago
- CUDA Templates and Python DSLs for High-Performance Linear Algebra ☆9,967 · Updated this week
- ☆108 · May 31, 2025 · Updated last year
- A pattern-based algorithmic autotuner for graph processing on GPUs. ☆33 · Jun 25, 2025 · Updated last year
- Nvidia Instruction Set Specification Generator ☆340 · Jul 9, 2024 · Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 · Jun 21, 2019 · Updated 7 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆1,002 · Sep 19, 2024 · Updated last year
- Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance. ☆419 · Jan 2, 2025 · Updated last year
- CUDA Kernel Benchmarking Library ☆878 · Jun 22, 2026 · Updated last week
- GPU implementation of Winograd convolution ☆10 · Oct 23, 2017 · Updated 8 years ago
- CUDA Tensor Transpose (cuTT) library ☆55 · Aug 10, 2017 · Updated 8 years ago
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations ☆184 · Apr 25, 2022 · Updated 4 years ago
- ☆20 · May 30, 2026 · Updated 3 weeks ago
- GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for… ☆1,651 · Feb 15, 2025 · Updated last year
- ☆46 · Jun 19, 2024 · Updated 2 years ago
- ☆116 · Apr 19, 2024 · Updated 2 years ago
- play gemm with tvm ☆91 · Jul 22, 2023 · Updated 2 years ago
- A library of GPU kernels for sparse matrix operations. ☆288 · Nov 24, 2020 · Updated 5 years ago
- ☆50 · Jun 27, 2019 · Updated 7 years ago
- Fast CUDA Kernels for ResNet Inference. ☆183 · May 26, 2019 · Updated 7 years ago