xiuxiazhang/KeplerAs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xiuxiazhang/KeplerAs)

xiuxiazhang / KeplerAs

An Open Source Kepler GPU Assembler

☆22

Alternatives and similar repositories for KeplerAs

Users that are interested in KeplerAs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hyqneuron / asfermi
View on GitHub
assembler for NVIDIA FERMI. Imported from Google Code
☆77Mar 22, 2015Updated 11 years ago
PAA-NCIC / PPoPP2017_artifact
View on GitHub
Third party assembler and GEMM library for NVIDIA Kepler GPU
☆86Oct 8, 2019Updated 6 years ago
daadaada / turingas
View on GitHub
Assembler for NVIDIA Volta and Turing GPUs
☆246Jan 13, 2022Updated 4 years ago
hummingtree / cuda-graph-with-dynamic-parameters
View on GitHub
☆17Aug 9, 2022Updated 3 years ago
daadaada / gas
View on GitHub
☆49Dec 11, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
forhappy / aliyun-pk-report-2012
View on GitHub
aliyun pk report 2012
☆20Oct 31, 2012Updated 13 years ago
cuMF / culda_cgs
View on GitHub
Efficient LDA solution on GPUs.
☆24Aug 20, 2018Updated 7 years ago
knotman90 / cuStreamComp
View on GitHub
Efficient CUDA Stream Compaction Library
☆34Jun 9, 2023Updated 3 years ago
ap-hynninen / cutt
View on GitHub
CUDA Tensor Transpose (cuTT) library
☆55Aug 10, 2017Updated 8 years ago
SNU-HPCS / NeuroSync
View on GitHub
NeuroSync: A Scalable and Accurate Brain Simulation System using Safe and Efficient Speculation (HPCA 2022)
☆14Nov 9, 2022Updated 3 years ago
vinx13 / tvm-cuda-int8-benchmark
View on GitHub
Benchmark of TVM quantized model on CUDA
☆112Jun 19, 2020Updated 6 years ago
NVlabs / SASSI
View on GitHub
Flexible GPGPU instrumentation
☆91Oct 10, 2019Updated 6 years ago
NervanaSystems / maxas
View on GitHub
Assembler for NVIDIA Maxwell architecture
☆1,074Jan 3, 2023Updated 3 years ago
thozza / fedora_rpm_xmind
View on GitHub
XMind application packaged into RPM (for Fedora)
☆10Dec 9, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
MatanHamilis / one_stencil
View on GitHub
Multiple 1-stencil implementations using nvidia cuda.
☆12Dec 2, 2017Updated 8 years ago
chenweiphd / OpenEDA-ChipGPT-Hub
View on GitHub
☆12Jun 22, 2023Updated 3 years ago
RRZE-HPC / asmbench
View on GitHub
A Benchmark Toolkit for Assembly Instructions Using the LLVM JIT
☆18Oct 26, 2020Updated 5 years ago
gwik / spark-cookbook
View on GitHub
chef cookbook to install Apache Spark
☆10Jul 17, 2015Updated 11 years ago
ikuokuo / start-scaled-yolov4
View on GitHub
Start Scaled YOLOv4
☆10Jan 9, 2021Updated 5 years ago
MKlimenko / check_compile_times
View on GitHub
Check various boost headers impact on the compilation time
☆13Jul 11, 2021Updated 5 years ago
md2z34 / winograd_gpu
View on GitHub
GPU implementation of Winograd convolution
☆10Oct 23, 2017Updated 8 years ago
jmarranz / jnieasy
View on GitHub
JNIEasy - Java Native Objects based on JNI
☆10Aug 30, 2023Updated 2 years ago
SourceryTools / nvptx-tools
View on GitHub
nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.
☆53Apr 7, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
raingo / caffe-parameter-server
View on GitHub
Proof of concept prototype to perform distributed training using BVLC/caffe, based on a parameter server implementation using MPI. Data p…
☆13May 7, 2015Updated 11 years ago
rchardx / hopper-gemm
View on GitHub
☆48Nov 1, 2025Updated 8 months ago
pigirons / conv3x3_m1
View on GitHub
This is a demo how to write a high performance convolution run on apple silicon
☆56Feb 8, 2022Updated 4 years ago
laanwj / decuda
View on GitHub
Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.
☆107Jul 24, 2010Updated 16 years ago
ISRC-CAS / PLCT-OpenDay-2019
View on GitHub
PLCT实验室2019年开放日资料（OpenDay-2019）
☆11Dec 20, 2019Updated 6 years ago
ligurio / jenny
View on GitHub
Tool for generating regression tests
☆16Mar 24, 2023Updated 3 years ago
wujun51227 / coffee-hdl
View on GitHub
coffeescript based hardware description language
☆14Jan 14, 2022Updated 4 years ago
OpenPPL / CuAssembler
View on GitHub
An unofficial cuda assembler, for all generations of SASS, hopefully ：）
☆85Mar 20, 2023Updated 3 years ago
nullplay / Unified-Convolution-Framework
View on GitHub
☆10Apr 24, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Jongy / gcc_assert_introspect
View on GitHub
A GCC plugin to insert pytest-like assert introspections
☆19Jun 6, 2020Updated 6 years ago
Bruce-Lee-LY / matrix_multiply
View on GitHub
Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.
☆14Feb 8, 2023Updated 3 years ago
randxie / mmdetection-tvm
View on GitHub
mmdetection -> TVM
☆15Aug 22, 2020Updated 5 years ago
Green077 / A-smartwatch-based-on-esp8266-with-MicroPython
View on GitHub
It's a project combined with hardware and software, the goal is to make a smart watch based on esp8266 chip. The smart watch has so many …
☆10Jul 9, 2019Updated 7 years ago
LeslieCorrea / Android-Face-Recognition
View on GitHub
Android Face Recognition uses Microsoft Project Oxford Face API for face detection and identification.
☆12Nov 13, 2015Updated 10 years ago
connorjan / llvm-cjg
View on GitHub
An LLVM backend for my custom 32-bit RISC CPU https://scholarworks.rit.edu/theses/9550/
☆14Aug 16, 2017Updated 8 years ago
true-grue / graph-irs
View on GitHub
Slides from a talk "Graph-Based Intermediate Representations: An Overview and Perspectives"
☆26Oct 22, 2023Updated 2 years ago