storypku / cuda-support-for-bazel
The missing CUDA support for Bazel on Linux
☆16 Updated 2 years ago
Alternatives and similar repositories for cuda-support-for-bazel:
Users that are interested in cuda-support-for-bazel are comparing it to the libraries listed below
- Starlark implementation of bazel rules for CUDA. ☆93 Updated this week
- Bazel rules for building CUDA code ☆9 Updated 7 years ago
- Python C++ Code Manager ☆14 Updated 2 months ago
- ☆20 Updated 3 weeks ago
- ☆23 Updated last year
- Benchmark of TVM quantized model on CUDA ☆111 Updated 4 years ago
- Deep Learning tools and applications for NVIDIA AGX platforms. ☆168 Updated this week
- convert torch module to tensorrt network or tvm function ☆89 Updated 4 years ago
- TensorFlow and TVM integration ☆38 Updated 4 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆146 Updated last year
- Codebase associated with the PyTorch compiler tutorial ☆44 Updated 5 years ago
- Bazel wrapper around the pybind11 repository ☆106 Updated last month
- Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda ☆14 Updated last week
- This repository contains the results and code for the MLPerf™ Inference v1.0 benchmark. ☆30 Updated last year
- Example repository for custom C++/CUDA operators for TorchScript ☆114 Updated 2 years ago
- ☆45 Updated 2 years ago
- ☆21 Updated 7 years ago
- use nvcc compiler for cuda in bazel ☆11 Updated 6 years ago
- study of cutlass ☆19 Updated last month
- Parallel CUDA implementation of Non-Maximum Suppression ☆79 Updated 4 years ago
- Script for generating Spatial CNN caffe prototxt file. ☆26 Updated 6 years ago
- This repository describes how to add a custom TensorRT plugin in c++ and python ☆27 Updated 3 years ago
- Collection of CUDA benchmarks, with a focus on unified vs. explicit memory management. ☆20 Updated 5 years ago
- ☆42 Updated 6 years ago
- ☆17 Updated 4 years ago
- tophub autotvm log collections ☆70 Updated last year
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer ☆86 Updated 9 months ago
- notes on reading tensorflow source code ☆13 Updated 6 years ago
- cuDNN sample codes provided by Nvidia ☆44 Updated 5 years ago