chrjxj / awesome-gpu-notesLinks
☆14Updated 6 months ago
Alternatives and similar repositories for awesome-gpu-notes
Users that are interested in awesome-gpu-notes are comparing it to the libraries listed below
Sorting:
- symmetric int8 gemm☆66Updated 5 years ago
- kmeans clustering with multi-GPU capabilities☆122Updated 2 years ago
- Simple Dynamic Batching Inference☆145Updated 3 years ago
- A way to use cuda to accelerate top k algorithm☆30Updated 8 years ago
- Use PyTorch model in C++ project☆139Updated 4 years ago
- ☆129Updated 4 years ago
- how to design cpu gemm on x86 with avx256, that can beat openblas.☆73Updated 6 years ago
- ☆75Updated 3 years ago
- This is a c++ implementation of an LSTM Neural Network parallelized for a GPU using CUDA☆25Updated 8 years ago
- Serving Inside Pytorch☆170Updated last week
- ☆70Updated 3 years ago
- implement bert in pure c++☆37Updated 5 years ago
- flexible-gemm conv of deepcore☆17Updated 6 years ago
- A Fast Muti-processing BERT-Inference System☆102Updated 3 years ago
- OneFlow models for benchmarking.☆104Updated last year
- PyTorch -> ONNX -> TVM for autotuning☆24Updated 5 years ago
- ☆21Updated 6 years ago
- notes on reading tensorflow source code☆13Updated 7 years ago
- Example repository for custom C++/CUDA operators for TorchScript☆114Updated 3 years ago
- pytorch during training, libtorch during serving via gRPC☆21Updated 6 years ago
- A communication library for deep learning☆51Updated last year
- InsNet Runs Instance-dependent Neural Networks with Padding-free Dynamic Batching.☆67Updated 4 years ago
- DeepLearning Framework Performance Profiling Toolkit☆296Updated 3 years ago
- CUDA by practice☆137Updated 6 years ago
- Running BERT without Padding☆480Updated 3 years ago
- TensorRT Plugin Autogen Tool☆366Updated 2 years ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆136Updated 2 years ago
- 动手学习TVM核心原理教程☆64Updated 5 years ago
- heterogeneity-aware-lowering-and-optimization☆257Updated 2 years ago
- My learning notes about AI, including Machine Learning and Deep Learning.☆18Updated 6 years ago