chrjxj / awesome-gpu-notesLinks
☆14Updated 6 months ago
Alternatives and similar repositories for awesome-gpu-notes
Users that are interested in awesome-gpu-notes are comparing it to the libraries listed below
Sorting:
- symmetric int8 gemm☆66Updated 5 years ago
- implement bert in pure c++☆37Updated 5 years ago
- Use PyTorch model in C++ project☆139Updated 4 years ago
- Simple Dynamic Batching Inference☆145Updated 3 years ago
- kmeans clustering with multi-GPU capabilities☆122Updated 2 years ago
- ☆75Updated 3 years ago
- ☆129Updated 4 years ago
- ☆70Updated 3 years ago
- how to design cpu gemm on x86 with avx256, that can beat openblas.☆73Updated 6 years ago
- Serving Inside Pytorch☆170Updated last week
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆18Updated 3 years ago
- Running BERT without Padding☆480Updated 3 years ago
- notes on reading tensorflow source code☆13Updated 7 years ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆136Updated 2 years ago
- A way to use cuda to accelerate top k algorithm☆30Updated 8 years ago
- ☆130Updated last year
- ☆57Updated 2 years ago
- ☆125Updated 2 years ago
- flexible-gemm conv of deepcore☆17Updated 6 years ago
- Fork of https://source.codeaurora.org/quic/hexagon_nn/nnlib☆58Updated 2 years ago
- Transformer related optimization, including BERT, GPT☆59Updated 2 years ago
- pytorch during training, libtorch during serving via gRPC☆21Updated 6 years ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆51Updated 2 years ago
- optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆477Updated last year
- ☆19Updated last year
- pdf☆94Updated 7 years ago
- Transformer related optimization, including BERT, GPT☆17Updated 2 years ago
- ☆21Updated 6 years ago
- ☆97Updated 4 years ago
- Edge Machine Learning Library☆199Updated 3 years ago