bigeagle / picoGPTLinks
☆41Updated 2 years ago
Alternatives and similar repositories for picoGPT
Users that are interested in picoGPT are comparing it to the libraries listed below
Sorting:
- This is a demo how to write a high performance convolution run on apple silicon☆54Updated 3 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 4 years ago
- Programming exercises for kids (no prior programming experience required)☆15Updated last year
- Efficient inference of large language models.☆149Updated last month
- Static analysis framework for analyzing programs written in TVM's Relay IR.☆28Updated 5 years ago
- ☆11Updated 4 years ago
- ☆75Updated last month
- Summary of system papers/frameworks/codes/tools on training or serving large model☆57Updated last year
- Triton adapter for Ascend. Mirror of https://gitee.com/ascend/triton-ascend☆59Updated this week
- GPTQ inference TVM kernel☆40Updated last year
- ☆12Updated 2 years ago
- my dotfiles..☆62Updated 3 months ago
- A library for syntactically rewriting Python programs, pronounced (sinner).☆69Updated 3 years ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- Distributed DataLoader For Pytorch Based On Ray☆24Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated last year
- Standalone Flash Attention v2 kernel without libtorch dependency☆111Updated 10 months ago
- ☆28Updated last month
- Framework to reduce autotune overhead to zero for well known deployments.☆79Updated this week
- ☆16Updated this week
- ☆19Updated 9 months ago
- ☆22Updated 5 years ago
- DLPack for Tensorflow☆35Updated 5 years ago
- An experimental ahead of time compiler for Relay.☆50Updated 5 years ago
- ☆124Updated last year
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions☆21Updated 3 years ago
- A collection of reproducible inference engine benchmarks☆32Updated 2 months ago
- PTX on XPUs☆36Updated this week
- ONNX Command-Line Toolbox☆35Updated 9 months ago
- Noisy language compiler☆17Updated 11 months ago