ShigureLab / python-lib-starter
Just a template for quickly creating a python library.
☆10Updated last week
Alternatives and similar repositories for python-lib-starter
Users who are interested in python-lib-starter compare it to the libraries listed below.
- PFCC community blog☆13Updated this week
- An experimental project for paddle python IR.☆15Updated 2 years ago
- PaddlePaddle Escort Program training camp☆20Updated last month
- [HACKATHON prep camp] PaddlePaddle Set Sail Program training camp☆17Updated last week
- PaddlePaddle Code Convert Toolkit. A deep learning code conversion tool for PaddlePaddle.☆119Updated this week
- Triton documentation in Simplified Chinese☆94Updated 2 weeks ago
- ☆16Updated last year
- ☆65Updated last week
- A Bytecode level Implementation of Symbolic OpCode Translator For PaddlePaddle☆16Updated 2 years ago
- Music large model based on InternLM2-chat.☆22Updated 11 months ago
- OneFlow->ONNX☆43Updated 2 years ago
- Serving Inside Pytorch☆165Updated 2 weeks ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆43Updated 2 years ago
- Large model deployment in practice: TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- 🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.☆233Updated 2 weeks ago
- ☕️ A vscode extension for netron, support *.pdmodel, *.nb, *.onnx, *.pb, *.h5, *.tflite, *.pth, *.pt, *.mnn, *.param, etc.☆14Updated 2 years ago
- simplify >2GB large onnx model☆69Updated last year
- A llama model inference framework implemented in CUDA C++☆62Updated last year
- LLM deployment project based on ONNX.☆47Updated last year
- A Toolkit to Help Optimize Large Onnx Model☆162Updated last month
- ☆140Updated last year
- ☆125Updated last year
- UltraScale Playbook, Chinese edition☆93Updated 8 months ago
- A light llama-like llm inference framework based on the triton kernel.☆166Updated 2 months ago
- Large Language Model Onnx Inference Framework☆36Updated last week
- A toolkit for developers to simplify the transformation of nn.Module instances. It currently corresponds to Pytorch.fx.☆13Updated 2 years ago
- ☆38Updated last year
- Tutorials for writing high-performance GPU operators in AI frameworks.☆133Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Updated 2 years ago
- ☆52Updated last year