ShigureLab / python-lib-starterLinks
Just a template for quickly creating a python library.
☆10Updated last month
Alternatives and similar repositories for python-lib-starter
Users that are interested in python-lib-starter are comparing it to the libraries listed below
Sorting:
- PFCC 社区博客☆14Updated this week
- An experimental project for paddle python IR.☆15Updated 2 years ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆102Updated last month
- ☆16Updated last year
- 飞桨护航计划集训营☆19Updated last week
- PaddlePaddle Code Convert Toolkit. 『飞桨』深度学习代码转换工具☆121Updated 2 weeks ago
- 【HACKATHON 预备营】飞桨启航计划集训营☆17Updated last month
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆27Updated last year
- Serving Inside Pytorch☆170Updated 2 weeks ago
- OneFlow->ONNX☆43Updated 2 years ago
- 🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.☆248Updated 2 weeks ago
- ☆74Updated this week
- 使用 CUDA C++ 实现的 llama 模型推理框架☆64Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆17Updated last year
- ☕️ A vscode extension for netron, support *.pdmodel, *.nb, *.onnx, *.pb, *.h5, *.tflite, *.pth, *.pt, *.mnn, *.param, etc.☆14Updated 2 years ago
- Music large model based on InternLM2-chat.☆23Updated last year
- ☆120Updated 2 years ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Updated 2 years ago
- A Bytecode level Implementation of Symbolic OpCode Translator For PaddlePaddle☆16Updated 2 years ago
- PaddlePaddle Developer Community☆134Updated this week
- ☆125Updated 2 years ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆26Updated 2 weeks ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆136Updated 2 years ago
- ☆141Updated last year
- https://start.oneflow.org/oneflow-yolo-doc☆23Updated 2 years ago
- 分层解耦的深度学习推理引擎☆79Updated 11 months ago
- GLM Series Edge Models☆157Updated 7 months ago
- Awesome code, projects, books, etc. related to CUDA☆30Updated last month
- Large Language Model Onnx Inference Framework☆36Updated 2 months ago
- A Toolkit to Help Optimize Large Onnx Model☆163Updated 3 months ago