ShigureLab / python-lib-starter
Just a template for quickly creating a python library.
☆10Updated last week
Alternatives and similar repositories for python-lib-starter
Users who are interested in python-lib-starter are comparing it to the libraries listed below.
- PFCC community blog☆13Updated this week
- PaddlePaddle Escort Program (护航计划) training camp☆20Updated this week
- Triton Documentation in Simplified Chinese☆96Updated last week
- An experimental project for paddle python IR.☆15Updated 2 years ago
- [HACKATHON prep camp] PaddlePaddle Launch Program (启航计划) training camp☆17Updated this week
- 🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.☆242Updated last month
- Getting Started with Triton: A Tutorial for Python Beginners☆27Updated 2 months ago
- A Bytecode level Implementation of Symbolic OpCode Translator For PaddlePaddle☆16Updated 2 years ago
- OneFlow->ONNX☆43Updated 2 years ago
- ☕️ A vscode extension for netron, support *.pdmodel, *.nb, *.onnx, *.pb, *.h5, *.tflite, *.pth, *.pt, *.mnn, *.param, etc.☆14Updated 2 years ago
- ☆16Updated last year
- ☆91Updated 3 weeks ago
- A llama model inference framework implemented in CUDA C++☆63Updated last year
- PaddlePaddle Code Convert Toolkit. A deep learning code conversion tool for PaddlePaddle.☆119Updated this week
- https://start.oneflow.org/oneflow-yolo-doc☆22Updated 2 years ago
- Large model deployment in practice: TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- Serving Inside Pytorch☆167Updated 2 weeks ago
- ☆68Updated last week
- Awesome code, projects, books, etc. related to CUDA☆28Updated last week
- A light llama-like llm inference framework based on the triton kernel.☆167Updated 3 months ago
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing Tongyi Qianwen Qwen-7B with TensorRT-LLM☆43Updated 2 years ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆132Updated 2 years ago
- LLM deployment project based on ONNX.☆47Updated last year
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆138Updated 7 months ago
- An Android Application for GLCC☆11Updated 3 years ago
- ☆141Updated last year
- A layered, decoupled deep learning inference engine☆78Updated 10 months ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Updated 2 years ago
- ☆97Updated 9 months ago
- ☆39Updated 7 months ago