ShigureLab / python-lib-starter
Just a template for quickly creating a python library.
☆8Updated last month
Alternatives and similar repositories for python-lib-starter:
Users that are interested in python-lib-starter are comparing it to the libraries listed below
- PFCC 社区博客☆11Updated this week
- ☆16Updated last year
- 飞桨护航计划集训营☆18Updated this week
- Awesome code, projects, books, etc. related to CUDA☆16Updated last week
- A Bytecode level Implementation of Symbolic OpCode Translator For PaddlePaddle☆16Updated last year
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- An experimental project for paddle python IR.☆15Updated last year
- OneFlow Serving☆20Updated 2 weeks ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆15Updated last year
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆40Updated 2 years ago
- llm deploy project based onnx.☆36Updated 6 months ago
- cpp syntactic sugar☆9Updated 7 months ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆50Updated 5 months ago
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆73Updated 3 weeks ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆67Updated last week
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- A practical way of learning Swizzle☆18Updated 2 months ago
- 🐱 ncnn int8 模型量化评估☆13Updated 2 years ago
- Music large model based on InternLM2-chat.☆22Updated 4 months ago
- 【HACKATHON 预备营】飞桨启航计划集训营☆16Updated this week
- OneFlow->ONNX☆43Updated 2 years ago
- Multiple GEMM operators are constructed with cutlass to support LLM inference.☆17Updated 7 months ago
- 📚FFPA(Split-D): Yet another Faster Flash Attention with O(1) GPU SRAM complexity large headdim, 1.8x~3x↑🎉 faster than SDPA EA.☆169Updated 3 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated 10 months ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆23Updated last year
- ☆124Updated last year
- ☕️ A vscode extension for netron, support *.pdmodel, *.nb, *.onnx, *.pb, *.h5, *.tflite, *.pth, *.pt, *.mnn, *.param, etc.☆13Updated last year
- ☆63Updated 5 months ago
- 分层解耦的深度学习推理引擎☆72Updated 2 months ago
- MegEngine到其他框架的转换器☆69Updated 2 years ago