Ascend / toolsLinks
☆20Updated 2 years ago
Alternatives and similar repositories for tools
Users that are interested in tools are comparing it to the libraries listed below
Sorting:
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Updated 2 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆42Updated 3 years ago
- A codebase & model zoo for pretrained backbone based on MegEngine.☆32Updated 2 years ago
- A Toolkit to Help Optimize Large Onnx Model☆163Updated 3 months ago
- Large Language Model Onnx Inference Framework☆36Updated 2 months ago
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆50Updated 2 years ago
- Wanwu models release, code will be released soon☆24Updated 3 years ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆51Updated 2 years ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- MegEngine到其他框架的转换器☆70Updated 2 years ago
- ☆18Updated 2 years ago
- ☆14Updated 4 years ago
- Serving Inside Pytorch☆170Updated this week
- 手摸手 美团 YOLOv6模型训练和TensorRT端到端部署方案教程☆34Updated 3 years ago
- ☕️ A vscode extension for netron, support *.pdmodel, *.nb, *.onnx, *.pb, *.h5, *.tflite, *.pth, *.pt, *.mnn, *.param, etc.☆14Updated 2 years ago
- ☆28Updated 7 months ago
- A set of examples around MegEngine☆31Updated 2 years ago
- ☆120Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Updated 2 years ago
- ☆104Updated 4 years ago
- ☆42Updated 3 years ago
- Transformer related optimization, including BERT, GPT☆17Updated 2 years ago
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆27Updated last year
- 高效部署:YOLO X, V3, V4, V5, V6, V7, V8, EdgeYOLO TRT推理 ™️ ,前后处理均由CUDA核函数实现 CPP/CUDA🚀☆53Updated 2 years ago
- llm deploy project based onnx.☆49Updated last year
- 🐱 ncnn int8 模型量化评估☆14Updated 3 years ago
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation.☆35Updated 3 years ago
- ☆47Updated 2 years ago
- Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆37Updated 3 years ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Updated last year