Ascend / toolsLinks
☆20Updated 2 years ago
Alternatives and similar repositories for tools
Users that are interested in tools are comparing it to the libraries listed below
Sorting:
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Updated 2 years ago
- Large Language Model Onnx Inference Framework☆36Updated 2 months ago
- A Toolkit to Help Optimize Large Onnx Model☆163Updated 3 months ago
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆50Updated 2 years ago
- Wanwu models release, code will be released soon☆24Updated 3 years ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆51Updated 2 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆42Updated 3 years ago
- ☆18Updated 2 years ago
- Transformer related optimization, including BERT, GPT☆17Updated 2 years ago
- [CVPR-2023] Towards Any Structural Pruning☆17Updated 2 years ago
- A codebase & model zoo for pretrained backbone based on MegEngine.☆32Updated 2 years ago
- MegEngine到其他框架的转换器☆70Updated 2 years ago
- Datasets, Transforms and Models specific to Computer Vision☆90Updated 2 years ago
- Serving Inside Pytorch☆170Updated 2 weeks ago
- ☆120Updated 2 years ago
- ☕️ A vscode extension for netron, support *.pdmodel, *.nb, *.onnx, *.pb, *.h5, *.tflite, *.pth, *.pt, *.mnn, *.param, etc.☆14Updated 2 years ago
- ☆104Updated 4 years ago
- run ChatGLM2-6B in BM1684X☆49Updated last year
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆27Updated last year
- Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆37Updated 3 years ago
- ☆42Updated 3 years ago
- llm deploy project based onnx.☆49Updated last year
- ☆25Updated 2 years ago
- A set of examples around MegEngine☆31Updated 2 years ago
- autoTVM神经网络推理代码优化搜索演示,基于tvm编译开源模型centerface,并使用autoTVM搜索最优推理代码, 最终部署编译为c++代码,演示平台是cuda,可以是其他平台,例如树莓派,安卓手机,苹果手机.Thi is a demonstration of …☆29Updated 4 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Updated 2 years ago
- 跨平台的容器化Linux桌面环境☆74Updated 11 months ago
- Whisper in TensorRT-LLM☆17Updated 2 years ago
- deploy onnx models with TensorRT and LibTorch☆19Updated 4 years ago
- ☆125Updated 2 years ago