Ascend / tools
☆15Updated last year
Alternatives and similar repositories for tools:
Users who are interested in tools are comparing it to the libraries listed below
- An easy way to run, test, benchmark and tune OpenCL kernel files☆23Updated last year
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆41Updated last year
- Wanwu models release, code will be released soon☆24Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Inference v2.1 benchmark.☆18Updated last year
- ☆13Updated last year
- [CVPR-2023] Towards Any Structural Pruning☆16Updated last year
- A codebase & model zoo for pretrained backbone based on MegEngine.☆33Updated last year
- ☕️ A vscode extension for netron, support *.pdmodel, *.nb, *.onnx, *.pb, *.h5, *.tflite, *.pth, *.pt, *.mnn, *.param, etc.☆12Updated last year
- ☆97Updated 3 years ago
- A toolkit for developers to simplify the transformation of nn.Module instances. It now corresponds to Pytorch.fx.☆13Updated last year
- Paddle code conversion toolkit☆22Updated last year
- Transformer related optimization, including BERT, GPT☆17Updated last year
- Converter from MegEngine to other frameworks☆69Updated last year
- ☆69Updated last year
- Yet another Polyhedra Compiler for DeepLearning☆19Updated last year
- ONNX Command-Line Toolbox☆35Updated 4 months ago
- Utility scripts for editing or modifying onnx models. Utility scripts to summarize onnx model files along with visualization for loop ope…☆79Updated 3 years ago
- Converts networks from different platforms to an Intermediate Representation (IR)☆44Updated 6 years ago
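- A demonstration of autoTVM search-based optimization of neural network inference code: compiles the open-source centerface model with tvm, uses autoTVM to search for the optimal inference code, and finally deploys it compiled as C++ code. The demo platform is cuda, but other platforms such as Raspberry Pi, Android phones, and iPhones are possible. This is a demonstration of …☆27Updated 3 years ago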
- ☆18Updated last year
- symmetric int8 gemm☆66Updated 4 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆20Updated 11 months ago
- An object detection codebase based on MegEngine.☆28Updated 2 years ago
- Simple CuDNN wrapper☆29Updated 9 years ago
- ☆23Updated last year
- ☆13Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated 8 months ago
- Quantization-aware training package for NCNN on PyTorch☆70Updated 3 years ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆40Updated last year
- ☢️ TensorRT 2023 second round: inference acceleration and optimization of the Llama model based on TensorRT-LLM☆44Updated last year