YingkunZhou / EdgeTransformerBench
Edge/mobile transformer-based vision DNN inference benchmark
☆16Updated last month
Alternatives and similar repositories for EdgeTransformerBench
Users who are interested in EdgeTransformerBench are comparing it to the libraries listed below.
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆36Updated this week
- Tencent NCNN with added CUDA support☆70Updated 4 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆10Updated 3 years ago
- Common libraries for PPL projects☆29Updated 7 months ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Updated 2 years ago
- Extension package of Apache TVM (Machine Learning Compiler) for Renesas DRP-AI accelerators powered by Edgecortix MERA(TM) Based Apache T…☆54Updated 3 weeks ago
- Compass Apache TVM is enhanced based on the Apache TVM for wide range of Neural Network (NN) models quick support, optimization and heter…☆20Updated this week
- ☆24Updated 2 years ago
- Convert tflite to JSON and make it editable in the IDE. It also converts the edited JSON back to tflite binary.☆27Updated 2 years ago
- ☆17Updated last year
- PyTorch -> ONNX -> TVM for autotuning☆24Updated 5 years ago
- symmetric int8 gemm☆67Updated 5 years ago
- ☆41Updated 2 years ago
- Sandbox for TVM and playing around!☆22Updated 2 years ago
- VeriSilicon Tensor Interface Module☆238Updated 2 weeks ago
- Yet another Polyhedra Compiler for DeepLearning☆19Updated 2 years ago
- Inference of quantization aware trained networks using TensorRT☆83Updated 2 years ago
- Large Language Model Onnx Inference Framework☆36Updated 9 months ago
- Count number of parameters / MACs / FLOPS for ONNX models.☆94Updated last year
- ☆17Updated 5 years ago
- ☆68Updated 2 years ago
- A tool to convert a TensorRT engine/plan to a fake ONNX model☆41Updated 2 years ago
- ONNX converter and optimizer scripts for Kneron hardware.☆40Updated last year
- Converter from MegEngine to other frameworks☆70Updated 2 years ago
- A demo of how to write a high-performance convolution that runs on Apple silicon☆56Updated 3 years ago
- Tengine Pipe (管子) is a helper tool for quickly producing demos☆13Updated 4 years ago
- ONNX Command-Line Toolbox☆35Updated last year
- ☆25Updated 4 years ago
- Model compression for ONNX☆97Updated 11 months ago
- Deep insight into TensorRT, including but not limited to QAT, PTQ, plugin, triton_inference, CUDA☆19Updated last month