huawei-noah / bolt
Bolt is a deep learning library with high performance and heterogeneous flexibility.
☆953Updated 7 months ago
Alternatives and similar repositories for bolt
Users that are interested in bolt are comparing it to the libraries listed below
- A library for high performance deep learning inference on NVIDIA GPUs.☆557Updated 3 years ago
- A primitive library for neural network☆1,369Updated 11 months ago
- benchmark for embedded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆204Updated 4 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆404Updated 2 years ago
- High-performance cross-platform inference engine; you can run Anakin on x86-cpu, arm, nv-gpu, amd-gpu, bitmain and cambricon devices.☆534Updated 3 years ago
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆512Updated last year
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆995Updated last year
- Model Quantization Benchmark☆847Updated 6 months ago
- Dive into Deep Learning Compiler☆646Updated 3 years ago
- ☆1,038Updated last year
- Everything in Torch Fx☆345Updated last year
- MegCC is a deep learning model compiler with an ultra-lightweight runtime, high efficiency, and easy portability☆488Updated last year
- Adlik: Toolkit for Accelerating Deep Learning Inference☆806Updated last year
- Benchmarking Neural Network Inference on Mobile Devices☆383Updated 2 years ago
- TVM integration into PyTorch☆454Updated 5 years ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆904Updated 10 months ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆854Updated 2 months ago
- row-major matmul optimization☆684Updated 2 months ago
- TensorRT Plugin Autogen Tool☆368Updated 2 years ago
- Generate a quantization parameter file for ncnn framework int8 inference☆517Updated 5 years ago
- FeatherCNN is a high performance inference engine for convolutional neural networks.☆1,222Updated 6 years ago
- Deploy your model with TensorRT quickly.☆765Updated last year
- A parser, editor and profiler tool for ONNX models.☆462Updated last week
- ONNX Optimizer☆770Updated last week
- ☆669Updated 4 years ago
- heterogeneity-aware-lowering-and-optimization☆256Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,535Updated 3 months ago
- Edge Machine Learning Library☆196Updated 3 years ago
- micronet, a model compression and deploy lib. compression: 1. quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,263Updated 6 months ago
- AutoML tools chain☆852Updated 2 years ago