Tencent / Forward
A library for high performance deep learning inference on NVIDIA GPUs.
☆552Updated 3 years ago
Alternatives and similar repositories for Forward:
Users who are interested in Forward are comparing it to the libraries listed below:
- Server-side deep learning deployment examples☆451Updated 5 years ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆941Updated 8 months ago
- TensorRT Plugin Autogen Tool☆369Updated 2 years ago
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆501Updated 5 months ago
- A primitive library for neural network☆1,330Updated 4 months ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆855Updated 3 months ago
- MegCC is a deep learning model compiler with an ultra-lightweight runtime that is efficient and easy to port☆483Updated 5 months ago
- Deploy your model with TensorRT quickly.☆766Updated last year
- DeepLearning Framework Performance Profiling Toolkit☆285Updated 3 years ago
- ☆1,023Updated last year
- Compiler Infrastructure for Neural Networks☆145Updated last year
- Benchmark for embedded-AI deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆204Updated 4 years ago
- Dive into Deep Learning Compiler☆646Updated 2 years ago
- High-performance cross-platform inference engine; you can run Anakin on x86-cpu, arm, nv-gpu, amd-gpu, bitmain and cambricon devices.☆533Updated 2 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆396Updated 2 years ago
- Simple Dynamic Batching Inference☆145Updated 3 years ago
- Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.☆267Updated 2 years ago
- Use PyTorch model in C++ project☆137Updated 3 years ago
- oneflow documentation☆68Updated 9 months ago
- heterogeneity-aware-lowering-and-optimization☆255Updated last year
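- A general-purpose deep learning inference tool for quickly bringing deep learning models trained with the TensorFlow, PyTorch, and Caffe frameworks into production.☆411Updated 3 years ago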
- ☆127Updated 3 years ago
- TVM integration into PyTorch☆452Updated 5 years ago
- row-major matmul optimization☆619Updated last year
- Running BERT without Padding☆471Updated 3 years ago
- deepx_core is a foundational library focused on tensor computation and deep learning☆375Updated 2 years ago
- Optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052☆471Updated last year
- Model Quantization Benchmark☆798Updated 2 months ago
- TensorFlow source code reading notes☆190Updated 6 years ago
- Edge Machine Learning Library☆194Updated 2 years ago