leimao / Simple-Inference-ServerLinks

Inference Server Implementation from Scratch for Machine Learning Models

☆24

Alternatives and similar repositories for Simple-Inference-Server

Users that are interested in Simple-Inference-Server are comparing it to the libraries listed below

Sorting:

zhenhuaw-me / onnxcli
ONNX Command-Line Toolbox
☆35Updated 9 months ago
mcarilli / mixed_precision_references
Personal collection of references for high performance mixed precision training.
☆41Updated 5 years ago
eedalong / Dpex
Distributed DataLoader For Pytorch Based On Ray
☆24Updated 3 years ago
scailable / sclblonnx
Scailable ONNX python tools
☆96Updated 8 months ago
DeMoriarty / custom_matmul_kernels
Customized matrix multiplication kernels
☆56Updated 3 years ago
torchexpo / torchexpo
Collection of models and extensions for deployment in PyTorch
☆24Updated 2 years ago
leimao / LibTorch-ResNet-CIFAR
ResNet Implementation, Training, and Inference Using LibTorch C++ API
☆42Updated last year
marsupialtail / sparsednn
Fast sparse deep learning on CPUs
☆53Updated 2 years ago
pytorch / multipy
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…
☆180Updated this week
kshitij12345 / torchnnprofiler
Context Manager to profile the forward and backward times of PyTorch's nn.Module
☆83Updated last year
sdpython / mlprodict
Productionize machine learning predictions, with ONNX or without
☆65Updated last year
pytorch / extension-script
Example repository for custom C++/CUDA operators for TorchScript
☆114Updated 2 years ago
facebookresearch / fairring
Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …
☆65Updated 3 years ago
adityaiitb / PyProf
A GPU performance profiling tool for PyTorch models
☆22Updated 3 years ago
Harry-Chen / InfMoE
Inference framework for MoE layers based on TensorRT with Python binding
☆41Updated 4 years ago
jongwook / tfrecord_lite
Make TFRecord Usable Again
☆88Updated 2 years ago
csukuangfj / OpenCNN
An Open Convolutional Neural Network Framework in C++ From Scratch
☆65Updated 4 years ago
adityaiitb / pyprof2
PyProf2: PyTorch Profiling tool
☆82Updated 5 years ago
Quansight / pytest-pytorch
pytest plugin for a better developer experience when working with the PyTorch test suite
☆44Updated 3 years ago
fpaupier / gRPC-multiprocessing
A boilerplate to use multiprocessing for your gRPC server in your Python project
☆26Updated 3 years ago
zhenhuaw-me / tflite
Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/
☆102Updated 5 months ago
VoVAllen / tf-dlpack
DLPack for Tensorflow
☆35Updated 5 years ago
intel / optimized-models
☆26Updated 2 years ago
usyd-fsalab / NeuralNetworkRandomness
☆14Updated 3 years ago
fumihwh / onnx-pytorch
A code generator from ONNX to PyTorch code
☆138Updated 2 years ago
markwwen / ServingAgent
A simple middleware to improving GPU utilization then speedup online inference.
☆19Updated 4 years ago
minitorch / Module-1
Module 1 - Autodifferentiation
☆22Updated 10 months ago
bwasti / pytorch_compiler_tutorial
Codebase associated with the PyTorch compiler tutorial
☆46Updated 5 years ago
snuspl / parallax
A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.
☆132Updated 3 years ago
triton-inference-server / pytorch_backend
The Triton backend for the PyTorch TorchScript models.
☆154Updated 2 weeks ago