Peter-Chou / libtorch_grpc_servingLinks

pytorch during training, libtorch during serving via gRPC

☆21

Alternatives and similar repositories for libtorch_grpc_serving

Users that are interested in libtorch_grpc_serving are comparing it to the libraries listed below

Sorting:

dhpollack / huggingface_libtorch
Minimal example of using a traced huggingface transformers model with libtorch
☆35Updated 4 years ago
LeeJuly30 / BERTCpp
implement bert in pure c++
☆35Updated 5 years ago
ericperfect / libtorch_tokenizer
BERT Tokenizer in C++
☆77Updated 4 years ago
sunbelbd / mobius
Möbius Transformation for Fast Inner Product Search on Graph
☆22Updated 4 years ago
Peter-Chou / transformer_cpp_tokenizers
transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)
☆17Updated 3 years ago
Wizaron / pytorch-cpp-inference
Serving PyTorch 1.0 Models as a Web Server in C++
☆226Updated 5 years ago
DeepVAC / libdeepvac
Use PyTorch model in C++ project
☆139Updated 4 years ago
AI-OP / edge-brain
Simple examples of using bazel to cross compile AI applicaions for armv7hf devices.
☆25Updated 3 years ago
Abhijit-2592 / model-server
gRPC server for hosting ML models trained on any framework in python
☆78Updated last year
lucasjinreal / wanwu_release
Wanwu models release, code will be released soon
☆24Updated 2 years ago
EdVince / whisper-trtllm
Whisper in TensorRT-LLM
☆16Updated last year
nihui / ncnn-webassembly-scrfd
Deploy SCRFD, an efficient high accuracy face detection approach, in your web browser with ncnn and webassembly
☆51Updated 2 years ago
Arctanxy / ToyNet
用C++和Python实现从头实现一个深度学习训练框架
☆12Updated 4 years ago
Sundy1219 / ctc_beam_search_lm
CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统
☆48Updated 7 years ago
watersink / grpc_tensorflow_demo
tensorflow mnist demo api interface，include grpc,flask,webpy,tornado,django,rabbitMQ,redis,celery,tf serving，freeze_optimize_quantize
☆20Updated 3 years ago
leoluopy / autotvm_tutorial
autoTVM神经网络推理代码优化搜索演示，基于tvm编译开源模型centerface，并使用autoTVM搜索最优推理代码，　最终部署编译为c++代码，演示平台是cuda，可以是其他平台，例如树莓派，安卓手机，苹果手机．Thi is a demonstration of …
☆27Updated 4 years ago
kuangliu / pytorch-agender
Predict age & gender in one model
☆19Updated 7 years ago
faedtodd / Tensorrt-yolov3-win10
onnx-tensorrt for yolov3
☆30Updated 6 years ago
jack-willturner / pytorch-onnx-tvm
PyTorch -> ONNX -> TVM for autotuning
☆24Updated 5 years ago
Zhengtq / ncnn_breakdown
A breakdown of NCNN
☆46Updated 4 years ago
markson14 / FaceRecognitionCpp
Large input size REAL-TIME Face Detector on Cpp. It can also support face verification using MobileFaceNet+Arcface with real-time inferen…
☆52Updated 4 years ago
lucasjinreal / tfboys
TensorFlow and Pytorch practice codes with purity and simplicity.
☆33Updated 5 years ago
LieluoboAi / radish
C++ model train&inference framework
☆223Updated 5 years ago
xiangyangkan / faster-bert-as-service
Using TensorRT and Triton Server to build BERT model as a service
☆13Updated 3 years ago
markwwen / ServingAgent
A simple middleware to improving GPU utilization then speedup online inference.
☆19Updated 4 years ago
csukuangfj / OpenCNN
An Open Convolutional Neural Network Framework in C++ From Scratch
☆66Updated 4 years ago
srihari-humbarwadi / TensorRT-for-keras
Optimizing keras models using Nvidia TensorRT
☆13Updated 5 years ago
huismiling / wenet_trt8
☆75Updated 3 years ago
tutorials-with-ci / tensorflow-quantization-example
TensorFlow Quantization Example, for TensorFlow Lite
☆18Updated 6 years ago
miemie2013 / Pure_Python_Deep_Learning
纯Python实现的深度学习框架，帮助你理解底层细节斩获offer
☆20Updated 2 years ago