Peter-Chou / libtorch_grpc_servingLinks
pytorch during training, libtorch during serving via gRPC
☆21Updated 5 years ago
Alternatives and similar repositories for libtorch_grpc_serving
Users that are interested in libtorch_grpc_serving are comparing it to the libraries listed below
Sorting:
- Minimal example of using a traced huggingface transformers model with libtorch☆35Updated 4 years ago
- implement bert in pure c++☆35Updated 5 years ago
- BERT Tokenizer in C++☆77Updated 4 years ago
- Möbius Transformation for Fast Inner Product Search on Graph☆22Updated 4 years ago
- transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)☆17Updated 3 years ago
- Serving PyTorch 1.0 Models as a Web Server in C++☆226Updated 5 years ago
- Use PyTorch model in C++ project☆139Updated 4 years ago
- Simple examples of using bazel to cross compile AI applicaions for armv7hf devices.☆25Updated 3 years ago
- gRPC server for hosting ML models trained on any framework in python☆78Updated last year
- Wanwu models release, code will be released soon☆24Updated 2 years ago
- Whisper in TensorRT-LLM☆16Updated last year
- Deploy SCRFD, an efficient high accuracy face detection approach, in your web browser with ncnn and webassembly☆51Updated 2 years ago
- 用C++和Python实现从头实现一个深度学习训练框架☆12Updated 4 years ago
- CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统☆48Updated 7 years ago
- tensorflow mnist demo api interface,include grpc,flask,webpy,tornado,django,rabbitMQ,redis,celery,tf serving,freeze_optimize_quantize☆20Updated 3 years ago
- autoTVM神经网络推理代码优化搜索演示,基于tvm编译开源模型centerface,并使用autoTVM搜索最优推理代码, 最终部署编译为c++代码,演示平台是cuda,可以是其他平台,例如树莓派,安卓手机,苹果手机.Thi is a demonstration of …☆27Updated 4 years ago
- Predict age & gender in one model☆19Updated 7 years ago
- onnx-tensorrt for yolov3☆30Updated 6 years ago
- PyTorch -> ONNX -> TVM for autotuning☆24Updated 5 years ago
- A breakdown of NCNN☆46Updated 4 years ago
- Large input size REAL-TIME Face Detector on Cpp. It can also support face verification using MobileFaceNet+Arcface with real-time inferen…☆52Updated 4 years ago
- TensorFlow and Pytorch practice codes with purity and simplicity.☆33Updated 5 years ago
- C++ model train&inference framework☆223Updated 5 years ago
- Using TensorRT and Triton Server to build BERT model as a service☆13Updated 3 years ago
- A simple middleware to improving GPU utilization then speedup online inference.☆19Updated 4 years ago
- An Open Convolutional Neural Network Framework in C++ From Scratch☆66Updated 4 years ago
- Optimizing keras models using Nvidia TensorRT☆13Updated 5 years ago
- ☆75Updated 3 years ago
- TensorFlow Quantization Example, for TensorFlow Lite☆18Updated 6 years ago
- 纯Python实现的深度学习框架,帮助你理解底层细节斩获offer☆20Updated 2 years ago