markwwen / ServingAgentLinks

A simple middleware to improving GPU utilization then speedup online inference.

☆19

Alternatives and similar repositories for ServingAgent

Users that are interested in ServingAgent are comparing it to the libraries listed below

Sorting:

Abhijit-2592 / model-server
gRPC server for hosting ML models trained on any framework in python
☆78Updated last year
d2l-ai / d2l-book-colab
Colab notebooks for d2l-book
☆11Updated 5 years ago
RahulBhalley / turing-gan
Source code for "Training Generative Adversarial Networks Via Turing Test".
☆13Updated 5 years ago
vandyyu / dataset_labeling
图片简易标注工具，标注类似ICDAR数据集，支持多边形标注，文本标注，方便OCR数据集标注。
☆54Updated 6 years ago
WarBean / emp
Easy Multiprocessing for Python
☆43Updated 4 years ago
alondj / Pytorch-Gpipe
☆26Updated 5 years ago
tobegit3hub / enas_model
Automatically build the deep learning models with ENAS
☆31Updated 7 years ago
merrymercy / NALU
Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)
☆31Updated 6 years ago
Kipok / understanding-momentum
Code for the paper "Understanding the Role of Momentum in Stochastic Gradient Methods"
☆14Updated 5 years ago
pletessier / taranis
Similarity search engine built around Faiss library
☆77Updated 2 years ago
ChristopherSweeney / SlimNets
Various implementations and experimentation for deep neural network model compression
☆24Updated 6 years ago
markusnagel / tf-faster-rcnn
Tensorflow Faster RCNN
☆7Updated 8 years ago
learning-luke / pytorch-experiments-template
A pytorch based classification experiments template
☆46Updated 4 years ago
lucasjinreal / wanwu_release
Wanwu models release, code will be released soon
☆24Updated 2 years ago
SHTUPLUS / ContextLab
ContextLab: A Toolbox for Context Feature Augmentation developed with PyTorch
☆39Updated 5 years ago
mkolod / fast_upsampling
☆33Updated last year
tunz / tcop-pytorch
tunz's CUDA pytorch operator (MaskedSoftmax)
☆75Updated 6 years ago
Zhengyu-Li / Deep-Network-Compression-based-on-Student-Teacher-Network-
Deep Neural Network Compression based on Student-Teacher Network
☆14Updated 2 years ago
paper-submissions / norm_matters
☆13Updated 7 years ago
MAC-AutoML / YOCO-BERT
The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Natu…
☆48Updated 4 years ago
luuuyi / ShuffleNetV2_vs_MnasNet.PyTorch
Contrast between ShuffleNet V2 and MnasNet.(Non-official implement In PyTorch)
☆12Updated 6 years ago
zccyman / pytorch-inference
PyTorch 1.0 inference in C++ on Windows10 platforms
☆89Updated 6 years ago
opringle / gluonrank
Ranking made easy
☆36Updated 6 years ago
StacyYang / AutoTorch
AutoTorch, A HPO Toolkit
☆60Updated 5 years ago
catalyst-team / awesome-catalyst-list
☆54Updated 5 years ago
Zehaos / pycaffe-yolo
YOLO reimplement in caffe, written with python layer.
☆13Updated 8 years ago
zhengying-liu / autodl_starting_kit_stable
Starting kit for AutoCV/AutoDL challenge (https://autodl.chalearn.org)
☆40Updated 5 years ago
microsoft / GEM
☆24Updated 4 years ago
Shujian2015 / graphnet_nlp_paper
List of papers that applied graph network to NLP
☆13Updated 6 years ago
MccreeZhao / QAMFace
Pytorch implementation of Quadratic Additive Angular Margin Loss for Face Recognition
☆35Updated 4 years ago