markwwen / ServingAgentLinks
A simple middleware to improving GPU utilization then speedup online inference.
☆19Updated 4 years ago
Alternatives and similar repositories for ServingAgent
Users that are interested in ServingAgent are comparing it to the libraries listed below
Sorting:
- gRPC server for hosting ML models trained on any framework in python☆78Updated 3 months ago
- Colab notebooks for d2l-book☆11Updated 5 years ago
- 图片简易标注工具,标注类似ICDAR数据集,支持多边形标注,文本标注,方便OCR数据集标注。☆55Updated 6 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 5 years ago
- Easy Multiprocessing for Python☆43Updated 5 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 7 years ago
- Automatically build the deep learning models with ENAS☆31Updated 7 years ago
- Code for the paper "Understanding the Role of Momentum in Stochastic Gradient Methods"☆14Updated 6 years ago
- The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Natu…☆48Updated 4 years ago
- ☆13Updated 7 years ago
- Wanwu models release, code will be released soon☆24Updated 3 years ago
- List of papers that applied graph network to NLP☆13Updated 6 years ago
- 以孤立语假设和宽度优先搜索为基础,构建了一种多通道堆叠注意力Transformer结构的斗地主ai☆94Updated 4 years ago
- code scripts for blog posts I published☆13Updated 5 years ago
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector.☆38Updated 2 years ago
- AutoTorch, A HPO Toolkit☆60Updated 5 years ago
- ☆26Updated 6 years ago
- A pytorch based classification experiments template☆46Updated 4 years ago
- 国内外数据竞赛资讯整理☆18Updated 4 years ago
- tunz's CUDA pytorch operator (MaskedSoftmax)☆75Updated 6 years ago
- A mxnet object detection library contains implementations of RFCN, FCOS, RetinaNet, OpenPose, etc..☆31Updated 4 years ago
- Implementation for <Neural Similarity Learning> in NeurIPS'19.☆33Updated 5 years ago
- Arxiv Sanity with novel paper search☆41Updated 6 years ago
- ☆24Updated 4 years ago
- This is the re-implementation of group normalization in MXNet Symbol,Module and Gluon☆23Updated 6 years ago
- PyTorch 1.0 inference in C++ on Windows10 platforms☆89Updated 6 years ago
- Deep Neural Network Compression based on Student-Teacher Network☆14Updated 2 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 7 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Updated 3 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆62Updated last year