markwwen / ServingAgent
A simple middleware that improves GPU utilization to speed up online inference.
☆19Updated 4 years ago
Alternatives and similar repositories for ServingAgent
Users that are interested in ServingAgent are comparing it to the libraries listed below
- gRPC server for hosting ML models trained on any framework in Python☆78Updated last month
- Colab notebooks for d2l-book☆11Updated 5 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 5 years ago
- tunz's CUDA pytorch operator (MaskedSoftmax)☆75Updated 6 years ago
- Easy Multiprocessing for Python☆43Updated 4 years ago
- A simple image annotation tool for ICDAR-style datasets; supports polygon and text annotation, convenient for labeling OCR datasets.☆55Updated 6 years ago
- ☆26Updated 5 years ago
- The official implementation of You Only Compress Once: Towards Effective and Elastic BERT Compression via Exploit-Explore Stochastic Natu…☆48Updated 4 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Updated 6 years ago
- Automatically build the deep learning models with ENAS☆31Updated 7 years ago
- Similarity search engine built around Faiss library☆78Updated 2 years ago
- Code for the paper "Understanding the Role of Momentum in Stochastic Gradient Methods"☆14Updated 5 years ago
- ☆24Updated 4 years ago
- ☆13Updated 7 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 7 years ago
- A PyTorch-based template for classification experiments☆46Updated 4 years ago
- List of papers that apply graph networks to NLP☆13Updated 6 years ago
- tf2.0 implementation of circle loss☆32Updated 5 years ago
- Deep Neural Network Compression based on Student-Teacher Network☆14Updated 2 years ago
- code scripts for blog posts I published☆13Updated 5 years ago
- AutoTorch, A HPO Toolkit☆60Updated 5 years ago
- Code for "Free-Lunch Saliency via Attention in Atari Agents"☆16Updated 4 years ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Updated 7 years ago
- A very naive and simple benchmark between dlib and pytorch in terms of space and time☆19Updated 5 years ago
- ☆33Updated last year
- Deep learning interview questions (mostly from Nowcoder)☆45Updated 5 years ago
- Slides from various talks I gave☆18Updated 6 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialogue for under…☆10Updated 2 years ago
- Large Scale BERT Distillation☆33Updated 2 years ago
- A Dou Dizhu AI with a multi-channel stacked-attention Transformer architecture, built on the isolating-language hypothesis and breadth-first search☆94Updated 4 years ago