YellowOldOdd/SDBI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YellowOldOdd/SDBI)

YellowOldOdd / SDBI

Simple Dynamic Batching Inference

☆144

Alternatives and similar repositories for SDBI

Users that are interested in SDBI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FrancescoB-Vintra / fp16tensorRT
View on GitHub
TensorRT half precision inference routine on a API-based TensorRT model
☆12Jul 3, 2018Updated 8 years ago
bytedance / effective_transformer
View on GitHub
Running BERT without Padding
☆479Mar 18, 2022Updated 4 years ago
ganler / ResearchReading
View on GitHub
General system research material (not limited to paper) reading notes.
☆22Mar 17, 2021Updated 5 years ago
L1aoXingyu / llm-infer-bench
View on GitHub
☆12Sep 1, 2023Updated 2 years ago
abeardear / ncnn-yolo
View on GitHub
convert pytorch trained yolo model to ncnn for Flexible deployment
☆10Aug 30, 2018Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chenjun2hao / ocr_annotation
View on GitHub
using python and flask for ocr annotation web tool
☆25Jan 7, 2020Updated 6 years ago
OpenPPL / ppl.llm.serving
View on GitHub
☆128Dec 24, 2024Updated last year
wenwenyu / MASTER-pytorch
View on GitHub
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
☆281Dec 26, 2021Updated 4 years ago
Jack47 / hack-SysML
View on GitHub
The road to hack SysML and become an system expert
☆516Sep 25, 2024Updated last year
thu-pacman / PET
View on GitHub
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆126Jun 23, 2022Updated 4 years ago
markwwen / ServingAgent
View on GitHub
A simple middleware to improving GPU utilization then speedup online inference.
☆19Feb 22, 2021Updated 5 years ago
gautam-aayush / form-data-augmentation
View on GitHub
Repository for augmenting data in forms, invoices and receipts for document image understanding
☆17May 6, 2021Updated 5 years ago
Funatiq / gossip
View on GitHub
gossip: Efficient Communication Primitives for Multi-GPU Systems
☆62Jul 1, 2022Updated 4 years ago
igormq / ctcdecode-pytorch
View on GitHub
Python implementation of CTC beam search decoder + agnostic LM scorer
☆20Dec 16, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
vancemiller / CUDA-preemption
View on GitHub
Experiments evaluating preemption on the NVIDIA Pascal architecture
☆16Nov 10, 2016Updated 9 years ago
tommyMessi / crnn_ctc-centerloss
View on GitHub
ctcloss + centerloss crnn text recognition
☆200Jan 28, 2021Updated 5 years ago
triton-inference-server / server
View on GitHub
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
☆10,858Updated this week
Media-Smart / volksdep
View on GitHub
volksdep is an open-source toolbox for deploying and accelerating PyTorch, ONNX and TensorFlow models with TensorRT.
☆285Feb 5, 2021Updated 5 years ago
onnxsim / onnxsim
View on GitHub
Simplify your onnx model
☆4,372Updated this week
XLPRUtils / pyxllib
View on GitHub
厦门理工模式识别团队通用python代码工具库
☆16Jun 23, 2026Updated 3 weeks ago
ModelTC / pyvlova
View on GitHub
Yet another Polyhedra Compiler for DeepLearning
☆19Apr 14, 2023Updated 3 years ago
DaertML / context_distillation
View on GitHub
Framework to achieve context distillation in LLMs
☆15Nov 24, 2023Updated 2 years ago
MauritsBleeker / Bi-STET
View on GitHub
Implementation of Bidirectional Scene Text Recognition with a Single Decoder
☆65Nov 24, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
DataXujing / Bert_TensorRT
View on GitHub
Bert TensorRT模型加速部署
☆10Apr 1, 2022Updated 4 years ago
NVIDIA-AI-IOT / torch2trt
View on GitHub
An easy to use PyTorch to TensorRT converter
☆4,879Aug 17, 2024Updated last year
bytedance / lightseq
View on GitHub
LightSeq: A High Performance Library for Sequence Processing and Generation
☆3,296May 16, 2023Updated 3 years ago
LitLeo / TensorRT_Tutorial
View on GitHub
☆1,053Mar 13, 2024Updated 2 years ago
tvmai / meetup-slides
View on GitHub
Place for meetup slides
☆139Oct 11, 2020Updated 5 years ago
SymbioticLab / Salus
View on GitHub
Fine-grained GPU sharing primitives
☆149Jul 28, 2025Updated 11 months ago
tlc-pack / cutlass_fpA_intB_gemm
View on GitHub
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
☆96Jun 21, 2026Updated last month
bytedance / ByteTransformer
View on GitHub
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
☆479Mar 15, 2024Updated 2 years ago
Vivianyzw / PANnet
View on GitHub
Implementation of the paper "Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network"
☆16Nov 1, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
starmee / AI-Notes
View on GitHub
My learning notes about AI, including Machine Learning and Deep Learning.
☆18Jun 30, 2019Updated 7 years ago
shouxieai / cpp-rotation-album
View on GitHub
cpp rotation album，基于cpp eigen实现的3d旋转相册，GAMES101复现内容
☆12Jul 25, 2022Updated 3 years ago
google / iopddl
View on GitHub
Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning
☆25May 12, 2025Updated last year
sjtu-epcc / Tacker
View on GitHub
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆33Feb 10, 2025Updated last year
yester31 / Cutlass_EX
View on GitHub
study of cutlass
☆22Nov 10, 2024Updated last year
THUDM / MRT
View on GitHub
MRT: Tracing the Evolution of Scientific Publications (TKDE 2021)
☆18Mar 23, 2023Updated 3 years ago
NVIDIA / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆6,442Mar 27, 2024Updated 2 years ago