microsoft / batch-inference

Dynamic batching library for deep learning inference, with tutorials for LLM and GPT scenarios.
81 · Updated last month

Related projects: