3xMike / tritonserver-rs

Rust crate for easy and efficient ML model inference

☆29

Alternatives and similar repositories for tritonserver-rs

Users who are interested in tritonserver-rs are comparing it to the libraries listed below.

mstallmo / tensorrt-rs
Rust library for running TensorRT accelerated deep learning models
☆62 Updated 4 years ago
nbigaouette / onnxruntime-rs
Rust wrapper for Microsoft's ONNX Runtime (version 1.8)
☆312 Updated last year
tomsanbear / candle-einops
☆36 Updated last year
haixuanTao / onnxruntime-rs
Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7)
☆24 Updated 3 years ago
octoml / triton-client-rs
A client library in Rust for Nvidia Triton.
☆30 Updated 2 years ago
Narsil / bindgen_cuda
☆26 Updated 7 months ago
tracel-ai / models
Models and examples built with Burn
☆305 Updated 2 weeks ago
EricLBuehler / candle-lora
Low rank adaptation (LoRA) for Candle.
☆168 Updated 7 months ago
intel / openvino-rs
Rust bindings for OpenVINO™
☆108 Updated 3 months ago
insight-platform / Similari
A framework for building high-performance real-time multiple object trackers
☆252 Updated 7 months ago
KGrewal1 / candle-optimisers
A collection of optimisers for use with candle
☆43 Updated 3 months ago
oddity-ai / async-tensorrt
Asynchronous TensorRT for Rust.
☆37 Updated 2 months ago
FL33TW00D / steelix
Your one stop CLI for ONNX model analysis.
☆47 Updated 3 years ago
robertknight / rten
ONNX neural network inference engine
☆258 Updated this week
mokeyish / candle-ext
An extension library to Candle that provides PyTorch functions not currently available in Candle
☆40 Updated last year
huggingface / hf-hub
Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package
☆242 Updated last month
eugenehp / gpu-fft
GPU based FFT written in Rust and CubeCL
☆24 Updated 5 months ago
ssoudan / tch-m1
Example of tch-rs on M1
☆55 Updated last year
gaxler / llama2.rs
Inference Llama 2 in one file of pure Rust 🦀
☆233 Updated 2 years ago
chenwanqq / candle-llava
implement llava using candle
☆15 Updated last year
LaurentMazare / tboard-rs
Read and write tensorboard data using Rust
☆23 Updated last year
haixuanTao / bert-onnx-rs-server
A Demo server serving Bert through ONNX with GPU written in Rust with <3
☆41 Updated 4 years ago
santiagomed / orca
LLM Orchestrator built in Rust
☆284 Updated last year
guillaume-be / rust-tokenizers
Rust-tokenizer offers high-performance tokenizers for modern language models, including WordPiece, Byte-Pair Encoding (BPE) and Unigram (…
☆327 Updated 2 years ago
EricLBuehler / candle-vllm
Efficient platform for inference and serving local LLMs including an OpenAI compatible API server.
☆529 Updated last week
huggingface / pyo3-special-method-derive
Automatically derive Python dunder methods for your Rust code
☆20 Updated 7 months ago
kykosic / actix-pytorch-example
An example of using Torch rust bindings to serve trained machine learning models via Actix Web
☆16 Updated 4 years ago
EricLBuehler / safetensors_explorer
CLI utility to inspect and explore .safetensors and .gguf files
☆34 Updated 3 weeks ago
boncheolgu / tflite-rs
☆128 Updated last year
cpcdoy / rust-sbert
Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)
☆122 Updated last year