3xMike / tritonserver-rsLinks
Rust crate for easy and efficient ML model inference
☆23Updated 2 months ago
Alternatives and similar repositories for tritonserver-rs
Users that are interested in tritonserver-rs are comparing it to the libraries listed below
Sorting:
- Rust library for running TensorRT accelerated deep learning models☆56Updated 3 years ago
- Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7)☆23Updated 2 years ago
- GPU based FFT written in Rust and CubeCL☆22Updated 2 months ago
- Rust wrapper for Microsoft's ONNX Runtime (version 1.8)☆296Updated last year
- Low rank adaptation (LoRA) for Candle.☆148Updated last month
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆39Updated last year
- ☆30Updated 6 months ago
- A collection of optimisers for use with candle☆36Updated 2 weeks ago
- ☆13Updated last year
- ☆23Updated last month
- Example of tch-rs on M1☆53Updated last year
- Asynchronous TensorRT for Rust.☆30Updated last month
- ☆126Updated 11 months ago
- Rust bindings for OpenVINO™☆96Updated 2 months ago
- A Demo server serving Bert through ONNX with GPU written in Rust with <3☆40Updated 3 years ago
- implement llava using candle☆15Updated 11 months ago
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package☆207Updated 3 months ago
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆77Updated last year
- A framework for building high-performance real-time multiple object trackers☆237Updated 2 months ago
- A client library in Rust for Nvidia Triton.☆30Updated last year
- Asynchronous CUDA for Rust.☆33Updated 7 months ago
- Transformers provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered by t…☆16Updated this week
- Savant Library with new generation primitives re-implemented in Rust☆14Updated last week
- A high-performance RAG indexing pipeline implemented in Rust using LanceDB and Candle☆15Updated 10 months ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆37Updated last year
- Models and examples built with Burn☆244Updated last week
- ONNX neural network inference engine☆210Updated this week
- An example of using Torch rust bindings to serve trained machine learning models via Actix Web☆16Updated 3 years ago
- Dead simple implementation of Discrete Kalman filter for object tracking purposes☆15Updated last year
- A simplified example in Rust of training a neural network and then using it based on the Candle Framework by Hugging Face.☆38Updated last year