octoml / triton-client-rs
A client library in Rust for Nvidia Triton.
☆30 Updated last year
Alternatives and similar repositories for triton-client-rs
Users who are interested in triton-client-rs compare it to the libraries listed below
- Rust crate for some audio utilities ☆23 Updated 2 months ago
- ☆23 Updated last month
- Example of tch-rs on M1 ☆53 Updated last year
- ☆30 Updated 6 months ago
- Rust wrapper for Microsoft's ONNX Runtime (version 1.8) ☆296 Updated last year
- An extension library to Candle that provides PyTorch functions not currently available in Candle ☆39 Updated last year
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package ☆207 Updated 3 months ago
- implement llava using candle ☆15 Updated 11 months ago
- Asynchronous CUDA for Rust. ☆33 Updated 7 months ago
- Your one stop CLI for ONNX model analysis. ☆47 Updated 2 years ago
- Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7) ☆23 Updated 2 years ago
- Rust library for whisper.cpp compatible Mel spectrograms ☆68 Updated 3 weeks ago
- A collection of optimisers for use with candle ☆36 Updated 2 weeks ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… ☆24 Updated 2 months ago
- GPU based FFT written in Rust and CubeCL ☆22 Updated 2 months ago
- Low rank adaptation (LoRA) for Candle. ☆147 Updated last month
- A Demo server serving Bert through ONNX with GPU written in Rust with <3 ☆40 Updated 3 years ago
- Rust implementation of Huggingface transformers pipelines using onnxruntime backend with bindings to C# and C. ☆39 Updated 2 years ago
- Efficient platform for inference and serving local LLMs including an OpenAI compatible API server. ☆372 Updated this week
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust ☆80 Updated last year
- ☆86 Updated 4 months ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas… ☆147 Updated this week
- Dataflow is a data processing library, primarily for machine learning. ☆21 Updated last year
- Sample Python extension using Rust/PyO3/tch to interact with PyTorch ☆36 Updated last year
- 8-bit floating point types for Rust ☆45 Updated 2 months ago
- Automatically derive Python dunder methods for your Rust code ☆19 Updated last month
- A diffusers API in Burn (Rust) ☆19 Updated 10 months ago
- Experimental ONNX implementation for WASI NN. ☆48 Updated 3 years ago
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed! ☆104 Updated last year
- ☆26 Updated last year