pixelspark / poly

A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust

☆79

Alternatives and similar repositories for poly

Users who are interested in poly compare it to the libraries listed below.

EricLBuehler / candle-lora
Low rank adaptation (LoRA) for Candle.
☆168 Updated 7 months ago
gaxler / llama2.rs
Inference Llama 2 in one file of pure Rust 🦀
☆235 Updated 2 years ago
ShelbyJenkins / candle_embed
A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face
☆46 Updated last year
jimexist / surya-rs
Rust implementation of Surya
☆63 Updated 9 months ago
AmineDiro / cria
OpenAI compatible API for serving LLAMA-2 model
☆218 Updated 2 years ago
santiagomed / orca
LLM Orchestrator built in Rust
☆285 Updated last year
ShelbyJenkins / llm_utils
llm_utils: Basic LLM tools, best practices, and minimal abstraction.
☆47 Updated 9 months ago
EndlessReform / fish-speech.rs
A Fish Speech implementation in Rust, with Candle.rs
☆105 Updated 6 months ago
jeroenvlek / gpt-from-scratch-rs
Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle
☆76 Updated last year
chelsea0x3b / llama-dfdx
LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!
☆110 Updated 2 years ago
edgenai / llama_cpp-rs
High-level, optionally asynchronous Rust bindings to llama.cpp
☆235 Updated last year
ShelbyJenkins / llm_client
The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes
☆239 Updated 3 months ago
huggingface / hf-hub
Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package
☆243 Updated 2 weeks ago
minskylab / auto-rust
auto-rust is an experimental project that automatically generates Rust code with LLMs (Large Language Models) during compilation, utilizing…
☆44 Updated last year
KGrewal1 / candle-optimisers
A collection of optimisers for use with candle
☆44 Updated this week
tyrchen / qdrant-lib
Extract core logic from qdrant and make it available as a library.
☆61 Updated last year
cpcdoy / rust-sbert
Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)
☆122 Updated last year
Gadersd / llama2-burn
Llama2 LLM ported to Rust burn
☆278 Updated last year
mokeyish / candle-ext
An extension library to Candle that provides PyTorch functions not currently available in Candle
☆40 Updated last year
fennel-ai / fann
Approx nearest neighbor search in Rust
☆165 Updated 2 years ago
atoma-network / atoma-infer
Fast serverless LLM inference, in Rust.
☆108 Updated last month
LaurentMazare / glim
☆19 Updated last year
ssoudan / tch-m1
Example of tch-rs on M1
☆55 Updated last year
danielclough / fireside-chat
An LLM interface (chat bot) implemented in pure Rust using HuggingFace/Candle over Axum Websockets, an SQLite Database, and a Leptos (Was…
☆137 Updated last year
JackMatthewRimmer / rust-rag-toolchain
Library for doing RAG
☆78 Updated last week
IncredibleDevHQ / agent-panel
AI gateway and observability server written in Rust. Designed to help optimize multi-agent workflows.
☆64 Updated last year
jafioti / dataflow
Dataflow is a data processing library, primarily for machine learning.
☆24 Updated 2 years ago
oxideai / mlx-rs
Unofficial Rust bindings to Apple's mlx framework
☆210 Updated last week
shafishlabs / llmchain-rs
🦀Rust + Large Language Models - Make AI Services Freely and Easily.
☆182 Updated last year
tomsanbear / bitnet-rs
Implementing the BitNet model in Rust
☆42 Updated last year