pixelspark / poly
A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust
☆79, updated last year
Alternatives and similar repositories for poly
Users who are interested in poly are comparing it to the libraries listed below.
- Low rank adaptation (LoRA) for Candle. ☆158, updated 4 months ago
- Inference Llama 2 in one file of pure Rust 🦀 ☆233, updated 2 years ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face ☆38, updated last year
- A Fish Speech implementation in Rust, with Candle.rs ☆96, updated 3 months ago
- llm_utils: Basic LLM tools, best practices, and minimal abstraction. ☆47, updated 6 months ago
- Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle ☆75, updated last year
- Rust implementation of Surya ☆60, updated 6 months ago
- High-level, optionally asynchronous Rust bindings to llama.cpp ☆228, updated last year
- OpenAI compatible API for serving LLAMA-2 model ☆218, updated last year
- Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers) ☆121, updated 11 months ago
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package ☆221, updated 2 months ago
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes ☆233, updated last month
- LLM Orchestrator built in Rust ☆282, updated last year
- Llama2 LLM ported to Rust burn ☆280, updated last year
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed! ☆109, updated 2 years ago
- Unofficial Rust bindings to Apple's mlx framework ☆189, updated last week
- auto-rust is an experimental project that automatically generate Rust code with LLM (Large Language Models) during compilation, utilizing… ☆41, updated 10 months ago
- AI gateway and observability server written in Rust. Designed to help optimize multi-agent workflows. ☆62, updated last year
- An extension library to Candle that provides PyTorch functions not currently available in Candle ☆40, updated last year
- An LLM interface (chat bot) implemented in pure Rust using HuggingFace/Candle over Axum Websockets, an SQLite Database, and a Leptos (Was… ☆136, updated 11 months ago
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx ☆217, updated 4 months ago
- Library for doing RAG ☆75, updated last month
- Fast serverless LLM inference, in Rust. ☆91, updated 6 months ago
- Extract core logic from qdrant and make it available as a library. ☆61, updated last year
- Port of Andrej Karpathy's minbpe to Rust ☆28, updated last year
- A Rust implementation of OpenAI's Whisper model using the burn framework ☆322, updated last year
- Blazingly fast inference of diffusion models. ☆114, updated 5 months ago
- Approx nearest neighbor search in Rust ☆166, updated 2 years ago
- A collection of optimisers for use with candle ☆40, updated last month
- Rust client for txtai ☆110, updated 2 weeks ago