pixelspark / polyLinks
A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust
β79Updated last year
Alternatives and similar repositories for poly
Users that are interested in poly are comparing it to the libraries listed below
Sorting:
- Low rank adaptation (LoRA) for Candle.β168Updated 7 months ago
- Inference Llama 2 in one file of pure Rust π¦β235Updated 2 years ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Faceβ46Updated last year
- Rust implementation of Suryaβ63Updated 9 months ago
- OpenAI compatible API for serving LLAMA-2 modelβ218Updated 2 years ago
- LLM Orchestrator built in Rustβ285Updated last year
- llm_utils: Basic LLM tools, best practices, and minimal abstraction.β47Updated 9 months ago
- A Fish Speech implementation in Rust, with Candle.rsβ105Updated 6 months ago
- Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candleβ76Updated last year
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!β110Updated 2 years ago
- High-level, optionally asynchronous Rust bindings to llama.cppβ235Updated last year
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibesβ239Updated 3 months ago
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python packageβ243Updated 2 weeks ago
- auto-rust is an experimental project that automatically generate Rust code with LLM (Large Language Models) during compilation, utilizingβ¦β44Updated last year
- A collection of optimisers for use with candleβ44Updated this week
- Extract core logic from qdrant and make it available as a library.β61Updated last year
- Rust port of sentence-transformers (https://github.com/UKPLab/sentence-transformers)β122Updated last year
- Llama2 LLM ported to Rust burnβ278Updated last year
- An extension library to Candle that provides PyTorch functions not currently available in Candleβ40Updated last year
- Approx nearest neighbor search in Rustβ165Updated 2 years ago
- Fast serverless LLM inference, in Rust.β108Updated last month
- β19Updated last year
- Example of tch-rs on M1β55Updated last year
- An LLM interface (chat bot) implemented in pure Rust using HuggingFace/Candle over Axum Websockets, an SQLite Database, and a Leptos (Wasβ¦β137Updated last year
- Library for doing RAGβ78Updated last week
- AI gateway and observability server written in Rust. Designed to help optimize multi-agent workflows.β64Updated last year
- Dataflow is a data processing library, primarily for machine learning.β24Updated 2 years ago
- Unofficial Rust bindings to Apple's mlx frameworkβ210Updated last week
- π¦Rust + Large Language Models - Make AI Services Freely and Easily.β182Updated last year
- Implementing the BitNet model in Rustβ42Updated last year