darkautism / llmserver-rs
A Rust-based, OpenAI-style API server for large language models (LLMs)
☆17Updated 2 weeks ago
Alternatives and similar repositories for llmserver-rs
Users who are interested in llmserver-rs are comparing it to the libraries listed below
- Easier usage of LLMs in Rockchip's NPU on SBCs like Orange Pi 5 and Radxa Rock 5 series☆167Updated 5 months ago
- Streaming TTS based on Piper with optional RK3588 NPU support☆118Updated 8 months ago
- Ollama alternative for Rockchip NPU: An efficient solution for running AI and Deep learning models on Rockchip devices with optimized NPU…☆385Updated this week
- top-like script for Rockchip NPUs on Linux☆63Updated 2 months ago
- Easy installation and usage of Rockchip's NPUs found in RK3588 and similar SoCs☆223Updated 5 months ago
- Automated script to convert Huggingface and GGUF models to rkllm format for running on Rockchip NPU☆37Updated last year
- ☆49Updated 11 months ago
- Run Large Language Models on RK3588 with GPU-acceleration☆121Updated 2 years ago
- Radxa Zero 3W/E image with Ubuntu 22, OpenCV, deep learning frameworks and NPU drivers☆55Updated last week
- My development fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆114Updated 2 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆87Updated last week
- 🖥️ myrktop - Orange Pi 5 (RK3588) System Monitoring script CPU,RAM,NPU,GPU,TEMPERATURES☆25Updated 9 months ago
- Arch Linux ARM image builder and installer for aarch64 UEFI devices, focusing on rk3588☆72Updated this week
- See how to play with ROCm, run it with AMD GPUs!☆39Updated 8 months ago
- Additional device tree overlays to support different hardware on Radxa products☆71Updated this week
- ROCm Docker images with fixes/support for legacy architecture gfx803, e.g. Radeon RX 590/RX 580/RX 570/RX 480☆82Updated 8 months ago
- Performance experiments with the hardware encoder (rkmpp) on Rockchip SoCs☆104Updated 2 weeks ago
- ☆20Updated last year
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Updated this week
- Web chat front end for rk3588_npu_llm_server / RK3588 LLM chat interface☆14Updated last year
- Light WebUI for lm.rs☆24Updated last year
- SDK for SenseCAP AI Watcher☆53Updated 3 months ago
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm…☆142Updated 2 months ago
- Armbian with updated RKNPU drivers☆27Updated 2 months ago
- A pure-Rust LLM inference engine (also for any LLM-based MLLM such as Spark-TTS), powered by the Candle framework.☆229Updated 3 weeks ago
- MLC Stable Diffusion for RK3588's Mali GPU☆40Updated last year
- Triton for AMD MI25/50/60. Development repository for the Triton language and compiler☆32Updated last month
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆21Updated 4 months ago
- Run stable-diffusion-webui with Radeon RX 580 8GB on Ubuntu 22.04.2 LTS☆67Updated 2 years ago
- ☆26Updated 11 months ago