LlamaEdge / sd-api-server
The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge
☆19Updated 2 weeks ago
Alternatives and similar repositories for sd-api-server:
Users that are interested in sd-api-server are comparing it to the libraries listed below
- A RAG API server written in Rust following OpenAI specs☆38Updated this week
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆12Updated 3 weeks ago
- ☆24Updated this week
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆79Updated last year
- The Google mediapipe AI library. Write AI inference applications for image recognition, text classification, audio / video processing and…☆170Updated 3 months ago
- OpenAI compatible API for serving LLAMA-2 model☆215Updated last year
- High-level bindings for wasi-nn system calls☆18Updated 6 months ago
- wasm-interface-types supplement & compiler of wasmedge☆15Updated last year
- WebAssembly (Wasm) Build and Bindings for llama.cpp☆231Updated 6 months ago
- Make ETLs Great Again!☆42Updated last year
- Embed WasmEdge functions in a Rust host app☆31Updated last month
- Rust bindings for OpenNMT/CTranslate2☆26Updated 3 weeks ago
- Lightweight web service clients in the WasmEdge Runtime using the Rust reqwest framework☆12Updated 6 months ago
- Implementing the BitNet model in Rust☆29Updated 9 months ago
- This application demonstrates how to launch high-performance "serverless" functions from the YoMo framework to process streaming data. Th…☆64Updated last year
- Lightweight HTTP servers based on hyper / warp frameworks in the WasmEdge Runtime.☆84Updated 6 months ago
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆293Updated 2 weeks ago
- Library for doing RAG☆51Updated last month
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx☆112Updated last week
- ☆242Updated last week
- Blazingly fast inference of diffusion models.☆98Updated 3 weeks ago
- Rust implementation of Surya☆56Updated 3 weeks ago
- 🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.☆300Updated this week
- A Fish Speech implementation in Rust, with Candle.rs☆66Updated last week
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...)☆103Updated 2 months ago
- Let WebAssembly's exported function support more data types for its parameters and return values.☆30Updated last year
- Simple Rust applications that run in WasmEdge☆32Updated last year
- Low rank adaptation (LoRA) for Candle.☆138Updated 5 months ago
- LM inference server implementation based on *.cpp.☆67Updated this week
- Implementation of the RWKV language model in pure WebGPU/Rust.☆274Updated this week