AmineDiro / cria
OpenAI-compatible API for serving the LLAMA-2 model
☆215 Updated last year
Related projects
Alternatives and complementary repositories for cria
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust ☆80 Updated 10 months ago
- LLM Orchestrator built in Rust ☆267 Updated 8 months ago
- Inference Llama 2 in one file of pure Rust 🦀