AmineDiro / cria
OpenAI compatible API for serving LLAMA-2 model
☆215Updated last year
Alternatives and similar repositories for cria:
Users that are interested in cria are comparing it to the libraries listed below
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆79Updated last year
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆57Updated last year
- LLM Orchestrator built in Rust☆272Updated 11 months ago
- Inference Llama 2 in one file of pure Rust 🦀☆232Updated last year
- Neural search for web-sites, docs, articles - online!☆131Updated 4 months ago
- auto-rust is an experimental project that automatically generate Rust code with LLM (Large Language Models) during compilation, utilizing…☆36Updated 3 months ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes☆183Updated 2 weeks ago
- Unofficial Rust bindings to Apple's mlx framework☆132Updated this week
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆320Updated this week
- An LLM interface (chat bot) implemented in pure Rust using HuggingFace/Candle over Axum Websockets, an SQLite Database, and a Leptos (Was…☆128Updated 4 months ago
- High-level, optionally asynchronous Rust bindings to llama.cpp☆208Updated 8 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆37Updated last year
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!☆103Updated last year
- Llama2 LLM ported to Rust burn☆277Updated 10 months ago
- ☆245Updated this week
- Library for doing RAG☆68Updated 2 months ago
- Low rank adaptation (LoRA) for Candle.☆144Updated 6 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated last year
- ☆135Updated last year
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆41Updated last week
- ☆125Updated 10 months ago
- A tiny embedding database in pure Rust.☆395Updated last year
- bott: Your Terminal Copilot☆86Updated last year
- Faster structured generation☆185Updated this week
- A Rust implementation of OpenAI's Whisper model using the burn framework☆292Updated 9 months ago
- Rust implementation of Surya☆57Updated this week
- ☆224Updated this week
- Tera is an AI assistant which is tailored just for you and runs fully locally.☆74Updated last year
- Rust+OpenCL+AVX2 implementation of LLaMA inference code☆542Updated last year