AmineDiro / cria

OpenAI-compatible API for serving the LLAMA-2 model

☆218

Alternatives and similar repositories for cria

Users who are interested in cria are comparing it to the libraries listed below.

pixelspark / poly
A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust
☆79 Updated last year
spyglass-search / memex
Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.
☆62 Updated 2 years ago
santiagomed / orca
LLM Orchestrator built in Rust
☆283 Updated last year
qdrant / page-search
Neural search for web-sites, docs, articles - online!
☆142 Updated 2 months ago
EricLBuehler / candle-lora
Low rank adaptation (LoRA) for Candle.
☆162 Updated 5 months ago
ShelbyJenkins / llm_client
The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes
☆237 Updated 2 months ago
Vaibhavs10 / fast-llm.rs
☆138 Updated last year
gaxler / llama2.rs
Inference Llama 2 in one file of pure Rust 🦀
☆233 Updated 2 years ago
minskylab / auto-rust
auto-rust is an experimental project that automatically generates Rust code with LLMs (Large Language Models) during compilation, utilizing…
☆42 Updated 10 months ago
atoma-network / atoma-infer
Fast serverless LLM inference, in Rust.
☆94 Updated 7 months ago
coreylowman / llama-dfdx
LLaMa 7b with CUDA acceleration implemented in Rust. Minimal GPU memory needed!
☆109 Updated 2 years ago
m1guelpf / tinyvector
A tiny embedding database in pure Rust.
☆419 Updated last year
JackMatthewRimmer / rust-rag-toolchain
Library for doing RAG
☆77 Updated 3 weeks ago
edgenai / llama_cpp-rs
High-level, optionally asynchronous Rust bindings to llama.cpp
☆231 Updated last year
Noeda / rllama
Rust+OpenCL+AVX2 implementation of LLaMA inference code
☆546 Updated last year
open-sauced / repo-query
Ask questions, get insights from repos
☆81 Updated last year
EricLBuehler / candle-vllm
Efficient platform for inference and serving local LLMs, including an OpenAI-compatible API server.
☆479 Updated 2 weeks ago
juliooa / secondbrain
Multi-platform desktop app to download and run Large Language Models (LLMs) locally on your computer.
☆290 Updated 2 years ago
LaurentMazare / mamba.rs
☆133 Updated last year
jimexist / surya-rs
Rust implementation of Surya
☆60 Updated 7 months ago
Oxen-AI / GRPO-With-Cargo-Feedback
This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback
☆107 Updated 7 months ago
jeroenvlek / gpt-from-scratch-rs
Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle
☆75 Updated last year
second-state / WasmEdge-WASINN-examples
☆255 Updated 2 weeks ago
danielclough / fireside-chat
An LLM interface (chat bot) implemented in pure Rust using HuggingFace/Candle over Axum Websockets, an SQLite Database, and a Leptos (Was…
☆137 Updated last year
LLukas22 / llm-rs-python
Unofficial Python bindings for the Rust llm library. 🐍❤️🦀
☆76 Updated 2 years ago
leo-du / llama2.rs
Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust
☆39 Updated 2 years ago
oxideai / mlx-rs
Unofficial Rust bindings to Apple's mlx framework
☆195 Updated last week
rustformers / llmcord
A Discord bot, written in Rust, that generates responses using the LLaMA language model.
☆95 Updated 2 years ago
KerfuffleV2 / smolrsrwkv
A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…
☆94 Updated 2 years ago
fennel-ai / fann
Approx nearest neighbor search in Rust
☆167 Updated 2 years ago