atoma-network / atoma-infer

Fast serverless LLM inference, in Rust.

☆94

Alternatives and similar repositories for atoma-infer

Users who are interested in atoma-infer are comparing it to the libraries listed below.

jeroenvlek / gpt-from-scratch-rs
Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle
☆77 Updated last year
Oxen-AI / GRPO-With-Cargo-Feedback
This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback
☆108 Updated 7 months ago
LaurentMazare / ug
Experimental compiler for deep learning models
☆67 Updated last month
EricLBuehler / candle-lora
Low rank adaptation (LoRA) for Candle.
☆162 Updated 6 months ago
oxideai / mlx-rs
Unofficial Rust bindings to Apple's mlx framework
☆202 Updated 3 weeks ago
Dan-wanna-M / kbnf
A high-performance constrained decoding engine based on context free grammar in Rust
☆55 Updated 5 months ago
JackMatthewRimmer / rust-rag-toolchain
Library for doing RAG
☆77 Updated last week
nerdai / llms-from-scratch-rs
A comprehensive Rust translation of the code from Sebastian Raschka's Build an LLM from Scratch book.
☆253 Updated last week
ShelbyJenkins / candle_embed
A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face
☆45 Updated last year
EricLBuehler / candle-vllm
Efficient platform for inference and serving local LLMs including an OpenAI compatible API server.
☆496 Updated last week
coreylowman / llama-dfdx
LLaMa 7b with CUDA acceleration implemented in Rust. Minimal GPU memory needed!
☆110 Updated 2 years ago
ShelbyJenkins / llm_client
The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes
☆238 Updated 2 months ago
0xPlaygrounds / rig-examples
Examples and use cases for building LLM-Powered apps with Rig
☆80 Updated last year
pixelspark / poly
A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust
☆79 Updated last year
567-labs / instructor-rs
Structured outputs for LLMs
☆52 Updated last year
Narsil / bindgen_cuda
☆24 Updated 6 months ago
mokeyish / candle-ext
An extension library to Candle that provides PyTorch functions not currently available in Candle
☆40 Updated last year
huggingface / hf-hub
Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package
☆236 Updated last month
KGrewal1 / candle-optimisers
A collection of optimisers for use with candle
☆43 Updated 2 months ago
jafioti / dataflow
Dataflow is a data processing library, primarily for machine learning.
☆24 Updated 2 years ago
ShelbyJenkins / llm_utils
llm_utils: Basic LLM tools, best practices, and minimal abstraction.
☆47 Updated 8 months ago
a-agmon / rs-graph-llm
High-performance framework for building interactive multi-agent workflow systems in Rust
☆167 Updated last month
ljt019 / transformers
Transformers provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered by t…
☆19 Updated 3 months ago
AbdelStark / anthropic-rs
Anthropic Rust SDK 🦀 with async support.
☆66 Updated 8 months ago
gaxler / llama2.rs
Inference Llama 2 in one file of pure Rust 🦀
☆233 Updated 2 years ago
fbilhaut / gline-rs
Inference engine for GLiNER models, in Rust
☆74 Updated this week
lucasjinreal / Crane
A pure Rust-based LLM (any LLM-based MLLM such as Spark-TTS) inference engine, powered by the Candle framework.
☆171 Updated this week
IncredibleDevHQ / agent-panel
AI gateway and observability server written in Rust. Designed to help optimize multi-agent workflows.
☆64 Updated last year
santiagomed / orca
LLM Orchestrator built in Rust
☆283 Updated last year
eugenehp / gpu-fft
GPU based FFT written in Rust and CubeCL
☆24 Updated 4 months ago