Fast serverless LLM inference, in Rust.
☆119Nov 5, 2025Updated 6 months ago
Alternatives and similar repositories for atoma-infer
Users that are interested in atoma-infer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Core infrastructure for confidential computing AI inference☆35Dec 1, 2025Updated 5 months ago
- Graph model execution API for Candle☆17Jul 27, 2025Updated 9 months ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆48May 3, 2024Updated 2 years ago
- Low rank adaptation (LoRA) for Candle.☆172Apr 18, 2025Updated last year
- A high-performance constrained decoding engine based on context free grammar in Rust☆58May 22, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Automatically derive Python dunder methods for your Rust code☆25Apr 7, 2026Updated last month
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆652Apr 30, 2026Updated last week
- Rust Workspace Bootstrapper☆18Oct 5, 2025Updated 7 months ago
- Candle Pipelines provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered …☆23Jan 5, 2026Updated 4 months ago
- Library for doing RAG☆86Apr 8, 2026Updated 3 weeks ago
- ☆19Dec 31, 2025Updated 4 months ago
- Sampling techniques for Candle.☆21Apr 3, 2024Updated 2 years ago
- implement llava using candle☆15Jun 9, 2024Updated last year
- ☆41Nov 18, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆14Dec 21, 2025Updated 4 months ago
- Rust bindings for OpenNMT/CTranslate2☆53Apr 5, 2026Updated last month
- Run Generative AI models directly on your hardware☆42Aug 7, 2024Updated last year
- Experimental compiler for deep learning models☆75Sep 18, 2025Updated 7 months ago
- Your AI Copilot in Rust☆51Dec 17, 2023Updated 2 years ago
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…☆42Aug 20, 2024Updated last year
- High-Performance K-Means Clustering Library☆41Jul 6, 2025Updated 10 months ago
- Rust snippets and tips☆17Oct 20, 2021Updated 4 years ago
- Minimalist ML framework for Rust☆22Feb 28, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A rust wrapper for HIP☆12Jun 10, 2025Updated 10 months ago
- 8-bit floating point types for Rust☆64Feb 4, 2026Updated 3 months ago
- A comprehensive Rust translation of the code from Sebastian Raschka's Build an LLM from Scratch book.☆316Apr 29, 2026Updated last week
- A whisper <lib|cli|server> written in rust☆20Apr 30, 2026Updated last week
- Fast, streaming indexing, query, and agentic LLM applications in Rust☆692Updated this week
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package☆293Apr 30, 2026Updated last week
- llm_utils: Basic LLM tools, best practices, and minimal abstraction.☆48Feb 18, 2025Updated last year
- a bot using an OODA loop...☆25Jan 19, 2026Updated 3 months ago
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆42Mar 15, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle☆77Mar 31, 2024Updated 2 years ago
- An opentelemetry receiver that generates configurable metrics & traces to emulate live services☆19Sep 13, 2024Updated last year
- World ID state bridge for Linea☆11Oct 21, 2024Updated last year
- A collection of optimisers for use with candle☆46Apr 6, 2026Updated last month
- Instant, controllable, local pre-trained AI models in Rust☆2,185Updated this week
- Structured outputs for LLMs☆54Jul 15, 2024Updated last year
- CLI utility to inspect and explore .safetensors and .gguf files☆51Oct 28, 2025Updated 6 months ago