AtomaAI / atoma-inferView external linksLinks
Fast serverless LLM inference, in Rust.
☆110Nov 5, 2025Updated 3 months ago
Alternatives and similar repositories for atoma-infer
Users that are interested in atoma-infer are comparing it to the libraries listed below
Sorting:
- Compilation of atoma's documents☆18Sep 17, 2025Updated 4 months ago
- Graph model execution API for Candle☆17Jul 27, 2025Updated 6 months ago
- Agentic workflows leveraging the Atoma Network☆14May 22, 2025Updated 8 months ago
- Low rank adaptation (LoRA) for Candle.☆169Apr 18, 2025Updated 9 months ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆47May 3, 2024Updated last year
- A high-performance constrained decoding engine based on context free grammar in Rust☆58May 22, 2025Updated 8 months ago
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆591Jan 28, 2026Updated 2 weeks ago
- Automatically derive Python dunder methods for your Rust code☆23Jan 28, 2026Updated 2 weeks ago
- A Keras like abstraction layer on top of the Rust ML framework candle☆23Jun 16, 2024Updated last year
- implement llava using candle☆15Jun 9, 2024Updated last year
- Rust Workspace Bootstrapper☆18Oct 5, 2025Updated 4 months ago
- Your AI Copilot in Rust☆49Dec 17, 2023Updated 2 years ago
- Library for doing RAG☆82Feb 6, 2026Updated last week
- Candle Pipelines provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered …☆23Jan 5, 2026Updated last month
- Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle☆77Mar 31, 2024Updated last year
- Rust bindings for OpenNMT/CTranslate2☆49Feb 7, 2026Updated last week
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…☆39Aug 20, 2024Updated last year
- A high-performance RAG indexing pipeline implemented in Rust using LanceDB and Candle☆26Aug 2, 2024Updated last year
- Experimental compiler for deep learning models☆75Sep 18, 2025Updated 4 months ago
- > Gemini Rust Suite 🦀: A powerful, modular Rust toolkit for interacting with Google Gemini. Features a feature-rich CLI, persistent sema…☆17Apr 23, 2025Updated 9 months ago
- Nix flake for Solana development☆13Jul 8, 2022Updated 3 years ago
- World ID state bridge for Linea☆11Oct 21, 2024Updated last year
- Coloniz empowers you to build smart, autonomous communities effortlessly.☆27Jun 18, 2025Updated 7 months ago
- Fast, streaming indexing, query, and agentic LLM applications in Rust☆661Updated this week
- ☆33Jan 19, 2026Updated 3 weeks ago
- JAX bindings for the flash-attention3 kernels☆20Jan 2, 2026Updated last month
- Sampling techniques for Candle.☆19Apr 3, 2024Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆15Sep 4, 2025Updated 5 months ago
- Rust snippets and tips☆17Oct 20, 2021Updated 4 years ago
- A new version control system built from the ground up that is 10-100x faster than Git and built for an AI-native world.☆24Jan 9, 2026Updated last month
- ☆12Sep 27, 2017Updated 8 years ago
- Demo web-sys application☆15Jan 7, 2023Updated 3 years ago
- Rust library to access openai API☆14Dec 17, 2023Updated 2 years ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated 11 months ago
- ☆97Nov 14, 2025Updated 3 months ago
- A Rust 🦀 port of the Hugging Face smolagents library.☆42Mar 26, 2025Updated 10 months ago
- Instant, controllable, local pre-trained AI models in Rust☆2,135Updated this week
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆41Mar 15, 2024Updated last year
- A Rust API Gateway built on top of pingora☆18Oct 29, 2024Updated last year