EricLBuehler / candle-samplingLinks
Sampling techniques for Candle.
☆19Updated last year
Alternatives and similar repositories for candle-sampling
Users that are interested in candle-sampling are comparing it to the libraries listed below
Sorting:
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆46Updated last year
- Low rank adaptation (LoRA) for Candle.☆166Updated 7 months ago
- Graph model execution API for Candle☆16Updated 3 months ago
- Fast, Lightweight, Unified Engine for Text2Image Diffusion Models☆19Updated 7 months ago
- A Fish Speech implementation in Rust, with Candle.rs☆103Updated 5 months ago
- Fast serverless LLM inference, in Rust.☆105Updated 2 weeks ago
- Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle☆76Updated last year
- A collection of optimisers for use with candle☆43Updated 3 months ago
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package☆242Updated last month
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!☆110Updated 2 years ago
- CLI utility to inspect and explore .safetensors and .gguf files☆34Updated 3 weeks ago
- Experimental compiler for deep learning models☆70Updated 2 months ago
- Blazingly fast inference of diffusion models.☆116Updated 7 months ago
- Inference engine for GLiNER models, in Rust☆76Updated last week
- Automatically derive Python dunder methods for your Rust code☆20Updated 7 months ago
- Structured outputs for LLMs☆52Updated last year
- ☆19Updated last year
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆79Updated last year
- implement llava using candle☆15Updated last year
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes☆239Updated 3 months ago
- ☆135Updated last year
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆40Updated last year
- A high-performance constrained decoding engine based on context free grammar in Rust☆55Updated 5 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆39Updated 2 years ago
- Transformers provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered by t…☆20Updated 4 months ago
- ☆36Updated last year
- Tera is an AI assistant which is tailored just for you and runs fully locally.☆86Updated last year
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆111Updated 8 months ago
- A Pure Rust based LLM (Any LLM based MLLM such as Spark-TTS) Inference Engine, powering by Candle framework.☆202Updated last week
- High-level, optionally asynchronous Rust bindings to llama.cpp☆234Updated last year