EricLBuehler / candle-sampling
Sampling techniques for Candle.
☆19 · Updated last year
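
Since the repository description is only one line, here is a minimal, dependency-free sketch in plain Rust of one common technique such a sampling crate implements: temperature scaling followed by top-k filtering over raw logits. Every name here (`sample_top_k`, the inline xorshift RNG) is a hypothetical illustration and does not reflect candle-sampling's actual API.

```rust
/// Sample a token id from raw logits using temperature scaling and top-k
/// filtering. A generic sketch, not candle-sampling's real interface.
fn sample_top_k(logits: &[f32], temperature: f32, k: usize, seed: &mut u64) -> usize {
    // Scale logits by temperature: lower values sharpen the distribution.
    let scaled: Vec<f32> = logits.iter().map(|&l| l / temperature).collect();

    // Keep only the k highest-scoring token indices.
    let mut indexed: Vec<(usize, f32)> = scaled.iter().copied().enumerate().collect();
    indexed.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap());
    indexed.truncate(k.max(1));

    // Softmax over the surviving logits (subtract the max for numerical stability).
    let max = indexed[0].1;
    let exps: Vec<f32> = indexed.iter().map(|&(_, l)| (l - max).exp()).collect();
    let total: f32 = exps.iter().sum();

    // Draw a uniform sample with a tiny xorshift RNG to stay dependency-free
    // (a real crate would use a proper RNG; seed must be non-zero).
    *seed ^= *seed << 13;
    *seed ^= *seed >> 7;
    *seed ^= *seed << 17;
    let mut r = (*seed as f64 / u64::MAX as f64) as f32 * total;

    // Walk the cumulative distribution and return the chosen token id.
    for (&(idx, _), &e) in indexed.iter().zip(exps.iter()) {
        if r < e {
            return idx;
        }
        r -= e;
    }
    indexed.last().unwrap().0
}

fn main() {
    let logits = [1.0_f32, 3.5, 0.2, 2.8, -1.0];
    let mut seed = 42_u64;
    let token = sample_top_k(&logits, 0.8, 3, &mut seed);
    println!("sampled token id: {token}");
}
```

Lower temperatures push the choice toward greedy decoding, while smaller k cuts off the distribution's long tail; production samplers typically layer further filters (top-p, repetition penalties) on the same logits before drawing.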
Alternatives and similar repositories for candle-sampling
Users interested in candle-sampling are comparing it to the libraries listed below.
- Graph model execution API for Candle ☆17 · Updated 5 months ago
- Low rank adaptation (LoRA) for Candle. ☆168 · Updated 8 months ago
- A collection of optimisers for use with candle ☆45 · Updated this week
- Fast serverless LLM inference, in Rust. ☆108 · Updated last month
- LLaMa 7b with CUDA acceleration implemented in Rust. Minimal GPU memory needed! ☆111 · Updated 2 years ago
- Fast, Lightweight, Unified Engine for Text2Image Diffusion Models ☆19 · Updated 8 months ago
- Rust client for the Hugging Face Hub, aiming for a minimal subset of the features of the `huggingface-hub` Python package ☆249 · Updated 2 weeks ago
- A Fish Speech implementation in Rust, with Candle.rs ☆106 · Updated 6 months ago
- A simple, CUDA- or CPU-powered library for creating vector embeddings using Candle and models from Hugging Face ☆46 · Updated last year
- ☆38 · Updated last year
- Andrej Karpathy's "Let's build GPT: from scratch" video & notebook implemented in Rust + candle ☆77 · Updated last year
- Blazingly fast inference of diffusion models. ☆117 · Updated 8 months ago
- Inference engine for GLiNER models, in Rust ☆81 · Updated last month
- An implementation of LLaVA using candle ☆15 · Updated last year
- auto-rust is an experimental project that automatically generates Rust code with LLMs (Large Language Models) during compilation, utilizing… ☆45 · Updated last year
- Rust implementation of Hugging Face transformers pipelines using the onnxruntime backend, with bindings to C# and C ☆41 · Updated 2 years ago
- An extension library for Candle that provides PyTorch functions not currently available in Candle ☆40 · Updated last year
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust ☆79 · Updated last year
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes ☆240 · Updated 4 months ago
- Rust standalone inference for the Namo-500M series models. Extremely tiny, running VLMs on CPU. ☆24 · Updated 9 months ago
- ☆19 · Updated last year
- Transformers provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered by t… ☆21 · Updated this week
- Unofficial Rust bindings to Apple's MLX framework ☆221 · Updated 2 weeks ago
- Automatically derive Python dunder methods for your Rust code ☆20 · Updated 8 months ago
- High-level, optionally asynchronous Rust bindings to llama.cpp ☆240 · Updated last year
- Rust crate for some audio utilities ☆25 · Updated 9 months ago
- GPU-based FFT written in Rust and CubeCL ☆25 · Updated 6 months ago
- A high-performance constrained decoding engine based on context-free grammars, in Rust ☆57 · Updated 7 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust ☆39 · Updated 2 years ago
- ☆135 · Updated last year