zackshen / gguf
A GGUF file parser
☆13 · Updated 2 months ago
Alternatives and similar repositories for gguf
Users who are interested in gguf are comparing it to the libraries listed below.
- 8-bit floating point types for Rust ☆48 · Updated 3 weeks ago
- An extension library to Candle that provides PyTorch functions not currently available in Candle ☆40 · Updated last year
- Bleeding edge low level Rust binding for GGML ☆16 · Updated last year
- Rust library for scheduling, managing resources, and running DAGs 🌙 ☆33 · Updated 6 months ago
- llm_utils: Basic LLM tools, best practices, and minimal abstraction. ☆46 · Updated 5 months ago
- High-level, optionally asynchronous Rust bindings to llama.cpp ☆226 · Updated last year
- Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle ☆74 · Updated last year
- A simplified example in Rust of training a neural network and then using it based on the Candle Framework by Hugging Face. ☆39 · Updated last year
- (no description) ☆90 · Updated 7 months ago
- (no description) ☆10 · Updated 5 months ago
- GGML bindings that aim to be idiomatic Rust rather than directly corresponding to the C/C++ interface ☆19 · Updated last year
- A collection of boosting algorithms written in Rust 🦀 ☆57 · Updated 2 months ago
- A Fish Speech implementation in Rust, with Candle.rs ☆94 · Updated 2 months ago
- (no description) ☆20 · Updated 10 months ago
- Low rank adaptation (LoRA) for Candle. ☆152 · Updated 3 months ago
- A minimal OpenCL, CUDA, Vulkan and host CPU array manipulation engine / framework. ☆74 · Updated last month
- A GPT Implementation in Rust on top of tch-rs 🔥 🦀 ☆48 · Updated 2 months ago
- A neural network inference library, written in Rust. ☆63 · Updated last year
- Experimental compiler for deep learning models ☆68 · Updated 2 months ago
- Friendly interface to chat with an Ollama instance. ☆76 · Updated 3 weeks ago
- ONNX neural network inference engine ☆222 · Updated this week
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed! ☆108 · Updated 2 years ago
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust ☆80 · Updated last year
- GPU based FFT written in Rust and CubeCL ☆23 · Updated 2 months ago
- A Rust Vector which swaps to disk based on given parameters ☆44 · Updated last year
- (no description) ☆47 · Updated 2 weeks ago
- Blazingly fast inference of diffusion models. ☆112 · Updated 4 months ago
- Parallelo Parallel Library (PPL) is a small parallel framework that brings Structured Parallel Programming in Rust. ☆77 · Updated last month
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face ☆37 · Updated last year
- (no description) ☆23 · Updated 3 months ago