tomsanbear / bitnet-rs
Implementing the BitNet model in Rust
☆42Updated last year
Alternatives and similar repositories for bitnet-rs
Users that are interested in bitnet-rs are comparing it to the libraries listed below
- Experimental compiler for deep learning models☆72Updated 3 months ago
- Low rank adaptation (LoRA) for Candle.☆168Updated 8 months ago
- A diffusers API in Burn (Rust)☆22Updated last year
- ☆19Updated last year
- A collection of optimisers for use with candle☆44Updated 2 weeks ago
- A Fish Speech implementation in Rust, with Candle.rs☆106Updated 6 months ago
- 8-bit floating point types for Rust☆62Updated 3 weeks ago
- GPU based FFT written in Rust and CubeCL☆25Updated 6 months ago
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆79Updated last year
- High-level, optionally asynchronous Rust bindings to llama.cpp☆240Updated last year
- LLaMa 7b with CUDA acceleration implemented in Rust. Minimal GPU memory needed!☆110Updated 2 years ago
- A Keras like abstraction layer on top of the Rust ML framework candle☆23Updated last year
- Rust library for whisper.cpp compatible Mel spectrograms☆80Updated 3 weeks ago
- Fast serverless LLM inference, in Rust.☆108Updated last month
- Rust standalone inference of Namo-500M series models. Extremely tiny, running VLM on CPU.☆24Updated 9 months ago
- ☆32Updated 2 years ago
- Bleeding edge low level Rust binding for GGML☆16Updated last year
- A neural network inference library, written in Rust.☆70Updated last year
- An unofficial implementation of BitNet☆11Updated last year
- llm_utils: Basic LLM tools, best practices, and minimal abstraction.☆47Updated 10 months ago
- Unofficial Rust bindings to Apple's mlx framework☆219Updated last week
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆40Updated last year
- Modular Rust transformer/LLM library using Candle☆37Updated last year
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆94Updated 2 years ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆46Updated last year
- Rust port of annoy (https://github.com/spotify/annoy)☆45Updated 4 months ago
- Blazingly fast inference of diffusion models.☆117Updated 8 months ago
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package☆246Updated 2 weeks ago
- Andrej Karpathy's Let's build GPT: from scratch video & notebook implemented in Rust + candle☆77Updated last year
- A minimal OpenCL, CUDA, Vulkan and host CPU array manipulation engine / framework.☆77Updated 4 months ago