huggingface / candle-paged-attention
☆12 Updated last year
Alternatives and similar repositories for candle-paged-attention
Users interested in candle-paged-attention are comparing it to the libraries listed below.
- ☆19 Updated 7 months ago
- implement llava using candle ☆14 Updated 11 months ago
- Read and write tensorboard data using Rust ☆21 Updated last year
- A collection of optimisers for use with candle ☆35 Updated last week
- Rust crate for some audio utilities ☆23 Updated 2 months ago
- Load compute kernels from the Hub ☆119 Updated last week
- ☆23 Updated last month
- ☆26 Updated last year
- ☆14 Updated 5 months ago
- ☆11 Updated 3 months ago
- Sample Python extension using Rust/PyO3/tch to interact with PyTorch ☆36 Updated last year
- A high-performance constrained decoding engine based on context free grammar in Rust ☆51 Updated 4 months ago
- Tensor library for Zig ☆12 Updated 6 months ago
- research impl of Native Sparse Attention (2502.11089) ☆54 Updated 2 months ago
- Experimental GPU language with meta-programming ☆22 Updated 8 months ago
- Experimental compiler for deep learning models ☆65 Updated last month
- 👷 Build compute kernels ☆38 Updated this week
- ☆21 Updated 2 months ago
- An implementation of the Llama architecture, to instruct and delight ☆21 Updated 4 months ago
- Experiment of using Tangent to autodiff triton ☆78 Updated last year
- A small rust-based data loader ☆24 Updated 5 months ago
- Profile your CoreML models directly from Python 🐍 ☆27 Updated 7 months ago
- 8-bit floating point types for Rust ☆47 Updated 2 months ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code ☆10 Updated last year
- Graph model execution API for Candle ☆14 Updated 5 months ago
- A transformers like interface for interacting with local LLMs in Rust. This crate aims to provide the simplest interface to interact with… ☆14 Updated last week
- GPU based FFT written in Rust and CubeCL ☆22 Updated 2 months ago
- A diffusers API in Burn (Rust) ☆19 Updated 10 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust ☆38 Updated last year
- ☆39 Updated 2 years ago