huggingface / candle-paged-attention
☆12Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for candle-paged-attention
- ☆17Updated last month
- implement llava using candle☆13Updated 5 months ago
- A collection of optimisers for use with candle☆31Updated this week
- ☆19Updated 4 months ago
- ☆22Updated this week
- Read and write tensorboard data using Rust☆17Updated 9 months ago
- ☆25Updated last year
- ☆76Updated 5 months ago
- Automatically derive Python dunder methods for your Rust code☆13Updated 3 months ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆41Updated 2 weeks ago
- Sample Python extension using Rust/PyO3/tch to interact with PyTorch☆32Updated 9 months ago
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆39Updated 2 months ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆22Updated 2 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆37Updated last year
- A small python library to run iterators in a separate process☆10Updated 8 months ago
- Low rank adaptation (LoRA) for Candle.☆127Updated 3 months ago
- Port of Andrej Karpathy's minbpe to Rust☆19Updated 6 months ago
- An extension library to Candle that provides PyTorch functions not currently available in Candle☆37Updated 8 months ago
- ☆17Updated last month
- A diffusers API in Burn (Rust)☆16Updated 4 months ago
- ESRGAN implemented in rust with candle☆14Updated 11 months ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Updated last year
- LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed!☆101Updated last year
- Experimentation using the xla compiler from rust☆89Updated 3 months ago
- Experiment of using Tangent to autodiff triton☆72Updated 10 months ago
- Implementing the BitNet model in Rust☆29Updated 7 months ago
- Rust client for the huggingface hub aiming for minimal subset of features over `huggingface-hub` python package☆155Updated 2 months ago
- ☆123Updated 6 months ago
- A Rust Library for High-Performance Tensor Exchange with Python☆39Updated last week
- Your one stop CLI for ONNX model analysis.☆45Updated 2 years ago