huggingface / candle-cublaslt
☆12 · Updated last year
Alternatives and similar repositories for candle-cublaslt:
Users who are interested in candle-cublaslt are comparing it to the libraries listed below.
- A small Python library to run iterators in a separate process ☆10 · Updated last year
- Rust crate for some audio utilities ☆22 · Updated 3 weeks ago
- ☆11 · Updated 2 months ago
- Tensor library for Zig ☆11 · Updated 4 months ago
- Proof of concept for running moshi/hibiki using WebRTC ☆18 · Updated last month
- Read and write tensorboard data using Rust ☆20 · Updated last year
- ESRGAN implemented in Rust with candle ☆15 · Updated last year
- A small Rust-based data loader ☆24 · Updated 3 months ago
- Training hybrid models for dummies. ☆20 · Updated 2 months ago
- ☆12 · Updated last year
- Rust bindings for CTranslate2 ☆14 · Updated last year
- 8-bit floating point types for Rust ☆46 · Updated 2 weeks ago
- GPU based FFT written in Rust and CubeCL ☆21 · Updated 2 weeks ago
- Nexusflow function call, tool use, and agent benchmarks. ☆19 · Updated 3 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust ☆37 · Updated last year
- A text embedding extension for the Polars Dataframe library. ☆24 · Updated 4 months ago
- LLaVA implemented using candle ☆14 · Updated 9 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆11 · Updated 6 months ago
- 👷 Build compute kernels ☆24 · Updated this week
- Because it's there. ☆16 · Updated 6 months ago
- AirLLM 70B inference with single 4GB GPU ☆12 · Updated 7 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 · Updated 4 months ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback ☆75 · Updated 3 weeks ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg… ☆20 · Updated 3 weeks ago
- Graph model execution API for Candle ☆13 · Updated 4 months ago
- Llama implementations benchmarking framework ☆12 · Updated last year
- ☆23 · Updated this week
- Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments ☆19 · Updated 8 months ago
- ANE accelerated embedding models! ☆17 · Updated 3 months ago
- A collection of optimisers for use with candle ☆34 · Updated 4 months ago