huggingface / candle-cublasltLinks
☆13Updated last year
Alternatives and similar repositories for candle-cublaslt
Users that are interested in candle-cublaslt are comparing it to the libraries listed below
Sorting:
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 3 months ago
- A small python library to run iterators in a separate process☆10Updated last year
- Read and write tensorboard data using Rust☆21Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 7 months ago
- Tensor library for Zig☆11Updated 7 months ago
- GPU based FFT written in Rust and CubeCL☆23Updated 2 weeks ago
- Rust crate for some audio utilities☆24Updated 3 months ago
- Because it's there.☆16Updated 9 months ago
- 8-bit floating point types for Rust☆46Updated 3 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆55Updated last month
- A diffusers API in Burn (Rust)☆19Updated 11 months ago
- Graph model execution API for Candle☆13Updated 7 months ago
- 👷 Build compute kernels☆68Updated this week
- implement llava using candle☆15Updated last year
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Updated last year
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆38Updated last year
- ☆20Updated 8 months ago
- Build tools for LLMs in Rust using Model Context Protocol☆38Updated 4 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆23Updated 3 months ago
- A small rust-based data loader☆27Updated 2 weeks ago
- A text embedding extension for the Polars Dataframe library.☆24Updated 7 months ago
- ESRGAN implemented in rust with candle☆16Updated last year
- LLama implementations benchmarking framework☆12Updated last year
- Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments☆20Updated 11 months ago
- Rust Implementation of micrograd☆52Updated 11 months ago
- Modular Rust transformer/LLM library using Candle☆36Updated last year
- Ask shortgpt for instant and concise answers☆13Updated 2 years ago
- A high-performance constrained decoding engine based on context free grammar in Rust☆53Updated last month
- a simple create-llama template using llama-index v0.10 and integrated with Ollama☆10Updated last year
- FalkorDB-Browser is a visualization UI for FalkorDB.☆32Updated this week