huggingface / candle-cublaslt
☆13Updated last year
Alternatives and similar repositories for candle-cublaslt:
Users that are interested in candle-cublaslt are comparing it to the libraries listed below
- Tensor library for Zig☆12Updated 5 months ago
- Rust crate for some audio utilities☆23Updated 2 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- Proof of concept for running moshi/hibiki using webrtc☆18Updated 2 months ago
- 👷 Build compute kernels☆37Updated last week
- 8-bit floating point types for Rust☆47Updated last month
- A small python library to run iterators in a separate process☆10Updated last year
- Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments☆20Updated 10 months ago
- GPU based FFT written in Rust and CubeCL☆22Updated last month
- A text embedding extension for the Polars Dataframe library.☆24Updated 5 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆22Updated 2 months ago
- Training hybrid models for dummies.☆20Updated 3 months ago
- Graph model execution API for Candle☆14Updated 5 months ago
- Chunk Dedupe Estimation☆14Updated 6 months ago
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 7 months ago
- ☆11Updated 3 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆38Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 4 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆52Updated last week
- Because it's there.☆16Updated 7 months ago
- ☆29Updated 5 months ago
- Read and write tensorboard data using Rust☆21Updated last year
- A collection of optimisers for use with candle☆34Updated last week
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆36Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Updated last year
- A small rust-based data loader☆24Updated 4 months ago
- ☆19Updated last week
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 7 months ago
- First token cutoff sampling inference example☆30Updated last year