huggingface / xet-core
xet client tech, used in huggingface_hub
☆148 · Updated this week
Alternatives and similar repositories for xet-core
Users who are interested in xet-core are comparing it to the libraries listed below.
- Rust crates for XetHub · ☆51 · Updated 9 months ago
- ☆231 · Updated this week
- vLLM adapter for a TGIS-compatible gRPC server. · ☆33 · Updated this week
- 👷 Build compute kernels · ☆87 · Updated this week
- Module, Model, and Tensor Serialization/Deserialization · ☆250 · Updated this week
- A collection of reproducible inference engine benchmarks · ☆32 · Updated 3 months ago
- ☆38 · Updated this week
- High-performance safetensors model loader · ☆52 · Updated 2 weeks ago
- Simple high-throughput inference library · ☆125 · Updated 2 months ago
- clustering algorithm implementation · ☆13 · Updated 3 weeks ago
- A minimalistic C++ Jinja templating engine for LLM chat templates · ☆163 · Updated 3 weeks ago
- parallel fetch · ☆134 · Updated 3 weeks ago
- ☆30 · Updated 3 months ago
- This repository contains statistics about the AI Infrastructure products. · ☆18 · Updated 5 months ago
- ☆13 · Updated last year
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas… · ☆192 · Updated 2 weeks ago
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust · ☆80 · Updated last year
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth · ☆137 · Updated last week
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback · ☆100 · Updated 4 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference · ☆61 · Updated 2 months ago
- ☆491 · Updated 3 months ago
- Super-fast Structured Outputs · ☆350 · Updated this week
- A Fish Speech implementation in Rust, with Candle.rs · ☆94 · Updated 2 months ago
- Unified storage framework for the entire machine learning lifecycle · ☆156 · Updated last year
- Rust implementation of Surya · ☆58 · Updated 5 months ago
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al. · ☆81 · Updated 6 months ago
- ☆12 · Updated 6 months ago
- implement llava using candle · ☆15 · Updated last year
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust · ☆57 · Updated 3 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP · ☆98 · Updated 2 weeks ago