huggingface / xet-core
xet client tech, used in huggingface_hub
☆107 Updated this week
Alternatives and similar repositories for xet-core
Users who are interested in xet-core are comparing it to the libraries listed below.
- ☆27 Updated last month
- Rust crates for XetHub ☆44 Updated 7 months ago
- 👷 Build compute kernels ☆44 Updated this week
- A collection of reproducible inference engine benchmarks ☆31 Updated last month
- ☆214 Updated this week
- vLLM adapter for a TGIS-compatible gRPC server. ☆30 Updated this week
- TRITONCACHE implementation of a Redis cache ☆13 Updated 2 weeks ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback ☆92 Updated 2 months ago
- The driver for LMCache core to run in vLLM ☆41 Updated 3 months ago
- ☆39 Updated 2 years ago
- ANE accelerated embedding models! ☆17 Updated 5 months ago
- High-performance safetensors model loader ☆34 Updated this week
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments. ☆29 Updated 2 months ago
- Rust implementation of Surya ☆58 Updated 3 months ago
- Model Context Protocol Server for Apache OpenDAL™ ☆29 Updated last month
- Simple dependency injection framework for Python ☆21 Updated last year
- ☆11 Updated 4 months ago
- ☆13 Updated last year
- Extract core logic from qdrant and make it available as a library. ☆58 Updated last year
- This repository contains statistics about the AI Infrastructure products. ☆18 Updated 3 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆60 Updated 3 weeks ago
- Load compute kernels from the Hub ☆139 Updated this week
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆88 Updated last week
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper) ☆28 Updated last year
- Benchmark suite for LLMs from Fireworks.ai ☆75 Updated 2 weeks ago
- Simple high-throughput inference library ☆115 Updated 2 weeks ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust ☆54 Updated last month
- implement llava using candle ☆15 Updated 11 months ago
- clustering algorithm implementation ☆13 Updated last month
- ☆25 Updated 5 months ago