huggingface / xet-core
xet client tech, used in huggingface_hub
β92Updated this week
Alternatives and similar repositories for xet-core:
Users that are interested in xet-core are comparing it to the libraries listed below
- π· Build compute kernelsβ37Updated this week
- Rust crates for XetHubβ43Updated 6 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rustβ52Updated 2 weeks ago
- Model Context Protocol Server for Apache OpenDALβ’β28Updated last month
- Rust implementation of Suryaβ58Updated 2 months ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedbackβ84Updated 2 months ago
- clustering algorithm implementationβ13Updated last week
- parallel fetchβ128Updated last week
- β207Updated this week
- β25Updated 3 weeks ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.β29Updated last month
- Super-fast Structured Outputsβ227Updated this week
- vLLM adapter for a TGIS-compatible gRPC server.β27Updated this week
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)β28Updated last year
- First token cutoff sampling inference exampleβ30Updated last year
- TRITONCACHE implementation of a Redis cacheβ13Updated 3 weeks ago
- This repository contains statistics about the AI Infrastructure products.β18Updated 2 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!β32Updated last month
- Workflow Defined Engineβ24Updated 3 weeks ago
- LLM-as-SERPβ65Updated 2 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templatesβ137Updated this week
- β45Updated 2 weeks ago
- Load compute kernels from the Hubβ116Updated this week
- β11Updated 3 months ago
- β54Updated last month
- β13Updated last year
- Inference server benchmarking toolβ57Updated 2 weeks ago
- ANE accelerated embedding models!β16Updated 5 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inferenceβ60Updated this week
- Auto Thinking Mode switch for Qwen3 in Open webuiβ50Updated this week