huggingface / xet-coreLinks
xet client tech, used in huggingface_hub
☆171Updated this week
Alternatives and similar repositories for xet-core
Users that are interested in xet-core are comparing it to the libraries listed below
Sorting:
- Rust crates for XetHub☆52Updated 10 months ago
- Module, Model, and Tensor Serialization/Deserialization☆256Updated last week
- ☆238Updated last week
- vLLM adapter for a TGIS-compatible gRPC server.☆35Updated last week
- High-performance safetensors model loader☆53Updated last month
- A minimalistic C++ Jinja templating engine for LLM chat templates☆170Updated 2 weeks ago
- 👷 Build compute kernels☆106Updated last week
- ☆38Updated 2 weeks ago
- A collection of reproducible inference engine benchmarks☆32Updated 4 months ago
- Simple high-throughput inference library☆126Updated 3 months ago
- ☆501Updated 4 months ago
- ☆401Updated this week
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆58Updated 3 months ago
- ☆12Updated 6 months ago
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.☆69Updated this week
- ☆31Updated 4 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆139Updated last year
- Transformer GPU VRAM estimator☆66Updated last year
- A client library in Rust for Nvidia Triton.☆30Updated 2 years ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆270Updated last year
- Benchmarking suite for popular AI APIs☆88Updated 6 months ago
- This repository contains statistics about the AI Infrastructure products.☆17Updated 5 months ago
- Super-fast Structured Outputs☆432Updated last week
- Unified storage framework for the entire machine learning lifecycle☆156Updated last year
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆198Updated last month
- The driver for LMCache core to run in vLLM☆47Updated 6 months ago
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.☆82Updated 7 months ago
- implement llava using candle☆15Updated last year
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆29Updated 4 months ago
- Efficient vector database for hundred millions of embeddings.☆207Updated last year