huggingface / xet-coreLinks
xet client tech, used in huggingface_hub
☆292Updated last week
Alternatives and similar repositories for xet-core
Users that are interested in xet-core are comparing it to the libraries listed below
Sorting:
- Rust crates for XetHub☆69Updated 11 months ago
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.☆79Updated this week
- A minimalistic C++ Jinja templating engine for LLM chat templates☆187Updated 2 weeks ago
- Super-fast Structured Outputs☆539Updated last week
- ☆509Updated 5 months ago
- ☆255Updated last week
- Module, Model, and Tensor Serialization/Deserialization☆267Updated last month
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆59Updated 5 months ago
- parallel fetch☆138Updated 3 weeks ago
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆204Updated this week
- A CLI tool for managing Claude instances with git worktree☆75Updated 3 weeks ago
- Model Context Protocol Server for Apache OpenDAL™☆33Updated 6 months ago
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆195Updated 2 months ago
- Fast block-level file diffs (e.g. for VM disk images) using CoW filesystem metadata☆213Updated 3 months ago
- ☆440Updated last month
- Verify Precision of all Kimi K2 API Vendor☆188Updated 2 weeks ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆107Updated 7 months ago
- vLLM adapter for a TGIS-compatible gRPC server.☆41Updated last week
- Rust implementation of Surya☆60Updated 7 months ago
- Official Python API client library for turbopuffer☆78Updated this week
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.☆82Updated 8 months ago
- Transformer GPU VRAM estimator☆66Updated last year
- ☆139Updated last year
- Benchmark and optimize LLM inference across frameworks with ease☆113Updated 3 weeks ago
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆479Updated 2 weeks ago
- ☆232Updated 3 months ago
- GGUF implementation in C as a library and a tools CLI program☆291Updated last month
- Simple high-throughput inference library☆142Updated 4 months ago
- Self-host LLMs with vLLM and BentoML☆150Updated last week
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆88Updated last year