huggingface / xet-coreLinks
xet client tech, used in huggingface_hub
☆393Updated last week
Alternatives and similar repositories for xet-core
Users that are interested in xet-core are comparing it to the libraries listed below
Sorting:
- Rust crates for XetHub☆76Updated last year
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang☆100Updated last week
- A minimalistic C++ Jinja templating engine for LLM chat templates☆202Updated 4 months ago
- Module, Model, and Tensor Serialization/Deserialization☆286Updated 5 months ago
- ☆538Updated 3 months ago
- ☆278Updated last week
- Super-fast Structured Outputs☆670Updated this week
- ☆44Updated last week
- PyTorch Single Controller☆953Updated this week
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆225Updated 3 weeks ago
- parallel fetch☆144Updated 2 months ago
- Faster structured generation☆272Updated this week
- Inference server benchmarking tool☆141Updated 3 months ago
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆65Updated 9 months ago
- Where GPUs get cooked 👩🍳🔥☆357Updated last week
- Official Python API client library for turbopuffer☆100Updated last week
- Fast block-level file diffs (e.g. for VM disk images) using CoW filesystem metadata☆246Updated 7 months ago
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆212Updated 6 months ago
- Efficent platform for inference and serving local LLMs including an OpenAI compatible API server.☆578Updated this week
- ☆463Updated 2 months ago
- Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects☆134Updated last month
- GGUF implementation in C as a library and a tools CLI program☆301Updated 5 months ago
- ☆31Updated 9 months ago
- vLLM adapter for a TGIS-compatible gRPC server.☆50Updated this week
- High-performance safetensors model loader☆93Updated 2 weeks ago
- Simple high-throughput inference library☆155Updated 8 months ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆474Updated 2 weeks ago
- 👷 Build compute kernels☆214Updated last week
- Securely run AI-generated code in stateful sandboxes that run forever.☆225Updated 9 months ago
- Transformer GPU VRAM estimator☆67Updated last year