Self-hosted huggingface mirror service. 自建huggingface镜像服务。
☆234Mar 14, 2026Updated 2 months ago
Alternatives and similar repositories for olah
Users that are interested in olah are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Variant optimization autoscaler for distributed inference workloads☆39Updated this week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated last year
- Test Orchestrator for Performance and Scalability of AI pLatforms☆18May 11, 2026Updated last week
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆30May 13, 2026Updated last week
- rainbond operator 安装控制器☆12May 12, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated 2 years ago
- Helm charts for llm-d☆52Jul 22, 2025Updated 9 months ago
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 9 months ago
- Janus is an opensource IA for Star Citizen☆11Dec 23, 2023Updated 2 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆46Mar 20, 2025Updated last year
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆47Updated this week
- 4G GPU & 10 Minutes for train☆12Aug 9, 2023Updated 2 years ago
- Tokenflood is a load testing framework for simulating arbitary loads on instruction-tuned LLMs☆45May 12, 2026Updated last week
- ☆21Nov 4, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆20May 12, 2026Updated last week
- GPU environment and cluster management with LLM support☆660May 16, 2024Updated 2 years ago
- A side project that follows all the acceleration tricks in tinyllama, with the minimal modification to the huggingface transformers code.☆13Sep 2, 2024Updated last year
- ☆33Apr 19, 2025Updated last year
- Inference server benchmarking tool☆158Apr 24, 2026Updated 3 weeks ago
- Данный проект основан на llama.cpp и компилирует только RPC-сервер, а так же вспомогательные утилиты, работающие в режиме RPC-клиента, не…☆24May 25, 2025Updated 11 months ago
- dbSurface is a SQL editor made for pgvector.☆23Dec 6, 2025Updated 5 months ago
- AI-generated text boundary detection with RoFT☆25Sep 9, 2024Updated last year
- A python library by Algovera to interact with IPFS and IPFS ecosystem such as the common pinning services.☆19Nov 17, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Module, Model, and Tensor Serialization/Deserialization☆308Apr 30, 2026Updated 2 weeks ago
- DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding ap…☆160Dec 9, 2025Updated 5 months ago
- ClearML Fractional GPU - Run multiple containers on the same GPU with driver level memory limitation ✨ and compute time-slicing☆92Mar 12, 2026Updated 2 months ago
- ☆76May 13, 2026Updated last week
- It is very easy to switch from Docker Shim to CRI Dockerd and back☆31Oct 30, 2023Updated 2 years ago
- ktransformers v0.3 docker build and run☆12Feb 24, 2025Updated last year
- Razer Chroma SDK for Rust☆12Jun 8, 2019Updated 6 years ago
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- Deliver LLMs of GGUF format via Dockerfile.☆15Oct 24, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆304Jan 26, 2026Updated 3 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,798Mar 24, 2026Updated last month
- a windows app that enables discord events to trigger razer chroma lighting on supported devices☆10Feb 25, 2023Updated 3 years ago
- ☆14Oct 29, 2020Updated 5 years ago
- pspmigrator is a tool to migrate from PSP to PSA☆29Sep 14, 2023Updated 2 years ago
- The best authentication plugin☆12Dec 19, 2025Updated 5 months ago
- Dragonfly related community material☆13May 7, 2026Updated last week