Self-hosted huggingface mirror service. 自建huggingface镜像服务。
☆241Mar 14, 2026Updated 2 months ago
Alternatives and similar repositories for olah
Users that are interested in olah are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Variant optimization autoscaler for distributed inference workloads☆42Updated this week
- a huggingface mirror site.☆337Apr 22, 2026Updated last month
- Efficient text classification with Pytorch☆23Nov 26, 2025Updated 6 months ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆27Apr 24, 2025Updated last year
- Experiment on metadata extraction using large language models such as GPT-3☆12Feb 1, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Test Orchestrator for Performance and Scalability of AI pLatforms☆18May 26, 2026Updated 2 weeks ago
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆30May 18, 2026Updated 3 weeks ago
- The Volcano Descheduler☆24Jan 24, 2025Updated last year
- rainbond operator 安装控制器☆12Updated this week
- Small, modern WASM bindings for libopus raw packet encode/decode.☆82Jun 1, 2026Updated last week
- Curriculum training of instruction-following LLMs with Unsloth☆14Dec 15, 2025Updated 5 months ago
- Helm charts for llm-d☆52Jul 22, 2025Updated 10 months ago
- EpochFS is a versioned cloud file system with git-like branching, transaction support.☆17Apr 23, 2026Updated last month
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆47Mar 20, 2025Updated last year
- IPA Phonemizer/Dephonemizer for 140 human languages☆60May 6, 2026Updated last month
- A Lustre container storage interface that allows Kubernetes to mount/unmount provisioned Lustre filesystems into containers.☆46Jun 1, 2026Updated last week
- Tokenflood is a load testing framework for simulating arbitary loads on instruction-tuned LLMs☆45May 18, 2026Updated 3 weeks ago
- ☆21Updated this week
- GPU environment and cluster management with LLM support☆658May 16, 2024Updated 2 years ago
- A side project that follows all the acceleration tricks in tinyllama, with the minimal modification to the huggingface transformers code.☆13Sep 2, 2024Updated last year
- ☆20Updated this week
- ☆21Mar 11, 2026Updated 2 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Inference server benchmarking tool☆160May 26, 2026Updated 2 weeks ago
- AI-generated text boundary detection with RoFT☆25Sep 9, 2024Updated last year
- Module, Model, and Tensor Serialization/Deserialization☆310Apr 30, 2026Updated last month
- DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding ap…☆160Dec 9, 2025Updated 6 months ago
- ☆77Updated this week
- It is very easy to switch from Docker Shim to CRI Dockerd and back☆31Oct 30, 2023Updated 2 years ago
- Chroma key (green screen removal) algorithms with Python☆11Jul 14, 2024Updated last year
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- A python module and REST API for automatic extraction of metadata from PDF files☆18Nov 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CPU DRA Driver☆52Updated this week
- Python package for Geometric / Clifford Algebra with Pytorch.☆15Jun 2, 2026Updated last week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆305Jan 26, 2026Updated 4 months ago
- Yet Another Z39.50-powered Chatbot☆13Oct 9, 2023Updated 2 years ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,827Mar 24, 2026Updated 2 months ago
- Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat☆61Sep 26, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago