skypilot-org / skypilotLinks

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

☆8,409

Alternatives and similar repositories for skypilot

Users that are interested in skypilot are comparing it to the libraries listed below

Sorting:

huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,352Updated this week
bentoml / OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
☆11,591Updated this week
dottxt-ai / outlines
Structured Outputs
☆12,149Updated this week
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,327Updated 2 months ago
eth-sri / lmql
A language for constraint-guided and efficient LLM programming.
☆4,011Updated 2 months ago
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆12,547Updated this week
neuml / txtai
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
☆11,248Updated last week
guardrails-ai / guardrails
Adding guardrails to large language models.
☆5,302Updated last week
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆7,330Updated this week
microsoft / LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…
☆5,285Updated 4 months ago
567-labs / instructor
structured outputs for llms
☆11,016Updated this week
sgl-project / sglang
SGLang is a fast serving framework for large language models and vision language models.
☆16,236Updated this week
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆26,664Updated this week
ggml-org / ggml
Tensor library for machine learning
☆12,859Updated this week
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆52,682Updated this week
pytorch / torchtune
PyTorch native post-training library
☆5,361Updated this week
bigscience-workshop / petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
☆9,731Updated 10 months ago
zilliztech / GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
☆7,649Updated 2 weeks ago
lm-sys / RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality
☆4,115Updated 11 months ago
1rgs / jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
☆4,776Updated last year
jzhang38 / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆8,654Updated last year
huggingface / text-embeddings-inference
A blazing fast inference solution for text embeddings models
☆3,816Updated 2 weeks ago
bentoml / BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
☆7,908Updated this week
axolotl-ai-cloud / axolotl
Go ahead and axolotl questions
☆9,985Updated this week
argilla-io / argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
☆4,585Updated last week
PrefectHQ / marvin
an ambient intelligence library
☆5,825Updated this week
tensorchord / Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
☆5,121Updated last month
chroma-core / chroma
the AI-native open-source embedding database
☆21,290Updated this week
ray-project / llm-numbers
Numbers every LLM developer should know
☆4,246Updated last year
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆6,936Updated last year