skypilot-org / skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
☆6,814 · Updated this week
Related projects
Alternatives and complementary repositories for skypilot
- Large Language Model Text Generation Inference ☆9,122 · Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,248 · Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆30,423 · Updated this week
- Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud. ☆10,079 · Updated this week
- Structured Text Generation ☆9,487 · Updated this week
- DSPy: The framework for programming—not prompting—language models ☆18,885 · Updated this week
- Go ahead and axolotl questions ☆7,930 · Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆6,127 · Updated this week
- Adding guardrails to large language models. ☆4,127 · Updated this week
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,464 · Updated 8 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆13,971 · Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,059 · Updated 5 months ago
- A language for constraint-guided and efficient LLM programming. ☆3,699 · Updated 5 months ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ☆9,182 · Updated this week
- the AI-native open-source embedding database ☆15,448 · Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆4,653 · Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆11,487 · Updated this week
- LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath ☆9,269 · Updated 3 months ago
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. ☆4,190 · Updated this week
- Accessible large language models via k-bit quantization for PyTorch. ☆6,299 · Updated this week
- A guidance language for controlling large language models. ☆19,118 · Updated last week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆12,427 · Updated last month
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index. ☆7,238 · Updated 2 months ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆5,994 · Updated 2 months ago
- Supercharge Your LLM Application Evaluations 🚀 ☆7,261 · Updated this week
- Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps! ☆4,766 · Updated this week
- Numbers every LLM developer should know ☆4,103 · Updated 10 months ago
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm. ☆4,497 · Updated last month
- structured outputs for LLMs ☆8,225 · Updated this week