skypilot-org / skypilotLinks
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
☆8,347Updated this week
Alternatives and similar repositories for skypilot
Users that are interested in skypilot are comparing it to the libraries listed below
Sorting:
- Large Language Model Text Generation Inference☆10,311Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,710Updated 10 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆15,932Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,550Updated last year
- Numbers every LLM developer should know☆4,240Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆6,927Updated last year
- Tensor library for machine learning☆12,808Updated this week
- Structured Outputs☆12,084Updated this week
- Go ahead and axolotl questions☆9,852Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,233Updated this week
- A language for constraint-guided and efficient LLM programming.☆3,992Updated last month
- Accessible large language models via k-bit quantization for PyTorch.☆7,212Updated this week
- Containers for machine learning☆8,704Updated this week
- Universal LLM Deployment Engine with ML Compilation☆20,950Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆52,204Updated this week
- Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.☆6,983Updated this week
- Simple, safe way to store and distribute tensors☆3,345Updated last week
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,071Updated last week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,540Updated last week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,627Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,474Updated this week
- DSPy: The framework for programming—not prompting—language models☆26,357Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,274Updated last month
- An awesome & curated list of best LLMOps tools for developers☆5,085Updated 3 weeks ago
- Tools for merging pretrained large language models.☆6,016Updated 3 weeks ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,228Updated this week
- PyTorch native post-training library☆5,323Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,631Updated last year
- Python bindings for llama.cpp☆9,335Updated last week
- Development repository for the Triton language and compiler☆16,114Updated this week