skypilot-org / skypilot-catalog
☆22Updated this week
Alternatives and similar repositories for skypilot-catalog:
Users that are interested in skypilot-catalog are comparing it to the libraries listed below
- Tutorial to get started with SkyPilot!☆57Updated 10 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆52Updated this week
- ☆45Updated 9 months ago
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆110Updated 3 months ago
- How much energy do GenAI models consume?☆42Updated 5 months ago
- ☆27Updated 4 months ago
- ☆28Updated last year
- Compression for Foundation Models☆27Updated this week
- ☆14Updated last month
- ❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents☆48Updated last week
- Cray-LM unified training and inference stack.☆21Updated last month
- Stateful LLM Serving☆50Updated 2 weeks ago
- A framework for PyTorch to enable fault management for collective communication libraries (CCL) such as NCCL☆19Updated last week
- The backend behind the LLM-Perf Leaderboard☆10Updated 10 months ago
- ☆43Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆63Updated last year
- A minimal implementation of vllm.☆37Updated 8 months ago
- Benchmark suite for LLMs from Fireworks.ai☆70Updated last month
- LLM Serving Performance Evaluation Harness☆73Updated last month
- Train, tune, and infer Bamba model☆86Updated 2 months ago
- Load compute kernels from the Hub☆99Updated this week
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆64Updated 3 months ago
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).☆37Updated 6 months ago
- ☆62Updated last month
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆10Updated last month
- vLLM adapter for a TGIS-compatible gRPC server.☆25Updated this week
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆150Updated 6 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!☆34Updated this week
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆27Updated 4 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated last year