skypilot-org / skypilot-catalog
☆25 Updated this week
Alternatives and similar repositories for skypilot-catalog
Users who are interested in skypilot-catalog are comparing it to the libraries listed below.
- A collection of reproducible inference engine benchmarks ☆31 Updated 2 months ago
- ☆44 Updated last year
- Tutorial to get started with SkyPilot! ☆58 Updated last year
- ☆30 Updated 7 months ago
- ☆47 Updated last year
- Learn CUDA with PyTorch ☆27 Updated this week
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding ☆116 Updated 6 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆46 Updated this week
- Cray-LM unified training and inference stack. ☆22 Updated 4 months ago
- train with kittens! ☆60 Updated 8 months ago
- Storing long contexts in tiny caches with self-study ☆67 Updated last week
- A minimal implementation of vllm. ☆44 Updated 11 months ago
- LLM Serving Performance Evaluation Harness ☆78 Updated 4 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS … ☆59 Updated 8 months ago
- python package of rocm-smi-lib ☆21 Updated 9 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆130 Updated this week
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry ☆42 Updated last year
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆126 Updated 6 months ago
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks. ☆61 Updated 5 months ago
- A lightweight, user-friendly data-plane for LLM training. ☆19 Updated 2 months ago
- ☆30 Updated 2 years ago
- PyTorch centric eager mode debugger ☆47 Updated 6 months ago
- ML/DL Math and Method notes ☆61 Updated last year
- vLLM adapter for a TGIS-compatible gRPC server. ☆32 Updated this week
- Compression for Foundation Models ☆31 Updated 3 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆95 Updated last month
- ☆21 Updated 3 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference ☆66 Updated 6 months ago
- Load compute kernels from the Hub ☆191 Updated last week
- ☆182 Updated 2 months ago