skypilot-org / skypilot-catalogLinks
☆27Updated this week
Alternatives and similar repositories for skypilot-catalog
Users that are interested in skypilot-catalog are comparing it to the libraries listed below
Sorting:
- Write a fast kernel and run it on Discord. See how you compare against the best!☆66Updated 3 weeks ago
- Tutorial to get started with SkyPilot!☆58Updated last year
- AI-Driven Research Systems (ADRS)☆106Updated 3 weeks ago
- 👷 Build compute kernels☆198Updated 2 weeks ago
- LM engine is a library for pretraining/finetuning LLMs☆103Updated last week
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Updated 11 months ago
- vLLM adapter for a TGIS-compatible gRPC server.☆47Updated this week
- ☆219Updated 11 months ago
- ☆47Updated last year
- Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support☆232Updated this week
- A collection of reproducible inference engine benchmarks☆38Updated 8 months ago
- Easy, Fast, and Scalable Multimodal AI☆83Updated last week
- Memory optimized Mixture of Experts☆72Updated 5 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆141Updated 3 months ago
- LLM Serving Performance Evaluation Harness☆82Updated 10 months ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆148Updated 2 years ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆467Updated 2 weeks ago
- ☆48Updated last year
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆267Updated this week
- Simple high-throughput inference library☆155Updated 7 months ago
- Benchmark suite for LLMs from Fireworks.ai☆84Updated last month
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings☆109Updated 5 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Updated 3 months ago
- A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM☆182Updated this week
- Cray-LM unified training and inference stack.☆22Updated 11 months ago
- PyTorch centric eager mode debugger☆48Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆268Updated last month
- Google TPU optimizations for transformers models☆133Updated 3 weeks ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆66Updated 2 years ago
- Storing long contexts in tiny caches with self-study☆228Updated last month