skypilot-org / skypilot-catalog
☆20Updated this week
Related projects ⓘ
Alternatives and complementary repositories for skypilot-catalog
- ☆38Updated 4 months ago
- A minimal implementation of vllm.☆30Updated 3 months ago
- ☆26Updated last year
- Tutorial to get started with SkyPilot!☆56Updated 6 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆61Updated last month
- Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆79Updated this week
- ☆39Updated 10 months ago
- ☆30Updated 2 years ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆60Updated last year
- ☆19Updated last year
- Elixir: Train a Large Language Model on a Small GPU Cluster☆13Updated last year
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆128Updated 11 months ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆40Updated last week
- The official repo for "LLoCo: Learning Long Contexts Offline"☆113Updated 5 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"☆56Updated last month
- Lightning support for Intel Habana accelerators.☆25Updated this week
- Benchmark suite for LLMs from Fireworks.ai☆58Updated 2 weeks ago
- Google TPU optimizations for transformers models☆75Updated this week
- ☆21Updated last week
- ☆24Updated last year
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year
- ☆120Updated this week
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆18Updated this week
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie…☆31Updated last year
- ☆99Updated last month
- A safetensors extension to efficiently store sparse quantized tensors on disk☆50Updated this week
- A resilient distributed training framework☆85Updated 7 months ago
- LLM Serving Performance Evaluation Harness☆56Updated 2 months ago
- How much energy do GenAI models consume?☆41Updated last month
- extensible collectives library in triton☆72Updated last month