skypilot-org / sky-llama
☆25Updated last year
Alternatives and similar repositories for sky-llama:
Users that are interested in sky-llama are comparing it to the libraries listed below
- ☆21Updated this week
- Tutorial to get started with SkyPilot!☆56Updated 8 months ago
- Elixir: Train a Large Language Model on a Small GPU Cluster☆13Updated last year
- Train, tune, and infer Bamba model☆75Updated this week
- ☆53Updated 3 weeks ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated last year
- Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the …☆53Updated last year
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆75Updated last week
- The official repo for "LLoCo: Learning Long Contexts Offline"☆114Updated 7 months ago
- r2e: turn any github repository into a programming agent environment☆94Updated 2 weeks ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆107Updated last month
- Data preparation code for CrystalCoder 7B LLM☆43Updated 8 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆40Updated 9 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated 8 months ago
- Self-host LLMs with LMDeploy and BentoML☆17Updated 3 weeks ago
- LLM finetuning☆43Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆101Updated 7 months ago
- ☆43Updated 6 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆62Updated last month
- Example of applying CUDA graphs to LLaMA-v2☆10Updated last year
- ReLM is a Regular Expression engine for Language Models☆103Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- ☆43Updated 2 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆62Updated last year
- Cascade Speculative Drafting☆28Updated 9 months ago
- Data preparation code for Amber 7B LLM☆84Updated 8 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆38Updated last month
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems☆89Updated this week