skypilot-org / sky-llama
☆25Updated last year
Alternatives and similar repositories for sky-llama:
Users that are interested in sky-llama are comparing it to the libraries listed below
- Tutorial to get started with SkyPilot!☆56Updated 9 months ago
- ☆54Updated 3 weeks ago
- Small, simple agent task environments for training and evaluation☆18Updated 3 months ago
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆75Updated last month
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆80Updated last week
- Tune efficiently any LLM model from HuggingFace using distributed training (multiple GPU) and DeepSpeed. Uses Ray AIR to orchestrate the …☆55Updated last year
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆34Updated this week
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆103Updated 8 months ago
- Pre-training code for CrystalCoder 7B LLM☆55Updated 9 months ago
- ReLM is a Regular Expression engine for Language Models☆103Updated last year
- Benchmark suite for LLMs from Fireworks.ai☆65Updated this week
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆40Updated last year
- ☆22Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆88Updated this week
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆107Updated 2 months ago
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning".☆37Updated 2 years ago
- Train, tune, and infer Bamba model☆83Updated last month
- LLM Optimize is a proof-of-concept library for doing LLM (large language model) guided blackbox optimization.☆53Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆24Updated 2 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated last month
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models"☆59Updated 4 months ago
- ☆59Updated last week
- ☆43Updated 7 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 9 months ago
- r2e: turn any github repository into a programming agent environment☆100Updated 2 weeks ago
- ☆21Updated last year
- LLM finetuning☆42Updated last year
- ☆37Updated 2 years ago