microsoft / Phi-3CookBook
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
☆2,457Updated this week
Related projects ⓘ
Alternatives and complementary repositories for Phi-3CookBook
- Tools for merging pretrained large language models.☆4,788Updated this week
- PyTorch native finetuning library☆4,267Updated this week
- ☆1,878Updated last week
- SGLang is a fast serving framework for large language models and vision language models.☆5,919Updated this week
- nanoGPT style version of Llama 3.1☆1,231Updated 3 months ago
- ☆2,734Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,356Updated this week
- DataComp for Language Models☆1,150Updated 2 weeks ago
- A native PyTorch Library for large model training☆2,579Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,026Updated last week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,551Updated 2 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,608Updated 2 months ago
- Home of StarCoder2!☆1,775Updated 7 months ago
- Training LLMs with QLoRA + FSDP☆1,418Updated this week
- ☆1,260Updated this week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,117Updated this week
- A blazing fast inference solution for text embeddings models☆2,813Updated this week
- High-quality datasets, tools, and concepts for LLM fine-tuning.☆1,974Updated 2 weeks ago
- VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and…☆1,976Updated last week
- Build resilient language agents as graphs.☆6,531Updated this week
- One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure☆1,839Updated last week
- Reaching LLaMA2 Performance with 0.1M Dollars☆960Updated 3 months ago
- AIOS: LLM Agent Operating System☆3,390Updated this week
- Mixture-of-Experts for Large Vision-Language Models☆1,975Updated 5 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,293Updated 7 months ago
- DeepSeek LLM: Let there be answers☆1,438Updated 9 months ago
- Robust recipes to align language models with human and AI preferences☆4,663Updated last month
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,504Updated 3 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,036Updated 2 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!☆3,215Updated 2 months ago