microsoft / Phi-3CookBook
This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmark…
☆2,738Updated this week
Alternatives and similar repositories for Phi-3CookBook:
Users that are interested in Phi-3CookBook are comparing it to the libraries listed below
- PyTorch native post-training library☆4,856Updated this week
- ☆2,332Updated last week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,879Updated 3 weeks ago
- Tools for merging pretrained large language models.☆5,260Updated last week
- DataComp for Language Models☆1,230Updated 2 months ago
- ☆2,852Updated 5 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,448Updated this week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,250Updated this week
- SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?☆2,450Updated last week
- nanoGPT style version of Llama 3.1☆1,316Updated 6 months ago
- Sky-T1: Train your own O1 preview model within $450☆2,641Updated this week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,852Updated 3 weeks ago
- SGLang is a fast serving framework for large language models and vision language models.☆10,325Updated this week
- Knowledge Agents and Management in the Cloud☆3,707Updated this week
- Generative AI extensions for onnxruntime☆615Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,809Updated 3 months ago
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆2,916Updated last week
- The Open Cookbook for Top-Tier Code Large Language Model☆1,614Updated 2 months ago
- ☆1,357Updated last week
- ☆4,058Updated 8 months ago
- Go ahead and axolotl questions☆8,648Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,362Updated last week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,506Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,434Updated 3 weeks ago
- Deploy your agentic worfklows to production☆1,964Updated this week
- Optimizing inference proxy for LLMs☆2,040Updated this week
- AllenAI's post-training codebase☆2,657Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆16,243Updated this week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,762Updated 4 months ago
- Everything about the SmolLM2 and SmolVLM family of models☆1,888Updated 2 weeks ago