microsoft / Phi-3CookBook

This is a cookbook for getting started with the Phi family of small language models (SLMs). Phi is a family of open-source AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmark…

☆2,738

Alternatives and similar repositories for Phi-3CookBook:

Users who are interested in Phi-3CookBook are comparing it to the libraries listed below.

pytorch / torchtune
PyTorch native post-training library
☆4,856 Updated this week
openai / simple-evals
☆2,332 Updated last week
microsoft / LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach…
☆4,879 Updated 3 weeks ago
arcee-ai / mergekit
Tools for merging pretrained large language models.
☆5,260 Updated last week
mlfoundations / dclm
DataComp for Language Models
☆1,230 Updated 2 months ago
mistralai / mistral-finetune
☆2,852 Updated 5 months ago
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,448 Updated this week
facebookresearch / MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
☆1,250 Updated this week
swe-bench / SWE-bench
SWE-bench [Multimodal]: Can Language Models Resolve Real-world GitHub Issues?
☆2,450 Updated last week
karpathy / nano-llama31
nanoGPT style version of Llama 3.1
☆1,316 Updated 6 months ago
NovaSky-AI / SkyThought
Sky-T1: Train your own O1 preview model within $450
☆2,641 Updated this week
QwenLM / Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
☆5,852 Updated 3 weeks ago
sgl-project / sglang
SGLang is a fast serving framework for large language models and vision language models.
☆10,325 Updated this week
run-llama / llama_cloud_services
Knowledge Agents and Management in the Cloud
☆3,707 Updated this week
microsoft / onnxruntime-genai
Generative AI extensions for onnxruntime
☆615 Updated this week
ictnlp / LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…
☆2,809 Updated 3 months ago
NVlabs / VILA
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…
☆2,916 Updated last week
OpenCoder-llm / OpenCoder-llm
The Open Cookbook for Top-Tier Code Large Language Model
☆1,614 Updated 2 months ago
microsoft / vscode-ai-toolkit
☆1,357 Updated last week
openai / transformer-debugger
☆4,058 Updated 8 months ago
axolotl-ai-cloud / axolotl
Go ahead and axolotl questions
☆8,648 Updated this week
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆2,362 Updated last week
pytorch / torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
☆3,506 Updated this week
facebookresearch / lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
☆4,434 Updated 3 weeks ago
run-llama / llama_deploy
Deploy your agentic workflows to production
☆1,964 Updated this week
codelion / optillm
Optimizing inference proxy for LLMs
☆2,040 Updated this week
allenai / open-instruct
AllenAI's post-training codebase
☆2,657 Updated this week
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We als…
☆16,243 Updated this week
deepseek-ai / DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
☆4,762 Updated 4 months ago
huggingface / smollm
Everything about the SmolLM2 and SmolVLM family of models
☆1,888 Updated 2 weeks ago