microsoft / PhiCookBookLinks
This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmark…
☆3,630Updated this week
Alternatives and similar repositories for PhiCookBook
Users that are interested in PhiCookBook are comparing it to the libraries listed below
Sorting:
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,623Updated 3 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,401Updated 8 months ago
- Everything about the SmolLM and SmolVLM family of models☆3,499Updated last month
- ☆3,055Updated last month
- ☆1,857Updated this week
- ☆2,123Updated last week
- PyTorch native post-training library☆5,629Updated this week
- One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure☆2,397Updated 7 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,701Updated 2 months ago
- DataComp for Language Models☆1,402Updated 3 months ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆1,967Updated this week
- Generative AI extensions for onnxruntime☆911Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,995Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,479Updated last year
- ☆4,247Updated 4 months ago
- Optimizing inference proxy for LLMs☆3,250Updated this week
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,746Updated this week
- Deploy your agentic worfklows to production☆2,065Updated 2 weeks ago
- Large Concept Models: Language modeling in a sentence representation space☆2,313Updated 11 months ago
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model"☆1,430Updated 2 months ago
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆3,706Updated last month
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,570Updated 7 months ago
- AllenAI's post-training codebase☆3,474Updated this week
- The Open Cookbook for Top-Tier Code Large Language Model☆1,966Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,155Updated this week
- Open-source AI cookbook☆2,555Updated last month
- A course on aligning smol models.☆6,550Updated last month
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,327Updated last year
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆4,220Updated 3 months ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆3,106Updated 7 months ago