slashml / awesome-small-language-models
☆121Updated 3 weeks ago
Alternatives and similar repositories for awesome-small-language-models
Users who are interested in awesome-small-language-models are comparing it to the repositories listed below.
- Inference, Fine Tuning and many more recipes with Gemma family of models☆279Updated 6 months ago
- Train LLM on Hugging Face infra☆67Updated 2 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last month
- ☆105Updated 10 months ago
- Learn the building blocks of how to build gpt-oss from scratch☆110Updated 4 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆61Updated 11 months ago
- A Demo of Cache-Augmented Generation (CAG) in an LLM☆120Updated 7 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 8 months ago
- Fine tune Gemma 3 on an object detection task☆96Updated 6 months ago
- Fastest way to build and deploy reliable AI agents, MCP tools and agent-to-agent. Deploy in a production ready serverless environment.☆145Updated last week
- chrome & firefox extension to chat with webpages: local llms☆131Updated last year
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 5 months ago
- Coding an LLM and its building blocks from scratch.☆113Updated 10 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆77Updated 9 months ago
- ☆79Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- ☆26Updated last year
- ~950 line, minimal, extensible LLM inference engine built from scratch.☆405Updated 3 weeks ago
- ☆46Updated 10 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆260Updated last week
- An agentic AI application that lets you chat with your papers and also gather information from papers on arXiv and PubMed☆152Updated 8 months ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆85Updated 4 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆402Updated 2 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆496Updated 5 months ago
- From data to vector database effortlessly☆90Updated 8 months ago
- Easy-to-use, high-performance knowledge distillation for LLMs☆97Updated 8 months ago
- Learn Pydantic AI agents, step by step, using local models and ollama☆142Updated 7 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- ☆181Updated 11 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago