slashml / awesome-small-language-models
☆102 Updated last year
Alternatives and similar repositories for awesome-small-language-models
Users who are interested in awesome-small-language-models are comparing it to the libraries listed below.
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆276 Updated 5 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python ☆75 Updated 9 months ago
- ☆104 Updated 9 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆141 Updated last week
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆37 Updated 7 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs … ☆61 Updated 11 months ago
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model ☆167 Updated last year
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe… ☆269 Updated last month
- Train LLM on Hugging Face infra ☆67 Updated last month
- A Demo of Cache-Augmented Generation (CAG) in an LLM ☆119 Updated 7 months ago
- ☆46 Updated 9 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆168 Updated 4 months ago
- Agentic RAG to help you build a startup🚀 ☆55 Updated 9 months ago
- chrome & firefox extension to chat with webpages: local llms ☆130 Updated last year
- Fine tune Gemma 3 on an object detection task ☆95 Updated 5 months ago
- Learn Pydantic AI agents, step by step, using local models and ollama ☆141 Updated 6 months ago
- Coding an LLM and its building blocks from scratch. ☆106 Updated 9 months ago
- Learn the building blocks of how to build gpt-oss from scratch ☆108 Updated 3 months ago
- An agentic AI application that allows you to chat with your papers and gather also information from papers on ArXiv and on PubMed ☆154 Updated 7 months ago
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile … ☆85 Updated 3 months ago
- ☆108 Updated 6 months ago
- ☆89 Updated 9 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch ☆400 Updated 2 months ago
- From data to vector database effortlessly ☆88 Updated 7 months ago
- AI agent with RAG+ReAct on Indian Constitution & BNS ☆77 Updated 6 months ago
- ☆26 Updated last year
- Low memory full parameter finetuning of LLMs ☆53 Updated 5 months ago
- Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️ ☆47 Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆456 Updated 4 months ago