slashml / awesome-small-language-models
☆122 Updated last month
Alternatives and similar repositories for awesome-small-language-models
Users who are interested in awesome-small-language-models are comparing it to the libraries listed below
- Inference, fine-tuning, and many more recipes with the Gemma family of models ☆279 Updated 6 months ago
- Learn the building blocks of how to build gpt-oss from scratch ☆113 Updated 4 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆37 Updated 8 months ago
- A Demo of Cache-Augmented Generation (CAG) in an LLM ☆120 Updated 8 months ago
- Train LLM on Hugging Face infra ☆67 Updated 3 months ago
- ☆106 Updated 10 months ago
- Coding an LLM and its building blocks from scratch. ☆113 Updated 10 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs … ☆61 Updated last year
- META‑AGENTIC α‑AGI 👁️✨ - Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe… ☆278 Updated 2 weeks ago
- Implementation of a GPT-4o-like multimodal model from scratch using Python ☆77 Updated 10 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆141 Updated last month
- From data to vector database effortlessly ☆89 Updated 8 months ago
- ☆80 Updated 6 months ago
- Chrome & Firefox extension to chat with webpages: local LLMs ☆131 Updated last year
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆169 Updated 5 months ago
- ☆112 Updated 7 months ago
- Fastest way to build and deploy reliable AI agents, MCP tools and agent-to-agent. Deploy in a production ready serverless environment. ☆147 Updated last week
- ☆89 Updated 2 weeks ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- A CLI to estimate inference memory requirements for Hugging Face models, written in Python. ☆683 Updated last week
- 🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data. ☆692 Updated this week
- ☆56 Updated last year
- Fine-tune Gemma 3 on an object detection task ☆97 Updated 6 months ago
- An LLM cookbook for building your own from scratch, all the way from gathering data to training a model ☆168 Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆261 Updated this week
- SynthGenAI - Package for Generating Synthetic Datasets using LLMs. ☆54 Updated 2 months ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch ☆85 Updated 5 months ago
- ☆212 Updated 8 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch ☆402 Updated 3 months ago
- ☆124 Updated 7 months ago