slashml/awesome-small-language-models

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/slashml/awesome-small-language-models)

slashml / awesome-small-language-models

☆129

Alternatives and similar repositories for awesome-small-language-models

Users that are interested in awesome-small-language-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

patthub / wsb_programowanie
View on GitHub
☆33Jun 22, 2026Updated 3 weeks ago
grounded-ai / grounded_ai
View on GitHub
Evals that meet you where you are. For AI that's grounded.
☆69Mar 21, 2026Updated 4 months ago
BelG13 / nl2sh
View on GitHub
Local CLI tool that lets you write natural language instructions and get the corresponding shell commands generated by a small language m…
☆21Nov 18, 2025Updated 8 months ago
hypeprlane / complex-pdf-rag
View on GitHub
☆19Jan 24, 2026Updated 5 months ago
Reaper2403 / slm-llm-grounding-playbook
View on GitHub
Architecture pattern for combining a fast LLM voice loop with a slower SLM that tracks hard facts.
☆15Apr 27, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
siddharth-Kharche / Finance-Agent-Powered-by-DeepSeek-R1
View on GitHub
☆15Jan 28, 2025Updated last year
Data4Democracy / are-you-fake-news
View on GitHub
☆17Jan 24, 2018Updated 8 years ago
GlassyWing / TorchDiffusion
View on GitHub
One Diffusion model implementation base on LibTorch
☆13Mar 22, 2023Updated 3 years ago
fishiatee / Tumera
View on GitHub
Yet another frontend for LLM, written using .NET and WinUI 3
☆11Sep 14, 2025Updated 10 months ago
muellerzr / nbdistributed
View on GitHub
Seemless interface of using PyTOrch distributed with Jupyter notebooks
☆59Sep 15, 2025Updated 10 months ago
the-ai-merge / multimodal-agents-course
View on GitHub
An MCP Multimodal AI Agent with eyes and ears!
☆575Jan 5, 2026Updated 6 months ago
AstraBert / Pokemon-Bot
View on GitHub
Discord bot that knows a lot about Pokemons :)
☆14Dec 25, 2024Updated last year
latenceainew / vllm-factory
View on GitHub
Production inference for encoder models - ColBERT, GLiNER, ColPali, embeddings etc. - as vLLM plugins for online and in-process deploymen…
☆75Jul 6, 2026Updated 2 weeks ago
awslabs / state-space-models-neuron
View on GitHub
☆16Apr 11, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dragonHyeon / HomeGAN
View on GitHub
Official pytorch implementation of the paper: "HomeGAN: Two stage GAN for enhanced floor plan image generation"
☆11Aug 9, 2023Updated 2 years ago
mallahyari / multimodal-image-search
View on GitHub
A Web app demonstrating multimodal image search using Visualized-BGE model
☆16Dec 1, 2024Updated last year
agershun / llamajs
View on GitHub
☆11Sep 18, 2023Updated 2 years ago
NachiketGadekar1 / browserllama
View on GitHub
Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.
☆23Oct 24, 2024Updated last year
Cenrax / AdvancedAITutorials
View on GitHub
This repository contains various RAG patterns implemented from scratch
☆20Dec 12, 2025Updated 7 months ago
RichardWallis / bibframe2schema
View on GitHub
Working files for the Bibframe2Schema.org Working Group
☆11Oct 25, 2023Updated 2 years ago
gigit0000 / qwen3.c
View on GitHub
Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.
☆25Sep 1, 2025Updated 10 months ago
tomsmoker / llm_kg_generator
View on GitHub
☆17Nov 6, 2023Updated 2 years ago
pavanjava / agentic-news
View on GitHub
this repository is a best example of agentic news team which coordinate and gets the news according to each agent.
☆14Dec 14, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
jaggzh / z-old
View on GitHub
LLM CLI Interface - Extremely Convenient and Fast
☆12Sep 22, 2025Updated 9 months ago
adlumal / GoalChain
View on GitHub
GoalChain for goal-orientated LLM conversation flows
☆71Dec 2, 2024Updated last year
stringandstickytape / MaxsAiStudio
View on GitHub
A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.
☆36May 11, 2026Updated 2 months ago
umar-mq / chainlit-rag
View on GitHub
☆30Oct 4, 2024Updated last year
TLAMHutto / ollamaChat
View on GitHub
☆23Sep 27, 2024Updated last year
doubleshow / superlinked
View on GitHub
A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…
☆12Sep 16, 2024Updated last year
Kestrel-Foundry / KestrelAI
View on GitHub
Long-term Research Assistants with Self-Scheduling
☆53Mar 22, 2026Updated 3 months ago
getholly / holly
View on GitHub
Holly - host your own AI coding agent inside docker container. Keep your system safe.
☆23Jul 13, 2026Updated last week
FarFetchd / clickitongue
View on GitHub
Mic-controlled mouse clicks
☆17Oct 6, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
UNHSAILLab / TaCo
View on GitHub
TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes
☆14Jul 1, 2025Updated last year
meetdavidwan / clamr
View on GitHub
CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval
☆26Jun 28, 2025Updated last year
deshwalmahesh / PHUDGE
View on GitHub
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆53Jul 10, 2024Updated 2 years ago
pavanjava / devops-agent
View on GitHub
Agent team for DevOps that solves Kubernetes, Docker, Terraform, and monitoring challenges through intelligent automation.
☆18Jan 21, 2026Updated 6 months ago
The-Pocket / PocketFlow-Zig
View on GitHub
Pocket Flow: A minimalist LLM framework. Let Agents build Agents!
☆16Jan 26, 2026Updated 5 months ago
aiplanethub / beyondllm
View on GitHub
Build, evaluate and observe LLM apps
☆296Jan 27, 2025Updated last year
FSilveiraa / solveig
View on GitHub
An agentic runtime that enables secure, extensible and configurable AI automation from any model
☆17Jul 15, 2026Updated last week