☆124Jan 9, 2026Updated 2 months ago
Alternatives and similar repositories for awesome-small-language-models
Users that are interested in awesome-small-language-models are comparing it to the libraries listed below
Sorting:
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆23Sep 1, 2025Updated 6 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated 2 months ago
- ☆30Aug 27, 2024Updated last year
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆24Dec 15, 2025Updated 2 months ago
- An agentic runtime that enables secure, extensible and configurable AI automation from any model☆17Updated this week
- ☆11Sep 18, 2023Updated 2 years ago
- Official pytorch implementation of the paper: "HomeGAN: Two stage GAN for enhanced floor plan image generation"☆11Aug 9, 2023Updated 2 years ago
- LLM CLI Interface - Extremely Convenient and Fast☆12Sep 22, 2025Updated 5 months ago
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 5 months ago
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes☆14Jul 1, 2025Updated 8 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- Evals that meet you where you are. For AI that's grounded.☆54Feb 6, 2026Updated last month
- A modular framework for building massively parallel agentic systems☆30Sep 8, 2025Updated 6 months ago
- Magface Triton Inferece Server Using Tensorrt☆18Feb 12, 2022Updated 4 years ago
- LLM FX: A LLM Server Desktop Client free for everyone!☆37Updated this week
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Oct 9, 2024Updated last year
- minimal C implementation of speculative decoding based on llama2.c☆28Jul 15, 2024Updated last year
- ☆22Aug 9, 2024Updated last year
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Oct 24, 2024Updated last year
- Cross platform GitHub Action to upload multiple assets to a release using Golang☆12Feb 6, 2026Updated last month
- Loader extension for tabbyAPI in SillyTavern☆26Jun 30, 2025Updated 8 months ago
- Network for procedural editing of text with LLMs☆23Dec 6, 2025Updated 3 months ago
- ☆26May 31, 2024Updated last year
- ☆23Sep 27, 2024Updated last year
- An Open-Source Modular AI Assistant☆32Mar 20, 2025Updated 11 months ago
- Easily create LLM automation/agent workflows☆60Feb 13, 2024Updated 2 years ago
- Local drive deep search.☆33Jun 4, 2025Updated 9 months ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆12Jun 28, 2022Updated 3 years ago
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model☆13Sep 25, 2024Updated last year
- Run LLaMA inference on CPU, with Rust 🦀🚀🦙☆35Jan 5, 2025Updated last year
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Feb 11, 2026Updated 3 weeks ago
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆56Feb 24, 2026Updated 2 weeks ago
- Mistral7B playing DOOM☆29Mar 27, 2024Updated last year
- GoalChain for goal-orientated LLM conversation flows☆71Dec 2, 2024Updated last year
- ☆30Oct 4, 2024Updated last year
- A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.☆27Mar 8, 2025Updated last year
- A live multiplayer trivia game where users can bid for the subject of the next question☆29Jan 9, 2026Updated 2 months ago
- Crow is a Desktop AI Assistant☆32Aug 9, 2024Updated last year